"search" entries

Four short links: 1 December 2011

Four short links: 1 December 2011

DRM Good for Amazon, Arduino Updated, Open Source Foundations, Distributed Search

  1. Cutting Their Own Throats (Charlie Stross) — DRM on ebooks gives Amazon a great tool for locking ebook customers into the Kindle platform. This essay is gold and so very true. Read, believe.
  2. v1.0 of Arduino Out — this is the dev environment, with language additions and lots of features in the libraries. Glad to see the 1.0 stamp put on this important piece of the homebrew hardware world.
  3. Koha and Why We Need Foundations — Simon Phipps looks into the Koha trademark dispute and says that it shows why open source needs foundations (collective IP ownership).
  4. Majestic-12a World Wide Web search engine based on concepts of distributing workload in a similar fashion achieved by successful projects such as SETI@home and distributed.net.
Four short links: 8 November 2011

Four short links: 8 November 2011

Cell Operating System, Search Savvy, Smiling Sliders, and Recommendation Tools

  1. Attempts to Make a Cell Operating System (Science Daily) — finally we will be able to have the guaranteed quality of software and the safety of biological organisms.
  2. Why Kids Can’t Search (Clive Thompson) — kids need to be taught critical thinking skills about what they find on the web. Librarians are our national leaders in this fight; they’re the main ones trying to teach search skills to kids today.
  3. Smiley Slider — cute little way to get feedback. (via Jyri Tuulos)
  4. LensKitan open source toolkit for building, researching, and studying recommender systems.
Four short links: 25 October 2011

Four short links: 25 October 2011

Smart Thermostat, Lamer News, Expensive Meaning, and Hardware Kits

  1. Nest Learning Thermostat — learns how long it takes your house to adjust temperature, so can tell you not just “it’s 55 now” but “it’ll be 65 in 16 minutes”. Looks gorgeous as well as being a good example of embedded intelligence. Data really does make everything better.
  2. lamernews (Github) — an implementation of a Reddit / Hacker News style news web site written using Ruby, Sinatra, Redis and jQuery.
  3. Information is Cheap, Meaning is Expensive — interview with George Dyson. That quote is a wonderful summary of why data is important. But George also says: The danger is not that machines are advancing. The danger is that we are losing our intelligence if we rely on computers instead of our own minds. On a fundamental level, we have to ask ourselves: Do we need human intelligence? And what happens if we fail to exercise it? (via Mathew Ingram)
  4. Cubelets, Littlebits, and Others (Russell Davies) — he’s been playing with some sweet hardware kits. It’s not new and surprising behaviour in a toy and it’s not unbuildable with Lego or Mecanno. But there’s something different and good about being able to do it so quickly, roughly and spontaneously – throwing bits together and getting behaviour out. Not following instructions or typing laboriously. That ease makes it magical and educational – you start to understand the functions of things as a builder not a thinker. (Slightly, you know, slightly – at a lego level, not at a 5-year engineering degree level, but it’s a start.)
Four short links: 18 October 2011

Four short links: 18 October 2011

Search Education, Classic Source, Analyzing Encrypted VoIP, and SQL Injection

  1. Web Search Education (Google) — lesson plans and materials for teaching people how to use search, from operators to critically evaluating sites. This latter area is the weakest: when I teach innocents about the web, I show them organic vs paid results, discuss why people advertise, how people pay for their sites, noticing domain names and organizations, etc. I wonder how much of the weakness of Google’s materials is due to their business model.
  2. Metroid Source Code — reverse-engineered source code from the original classic Metroid. (via Hacker News)
  3. Speaker Recognition From Encrypted VoIP Communications (PDF) — speaker identification, even one encrypted VoIP communications, is 70-75% among a pool of 10 candidates. Impressive. (via Bruce Schneier)
  4. SQL Injection Cheat Sheet — rundown of the different techniques for doing SQL injection. (via GaĆ«tan De Brucker)
Four short links: 15 September 2011

Four short links: 15 September 2011

DOSBox in Javascript, Augmenting Humans, Energy-efficient Computation, and Searchable Text

  1. Javascript DOSBox — first cut at a DOS emulator in Javascript, capable of running Doom. As the author said in email to me, The ability to run arbitrary x86 code across platforms without a plugin is kinda cool.
  2. Blending Machines and Humans to Get Very High Accuracy (Greg Linden) — use experts to train the models, provide tools for experts to correct mistakes in the classifiers, and constantly evaluate all aspects of the system. This augmentation of human ability with computers lets us tackle problems that can’t be solved by computers alone.
  3. Electrical Efficiency of Computation (The Atlantic) — If a MacBook Air were as efficient as a 1991 computer, the battery would last 2.5 seconds. Cites research concluding that computations per kWh have doubled every 1.6 years since the 1940s. (via Hacker News)
  4. recoll — open source tool to make searchable the text buried in your computer (whether in zip files, mail attachments, whatever). (via One Thing Well)

Why an ebook still needs an index

An index in an ebook offers a level of discovery search can't touch.

Why should digital publishers invest in index creation? Because ebooks that give readers efficient ways to access what they need are ebooks that will sell.

When was the last time you mined your site's search data?

Lou Rosenfeld on the benefits of parsing and refining site search.

A gold mine is hiding in the data generated by website search engines, yet many site owners pay little attention to the analytics those engines yield. Author Lou Rosenfeld explains why site search is worth your time.

Searching in ebooks: A unique use case that requires a unique approach

Ereader search tools need to limit disruption and incorporate web search best practices.

The current crop of ereaders handle ebook searching in a variety of ways — some are useful and creative, some aren’t. Here, Pete Meyers looks at the state of ebook search and how it can be improved.

Four short links: 26 July 2011

Four short links: 26 July 2011

Advertising Keywords, Javascript Koans, Etsy Open Source Testing, Wieldy Selections

  1. Google Keyword Advertising — interesting infographic about the most lucrative advertising categories for Google. #20 is an eye-opener!
  2. Javascript Koans (GitHub) — an interactive learning environment that uses failing tests to introduce students to aspects of JavaScript in a logical sequence. (via Javascript Weekly)
  3. Etsy AB (GitHub) — Etsy’s framework for A/B testing, feature ramp up, and more. (via Randy J. Hunt)
  4. Chosen (GitHub) — a JavaScript plugin that makes long select boxes more wieldy. (via Steve Losh)
Four short links: 1 June 2011

Four short links: 1 June 2011

Fair Use, Equation UI, Startup Numbers, and Data Search Engine

  1. Putting Fair Use Forward (Chronicle of Higher Education) — lawyer and academic collaborating on guidelines for academic fair use, intended to remove the chilling effect of the fear of being sued. Great quotes: People deal with fuzzy laws all the time, she argues. “Obscenity is impossible to define, and yet people have some idea of when they’re committing an obscenity or not. You could walk through your life being haunted by the specter of litigation in every aspect of it. But people don’t usually do this in their other free-speech rights.” (via David Adler)
  2. Scrubbing Calculator — clever UI for solving equations without needing to know how to solve equations. Imminent death of mathematics skill in the US predicted, film at 11. (via Dan Meyer)
  3. Startup Genome — a report, written from research into 650 startups. Investors who provide hands-on help have little or no effect on the company’s operational performance. But the right mentors significantly influence a company’s performance and ability to raise money. (However, this does not mean that investors don’t have a significant effect on valuations and M&A) Balanced teams with one technical founder and one business founder raise 30% more money, have 2.9x more user growth and are 19% less likely to scale prematurely than technical or business-heavy founding teams.
  4. Zanran — search engine for graphs, charts, and data. (via Pia Waugh)