Fri

Apr 15
2005

Nat Torkington

Nat Torkington

Amazon adds concordances and text stats

Amazon has added text concordances and statistics to all the books it has full text for. This is very cool! Not only do they have the statistically improbable phrases, which work well to tag the book, but they also have reading level, most common words, and even words-per-dollar and words-per-pound.

It's great to see the wonks at Amazon going nuts with their data. I'm amazed that Google, with its vaunted philia for CS degrees, didn't do it first with the Google Print corpus. Either way, we're still in the early days--the information is very much a "by the way" side page, and not a primary navigation tool the way "best reviewed", "bestselling", and so on are sort options when viewing book search results.

In the near future I can easily imagine the "tags" (aka statistically improbable phrases) feeding into the category hierarchy and search results. If I search for "human neocortex", I should get On Intelligence as a result because that book has "human neocortex" as a SIP.

--Nat


tags:   | comments: 3   | Sphere It
submit:

 
Previous  |  Next

0 TrackBacks

TrackBack URL for this entry: http://blogs.oreilly.com/cgi-bin/mt/mt-t.cgi/3962

Comments: 3

  Rael Dornfest [04.25.05 02:28 PM]

A random dip finds that an ounce is worth around 3000 words ;-).

  RIck Beyer [04.28.05 06:06 AM]

I had some fun comparing various books, including my own, and posted the results. FYI, they should be selling Tolstoy's War and Peace at Walmart...50 thousand words to the dollar

  Donald [03.20.06 04:02 PM]

Can anybody tell me how to access this beast.

Thanks.

Post A Comment:

 (please be patient, comments may take awhile to post)






Type the characters you see in the picture above.