- Find Communities — algorithm for uncovering communities in networks of millions of nodes, for producing identifiable subgroups as in LinkedIn InMaps. (via Matt Biddulph’s Delicious links)
- Seven Ways to Think Like The Web (Jon Udell) — seven principles that will head off a lot of mistakes. They should be seared into the mind of anyone working on the web. Principle 2, for example: Pass by reference rather than by value. [pass URLs, not copies of data] […] Why? Nobody else cares about your data as much as you do. If other people and other systems source your data from a canonical URL that you advertise and control, then they will always get data that’s as timely and accurate as you care to make it.
- Interview with Marco Arment (Rands in Repose) — Most people assume that online readers primarily view a small number of big-name sites. Nearly everyone who guesses at Instapaper’s top-saved-domain list and its proportions is wrong. The most-saved site is usually The New York Times, The Guardian, or another major traditional newspaper. But it’s only about 2% of all saved articles. The top 10 saved domains are only about 11% of saved articles. (via Courtney Johnston’s Instapaper Feed)
"long tail" entries
Long Tail, Copyright vs Preservation, Diminished Reality, and Augmented Data
- Mechanical Turk Requester Activity: The Insignificance of the Long Tail — For Wikipedia we have the 1% rule, where 1% of the contributors (this is 0.003% of the users) contribute two thirds of the content. In the Causes application on Facebook, there are 25 million users, but only 1% of them contribute a donation. […] The lognormal distribution of activity also shows that requesters increase their participation exponentially over time: They post a few tasks, they get the results. If the results are good, they increase the size of the tasks that they post next time by a percentage. This multiplicative behavior is the basic process that generates the lognormal distribution of activity.
- Copyright Destroying Historic Audio — so says the Library of Congress. Were copyright law followed to the letter, little audio preservation would be undertaken. Were the law strictly enforced, it would brand virtually all audio preservation as illegal. Copyright laws related to preservation are neither strictly followed nor strictly enforced. Consequently, some audio preservation is conducted.
- Diminished Reality (Ray Kurzweil) — removes objects from video in real time. Great name, “diminished reality”. (via Andy Baio)
- Data Enrichment Service — using linked government data to augment text with annotations and links. (via Jo Walsh on Twitter)
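The multiplicative process described in the Mechanical Turk item above can be illustrated with a small simulation. This is only a sketch under assumptions that are not in the linked post: each hypothetical requester starts at 10 tasks and changes batch size by a random percentage between -20% and +30% per round, for 50 rounds. The names `simulate_requester` and `skewness` are made up for this illustration. The idea is that multiplying by random factors makes the logarithm of the size a sum of random terms, so the logs look roughly normal while the raw sizes are heavily skewed.

```python
import math
import random
import statistics


def simulate_requester(rounds, rng, start=10.0, low=-0.2, high=0.3):
    """Return the final batch size after repeated percentage changes.

    All parameters are illustrative assumptions, not measured values.
    """
    size = start
    for _ in range(rounds):
        # Multiplicative step: grow or shrink by a random percentage.
        size *= 1.0 + rng.uniform(low, high)
    return size


def skewness(values):
    """Population skewness: about 0 for a symmetric distribution."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)


rng = random.Random(42)  # fixed seed so the run is repeatable
sizes = [simulate_requester(50, rng) for _ in range(5000)]
log_sizes = [math.log(s) for s in sizes]

raw_skew = skewness(sizes)      # strongly positive: a few huge requesters
log_skew = skewness(log_sizes)  # close to zero: logs are roughly normal

print(f"mean size:   {statistics.fmean(sizes):.1f}")
print(f"median size: {statistics.median(sizes):.1f}")
print(f"skew of sizes:     {raw_skew:.2f}")
print(f"skew of log sizes: {log_skew:.2f}")
```

The mean landing well above the median is the signature of the pattern the post describes: a handful of requesters account for a large share of the activity.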
In an earlier post, I examined the average price of the Top 100 PAID apps and noted that the relationship between price and popularity was somewhat dependent on the category. In the Book category, however, I concluded that the Top 10 PAID apps were on average cheaper than those ranked 91-100. What if we examine all Book apps: will the long tail apps be pricier?
A city that has totally open, unrestricted access to data and partnerships with business has the best chance of becoming the healthy, "long tail" city of the future, with head, "meaty middle" and tail all working together, all reinforcing each other, all driving each other forward. That's the future of cities. It might be time to ask yourself: how long is your city's tail shaping up to be? The answer may determine, to a large degree, whether your city is a thriving place to live in the decades to come.
Back in 2004, when I spent most of my time doing format conversions and production automation, I had the privilege of turning much of what I learned doing things like batch running Word macros from the DOS command line with Ruby into a book, Word Hacks. Like our other Hacks books, it's a lesson in the value of curation and…
Last week we released 16 of our books as iPhone Apps (and on Saturday added The Twitter Book), and there's some interesting Long Tail data coming in. We've seen Long Tail behavior in the data from Safari Books Online and from Google Book Search, though in this case it's about geography: even though regions like Colombia, Belgium, and Greece are…
Is print on demand best suited for short print runs and Long Tail plays, or can it be used for more? Come share your thoughts on POD.
Borders is using its limited shelf space to give top selling books the "face-out" treatment. Poor sellers need not apply.