- Hashbangs (Dan Webb) — why those terrible #! URLs are a bad idea. Looks like they’re going away with pushState coming to browsers. As Dan says, “URLs are forever”. Let’s get them right. I’m fascinated by how URLs are changing meaning and use over time.
- DNA Sequencing on a USB Stick — this has been going the rounds, but I think there’s a time coming when scientific data generation can be crowdsourced. I care about a particular type of fish, but it hasn’t been sequenced. Can I catch one, sequence it, upload the sequence, and get insight into the animal by automated detection of similar genes from other animals? Let those who care do the boring work, let scientists work on the analysis.
- The US Recording Industry is Stealing From Me (Bruce Simpson) — automated content detection at YouTube has created an industry of parasites who claim copyright infringement and then receive royalties from the ads shown on the allegedly infringing videos.
- Ubuntu on Android — carry a desktop in your pocket? Tempting. It’s for manufacturers, not something you install on existing handsets, which I’m sure will create tension with the open source world at Ubuntu’s heart. Then again, creating tension with the open source world at Ubuntu’s heart does seem to be Canonical’s core competency ….
ENTRIES TAGGED "crowdsourcing"
The second in a series looking at the major themes of this year's TOC conference.
Several overriding themes permeated this year’s Tools of Change for Publishing conference. The second in a series looking at five of the major themes, here we take a look at data in publishing — how publishers can benefit, practical applications, and innovative ways it can be used.
Hashbangs URLs Must Go, Cheap DNA Sequencing, Content Detection Fail, and Ubuntu on Android
VoIP Drupal is a window onto the promises and challenges faced by a new open source project, including its documentation. A meeting at at MIT this week worked out some long-term plans for firming up VoIP Drupal's documentation and other training materials.
Bryan Sivak looks for ways to change the status quo.
Maryland's first chief innovation officer, Bryan Sivak, is looking for the levers that will help state government to be smarter, not bigger. From embracing collective intelligence to data-driven policy, Sivak is defining what it means to be innovative in government.
Ben Huh on the responsibilities attached to other people's failures.
The content you see on Cheezburger, Inc.’s Fail Blog often mixes humor and pain — but not always in equal proportions. Cheezburger CEO Ben Huh discusses the boundaries of a fail.
Hedonometrics and Twitter, Pricing Experiments, Crowdsourcing App Dev, and Flashcard Library
- Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter (PLOSone) — Tweets involving the ‘fake news’ comedian Stephen Colbert are both happier and of a higher information level than those concerning his senior colleague Jon Stewart. By contrast, tweets mentioning Glenn Beck are lower in happiness than both Colbert and Stewart but comparable to Colbert in information content.
- Pricing Experiments You Can Learn From — revealing the data from experiments which showed how to drive people towards higher prices.
- 10 Things I Learned at CrowdConf 2011 (Crowdflower) — Using his newly released crowdsourcing platform Coffee & Power, Philip [Rosedale] developed his entire company infrastructure and platform through a globally distributed workforce. 288 contributors in 127 locations worked together to get this startup off the ground in a whole new way. The Coffee & Power platform was built in 1,700 commits ranging from $6 quality checks all the way up to full source-code editing. One element of this process was developing the Hudat iPhone app. In less than a month for $2,485, the Coffee & Power community got this mobile app up and running.
- Andi — AGPL3-licensed spaced repetition flashcard system. (via Jack Kinsella)
Free Service Isn't Sustainable, Big Data, Crowdsourced Historic Science, and Cognitive Biases
- Don’t Be a Free User (Maciej Ceglowski) — pay for your free services, else they’ll go away.
- Katta — Lucene for massive data sets in the cloud. (via Pete Warden)
- Old Weather — crowdsourced transcription of old nautical journals to yield historical information for climate researchers. (via National Digital Forum)
- Siddhartha Mukherjee Talks About Cancer (Guardian) — fascinating profile of the author of a “biography of cancer”. Touches on the cognitive biases we’re all prone to, and their damaging effects on patients. Mukherjee cites a study which found that women with breast cancer recalled eating a high-fat diet, whereas women without cancer did not. But the very same study had asked both sets of women about their diets long before any of them developed cancer, and the diet of those who now had breast cancer had been no more fatty than the rest (via Courtney Johnston)
Internet Asthma Care, C Fulltext, Citizen Science, and Mozilla
- Cost-Effectiveness of Internet-Based Self-Management Compared with Usual Care in Asthma (PLoSone) — Internet-based self-management of asthma can be as effective as current asthma care and costs are similar.
- Apache Lucy — full-text search engine library written in C and targeted at dynamic languages. It is a “loose C” port of Apache Lucene™, a search engine library for Java.
- The Near Future of Citizen Science (Fiona Romeo) — near future of science is all about honing the division of labour between professionals, amateurs and bots. See Bryce’s bionic software riff. (via Matt Jones)
- Microsoft’s Patent Claims Against Android (Groklaw) — behold, citizen, the formidable might of Microsoft’s patents and how they justify a royalty from every Android device equal to that which you would owe if you built a Windows Mobile device: These Microsoft patents can be divided into several basic categories: (1) the ’372 and ’780 patents relate to web browsers; (2) the ’551 and ’233 patents relate to electronic document annotation and highlighting; (3) the ’522 patent relates to resources provided by operating systems; (4) the ’517 and ’352 patents deal with compatibility with file names once employed by old, unused, and outmoded operating systems; (5) the ’536 and ’853 patents relate to simulating mouse inputs using non-mouse devices; and (6) the ’913 patent relates to storing input/output access factors in a shared data structure. A shabby display of patent menacing.
Panagiotis Ipeirotis on the vagaries of semantic analysis and Mechanical Turk's quirks.
In a recent interview, NYU Professor Panagiotis Ipeirotis explained why a "good" online review is often perceived negatively. He also discussed Mechanical Turk's growing pains.