Pharmacovigilance — Signals from The Crowd (PDF) — in the NY Times’ words: Using automated software tools to examine queries by 6 million Internet users taken from Web search logs in 2010, the researchers looked for searches relating to an antidepressant, paroxetine, and a cholestorol lowering drug, pravastatin. They were able to find evidence that the combination of the two drugs caused high blood sugar. (via New York Times)
A New Basis for Copyright — NZ’s most technically-literate judge floats an idea for how copyright might be reimagined in a more useful way for the modern age by considering it in terms of human rights. Perhaps there should be consideration of a new copyright model that recognises content user rights against a backdrop of the right to receive and impart information and a truly balanced approach to information and expression that recognises that ideas expressed are building blocks for new ideas. Underpinning this must be a recognition on the part of content owners that the properties of new technologies dictate our responses, our behaviours, our values and our ways of thinking. These should not be seen as a threat but an opportunity. It cannot be a one-way street with traffic heading only in the direction dictated by content owners.
Everything We Know About Obama’s Big Data Operation (Pro Publica) — “White suburban women? They’re not all the same. The Latino community is very diverse with very different interests,” Dan Wagner, the campaign’s chief analytics officer, told The Los Angeles Times. “What the data permits you to do is figure out that diversity.”
cube (GitHub) — time-series data collection and analysis. Cube lets you compute aggregate statistics post hoc. It also enables richer analysis, such as quantiles and histograms of arbitrary event sets. Cube is built on MongoDB and available under the Apache License on GitHub.
1M Robots to Replace 1M Human Jobs at Foxconn (Singularity Hub) — Foxconn plant opening, making manufacturing robots, and they appear to be dogfooding by using them in other plants. $25k each, 10k+ made, and fits into the pattern: the number of operational robots in China increased by 42 percent from 2010 to 2011.
Mobile Overtaking Web — provocatively packaged extrapolations of ComScore and similar numbers to conclude that Americans spend more time interacting with mobile apps than with web sites. I’m sure you could beat an iPhone developer to death with the error bars.
Ghostery — a browser plugin to block trackers, web bugs, dodgy scripts, ads, and anything else you care to remove from your browsing experience. It looks like a very well done adblocker, but it’s done (a) closed-source and (b) for-profit. Blocking trackers is something every browser *should* do, but because browser makers make (or hope to make) money from ads, they don’t. In theory, Mozilla should do it. Even if they were to take up the mantle, though, they’re unlikely to make anything for IE or Chrome. So it’s in the hands of companies with inarticulate business models. (via Andy Baio)
Perspectives — Firefox plugin that lets you know when you’ve encountered an SSL certificate that’s different from the ones that other Perspectives users see (e.g., you’re being man-in-the-middled by Iran). (via Francois Marier)
Always Connected — “I’ve got a full day of staring at glowing rectangles ahead of me! Better get started …”. I have made mornings and evenings backlight-free zones in an effort to carve out some of the day free of glowing rectangles. (I do still read myself to sleep on the Kindle, though, but it’s not backlit)
Is Teaching MapReduce Healthy for Students? — Google’s narrow MapReduce API conflates logical semantics (define a function over all items in a collection) with an expensive physical implementation (utilize a parallel barrier). As it happens, many common cluster-wide operations over a collection of items do not require a barrier even though they may require all-to-all communication. But there’s no way to tell the API whether a particular Reduce method has that property, so the runtime always does the most expensive thing imaginable in distributed coordination: global synchronization. Detailed and interesting criticism of whether Hadoop is the BASIC of parallel tools. (via Pete Warden)
Implicit and Explicit Feedback — for preferences and recommendations, implicit signals (what people clicked on and actually listened to) turn out to be strongly correlated with what they would say if you asked. (via Greg Linden)
Twitter Storm (GitHub) — distributed realtime computation system, intended for realtime what Hadoop is to batch processing. Interesting because you improve most reporting and control systems when you move them closer to real-time. Eclipse-licensed open source.