- Sibyl: Google’s System for Large Scale Machine Learning (YouTube) — keynote at DSN2014 acting as an intro to Sibyl. (via KD Nuggets)
- Bitrot from 1997 — That’s 205 failures, an actual link rot figure of 91%, not 57%. That leaves only 21 URLs as 200 OK and containing effectively the same content.
- What We Do And Don’t Know About Software Effort Estimation — nice rundown of research in the field.
- fabric — simple yet powerful ssh library for Python.
Jer Thorp visualizes the history of "The Avengers."
In this week's visualization, The New York Times' data artist Jer Thorp visualizes the appearances of "The Avengers" in the comic book series.
A look at the historical accuracy of "Downton Abbey's" language.
Ben Schmidt ran the script of the "Downton Abbey" season two finale through Google Ngrams to see how the show's language matches up with history.
Transaction costs, crowdsourcing, and the persuasiveness of data were all in play long ago.
Examples from the Victorian era show that if we're going to improve the world with data, it's absolutely essential we stay grounded in reality.
Yelp's MRJob goes open source, CODATA catalogues historical data, one more telecom lawsuit, and Expedia cleans up their UI
In the latest Strata Week: Yelp makes MRJob open source; CODATA wants to inventory "threatened data"; a visualization untangles a telecom lawsuit; and analytics and a simple fix nets $12 million for Expedia.