ENTRIES TAGGED "Twitter"
A new look at Yahoo's traffic, the challenge of scaling Tumblr, and a host of visualization guidelines.
In this week's data news: Yahoo visualizes its front page traffic and demographics, why Tumblr is tougher to scale than Twitter, and a look at what you need to consider as you build visualizations.
A huge visualization captures tweets from the SOPA protest.
This week's visualization comes from Fred Benenson, who ranked and mapped tweets related to the SOPA protest.
Prank or mistake? A QR code on a TSA poster links to a non-TSA site.
Fred Trotter discovers that a QR code embedded in a TSA poster at the Orlando airport links to justinsomnia.org, which is about as far as you can get from a government website.
Maximum MySQL, Digital News, Unbiased Mining, and Congressional Clue
- How Twitter Stores 250M Tweets a Day Using MySQL (High Scalability) — notes from a talk at the MySQL conference on how Twitter built a high-volume MySQL store.
- How The Atlantic Got Profitable With Digital First (Mashable) — Lauf says his team has focused on putting together premium advertising experiences that span print, digital, events and (increasingly) mobile.
- Data Mining Without Prejudice — an attempt to measure fit without pre-favouring one type of curve over another.
- It Is No Longer OK Not To Know How Congress Works (Clay Johnson) — looking for a specific innovation to try and change the way Washington works by the time Congress votes on SOPA is about as foolish as Steve Jobs trying to diet his way out of having pancreatic cancer.
PHP Virtual Machine, Archive Your Tweets Easily, Prioritize Your Links, VC Memes
- The HipHop Virtual Machine (Facebook) — inside the new virtual machine for PHP from Facebook.
- PHP Fog’s Free ThinkUp Hosting (Expert Labs) — ThinkUp archives your tweets and other social media activity for you to search, visualize, and analyze. PHP Fog hosts PHP apps scalably, and I’m delighted to be an advisor. Andy’s made a video showing how to get up and running with ThinkUp in three minutes. (This is impressive given how long I squinted at ThinkUp and swore while trying to get it going on my colo box just a year ago.)
- The Secret Lives of Links (Luke Wroblewski) — notes on a talk by Jared Spool. On the Walgreens site, 21% of people go to photos, 16% go to search, 11% go to prescriptions, 6% go to the pharmacy link, 5% go to find stores. Total traffic is 59% for these five links. The total amount of page used for these five links is ~4% of page space. The most important stuff on the page occupies less than 1/20th of the page. This violates Fitts’s Law. Makes me think of the motor and sensory homunculi.
- VC Memes — the success kid is my favourite, I think.
EMC study looks at the state of data science, Carrier IQ and big data, and the welcome return of old tweets.
In this week's data news: EMC's new data science study predicts a data scientist shortage, why Carrier IQ is part of a "bizarre big-data triangle," and DataSift will soon offer access to an archive of old tweets.
Hedonometrics and Twitter, Pricing Experiments, Crowdsourcing App Dev, and Flashcard Library
- Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter (PLoS ONE) — Tweets involving the ‘fake news’ comedian Stephen Colbert are both happier and of a higher information level than those concerning his senior colleague Jon Stewart. By contrast, tweets mentioning Glenn Beck are lower in happiness than both Colbert and Stewart but comparable to Colbert in information content.
- Pricing Experiments You Can Learn From — revealing the data from experiments which showed how to drive people towards higher prices.
- 10 Things I Learned at CrowdConf 2011 (Crowdflower) — Using his newly released crowdsourcing platform Coffee & Power, Philip [Rosedale] developed his entire company infrastructure and platform through a globally distributed workforce. 288 contributors in 127 locations worked together to get this startup off the ground in a whole new way. The Coffee & Power platform was built in 1,700 commits ranging from $6 quality checks all the way up to full source-code editing. One element of this process was developing the Hudat iPhone app. In less than a month for $2,485, the Coffee & Power community got this mobile app up and running.
- Andi — AGPL3-licensed spaced repetition flashcard system. (via Jack Kinsella)
Info overload vs. consumption, how big data is shaping business, and why we need the "paperless book."
This week on O'Reilly: Author Clay Johnson explained why information consumption, not overload, is what needs to be managed. Also, Alistair Croll looked at the relationship between business intelligence and big data, and Todd Sattersten made a case for the paperless book.