- Rules for Revolutionaries — Carl Malamud’s talk to the WWW2010 Conference. Video, slides, and text available.
- Self-Improving Bayesian Sentiment Analysis for Twitter — a how-I-did-it for a homegrown project to do sentiment analysis on Twitter.
- LUXR — the Lean User Experience Residency program. LUXr brings user experience and design services to early stage teams in a lower cost, more efficient way than traditional project-based consulting. The latest from Adaptive Path’s Janice Fraser.
- My Top Ten Assertions About Data Warehouses (CACM) — Michael Stonebraker’s take on the data warehouse world, and his predictions cut across a lot of our O’Reilly trends. Assertion 5: “No knobs” is the only thing that makes any sense. It is pretty clear that human operational costs dominate the cost of running a data warehouse. […] Almost all DBMSs have 100 or more complicated tuning “knobs.” This requires DBAs to be “4-star wizards” and drives up operating costs. Obviously, the only thing that makes sense is to have a program that adjusts these knobs automatically. In other words, look for “no knobs” as the only way to cut down DBA costs. (via mikeolson on Twitter)
"sentiment analysis" entries
Built for emergencies, now available as open source and as a web service
Built for emergencies, Usahidi's mapping and social media monitoring tools also have commercial applications. Though open source, the tools are also available as for-pay hosted services.
The online encyclopedia is a great resource for data scientists
Wikipedia is an essential tool in the data scientist's armory. Today's Strata Gem shows how it can be used to help computers distinguish between different sense of common words.