Edd Dumbill
Strata Gems: Write your own visualizations
The Processing language is an easy way to get started with graphics
Visualization is a powerful way to turn data into a story. But if you're not a "graphics person", where do you start?
Strata Gems: Use Wikipedia as training data
The online encyclopedia is a great resource for data scientists
Wikipedia is an essential tool in the data scientist's armory. Today's Strata Gem shows how it can be used to help computers distinguish between different senses of common words.
Strata Gems: Try MongoDB without installing anything
An easy way to get started with a NoSQL database
Want to dip your toes into the world of NoSQL databases? In the first of our Strata Gems series, find out how to explore MongoDB through your web browser.
Strata Week: Life, by the numbers
Mapping 311 data, the social effects of traffic, Google Maps and border disputes, and a natural language processing challenge.
This issue of Strata Week follows the path of data through cities, streets and border conflicts. We conclude our journey with a little brain work, as a programming challenge is announced to automatically identify topics and trends in Twitter and Facebook updates.
Strata Week: Building data startups
Strata registration opens, making money with data, dolphins and cellphones, data in the dirt
In this week's look at the world of data, learn how to build a money-making data startup, register for Strata 2011, and hear of new developments in the mining of offline social networks.
Strata Week: Army anomalies
Distributed video editing, big data tool updates, Riak continues to mature.
In this edition of Strata Week: the Army turns to big data to sniff out internal threats; CouchDB helps with collaborative video editing; Riak adds full-text searching; and a look at notable Hadoop World announcements.
Strata Week: Behind LinkedIn Signal
Life-size visualizations, how Hadoop is used, SciDB has its first release
In this edition of Strata Week: the open source technology behind LinkedIn Signal; Julia Grace on visualization; Hadoop usage survey results; and the first release of the SciDB project.
Sarah Novotny joins OSCON for 2011
OSCON is returning to Portland in July 2011.
The O'Reilly Open Source Convention will be returning to Portland, Oregon, July 25-29, 2011, with program chairs Edd Dumbill and Sarah Novotny.
The SMAQ stack for big data
Storage, MapReduce and Query are ushering in data-driven products and services.
We're at the beginning of a revolution in data-driven products and services, driven by a software stack that enables big data processing on commodity hardware. Learn about the SMAQ stack, and where today's big data tools fit in.
Strata Week: The challenge of real-time analytics
Blue is the color, getting help with email overload.
In the latest edition of Strata Week: Google's introduction of a new search-indexing system highlights an important limitation of MapReduce and Hadoop. Can MapReduce adapt to real-time needs or will others follow Google in creating new architectures for real-time analytics?