Edd Dumbill

Edd Dumbill is a principal analyst for O'Reilly Radar, and program chair for the O'Reilly Strata Conference and the O'Reilly Open Source Convention.

Strata Gems: Write your own visualizations

The Processing language is an easy way to get started with graphics

Visualization is a powerful way to turn data into a story. But if you're not a "graphics person", where do you start?

Strata Gems: Use Wikipedia as training data

The online encyclopedia is a great resource for data scientists

Wikipedia is an essential tool in the data scientist's armory. Today's Strata Gem shows how it can be used to help computers distinguish between different sense of common words.

Strata Gems: Try MongoDB without installing anything

An easy way to get started with a NoSQL database

Want to dip your toes into the world of NoSQL databases? In the first of our Strata Gems series, find out how explore MongoDB through your web browser.

Strata Week: Life, by the numbers

Mapping 311 data, the social effects of traffic, Google Maps and border disputes, and a natural language processing challenge.

This issue of Strata Week follows the path of data through cities, streets and border conflicts. We conclude our journey with a little brain work, as a programming challenge is announced to automatically identify topics and trends in Twitter and Facebook updates.

Strata Week: Building data startups

Strata registration opens, making money with data, dolphins and cellphones, data in the dirt

In this week's look at the world of data, learn how to build a money-making data startup, register for Strata 2011, and hear of new developments in the mining of offline social networks.

Strata Week: Army anomalies

Distributed video editing, big data tool updates, Riak continues to mature.

In this edition of Strata Week: the Army turns to big data to sniff out internal threats; CouchDB helps with collaborative video editing; Riak adds full-text searching; and a look at notable Hadoop World announcements.

Strata Week: Behind LinkedIn Signal

Life-size visualizations, how Hadoop is used, SciDB has its first release

In this edition of Strata Week: the open source technology behind LinkedIn Signal; Julia Grace on visualization; Hadoop usage survey results, and the first release of the SciDB project.

Sarah Novotny joins OSCON for 2011

OSCON is returning to Portland in July 2011.

The O'Reilly Open Source Convention will be returning to Portland, Oregon, July 25-29 2011, with program chairs Edd Dumbill and Sarah Novotny.

The SMAQ stack for big data

Storage, MapReduce and Query are ushering in data-driven products and services.

We're at the beginning of a revolution in data-driven products and services, driven by a software stack that enables big data processing on commodity hardware. Learn about the SMAQ stack, and where today's big data tools fit in.

Strata Week: The challenge of real-time analytics

Blue is the color, getting help with email overload.

In the latest edition of Strata Week: Google's introduction of a new search-indexing system highlights an important limitation of MapReduce and Hadoop. Can MapReduce adapt to real-time needs or will others follow Google in creating new architectures for real-time analytics?