ENTRIES TAGGED "Hadoop"

What is Apache Hadoop?

What is Apache Hadoop?

A look at the components and functions of the Hadoop ecosystem.

Apache Hadoop has been the driving force behind the growth of the big data industry. But what does it do, and why do you need all its strangely-named friends, such as Oozie, Zookeeper and Flume?

Read Full Post | Comment |

Why Hadoop caught on

Doug Cutting on Hadoop's rise and why he's surprised at its growth.

Doug Cutting discusses Hadoop's current and near-term role, and the factors that made it a central part of data processing.

Read Full Post | Comments Off |
Top stories: January 23-27, 2012

Top stories: January 23-27, 2012

Finding the real pirates, Microsoft's plan for Hadoop and big data, and thoughts on a theoretical Amazon store.

This week on O'Reilly: Mike Loukides offered a different take on the piracy debates, Edd Dumbill looked at Microsoft's Hadoop-driven plan for big data, and we learned why Amazon retail stores aren't out of the question.

Read Full Post | Comment |
Strata Week: Genome research kicks up a lot of data

Strata Week: Genome research kicks up a lot of data

Where to store all that genome data? Also, clarifying the work of digital humanities scholars.

We take a look at the big data obstacles and opportunities for genomics, digital humanities scholars respond to Stanley Fish's mischaracterization of what they do with data, and Hadoop World and the Strata Conference merge.

Read Full Post | Comment: 1 |
Strata Newsletter: January 25, 2012

Strata Newsletter: January 25, 2012

Hadoop World and Strata merge. Microsoft's plan for big data.

Highlights from the 1/25/12 edition of the Strata newsletter include: Hadoop World and the Strata Conference are merging, and a look at how Microsoft is putting Hadoop to work.

Read Full Post | Comment |
Microsoft's plan for Hadoop and big data

Microsoft's plan for Hadoop and big data

Hadoop is a central part of Microsoft's data strategy.

Strata conference chair Edd Dumbill takes a look at Microsoft's plans for big data. By embracing Hadoop, the company aims to keep Windows and Azure as a standards-friendly option for data developers.

Read Full Post | Comment |
Top stories: January 16-20, 2012

Top stories: January 16-20, 2012

The perils of SOPA and PIPA, a survey of Hadoop products, and common mobile UI mistakes.

This week on O'Reilly: Tim O'Reilly explained why SOPA and PIPA are bad for the Internet and bad for business, Edd Dumbill offered an extensive survey of Hadoop solutions, and we looked at the most common mobile interface mistakes.

Read Full Post | Comment |
Big data market survey: Hadoop solutions

Big data market survey: Hadoop solutions

A survey of the Hadoop big data marketplace.

In this survey, Edd Dumbill explores the Hadoop-based big data solutions available on the market, contrasts the approaches of EMC Greenplum, IBM, Microsoft and Oracle and provides an overview of Hadoop distributions.

Read Full Post | Comments: 2 |
Strata Week: A .data TLD?

Strata Week: A .data TLD?

A proposal for a .data TLD, flavors of Hadoop, and a vote for pseudonymous commenters.

In this week's data news, Stephen Wolfram calls for a .data top-level domain and Cloudera responds to Hadoop version 1.0.

Read Full Post | Comments: 2 |
Strata Week: Unfortunately for some, Uber's dynamic pricing worked

Strata Week: Unfortunately for some, Uber's dynamic pricing worked

Dynamic pricing angers some Uber users, Hadoop hits 1.0, a possible set back for open-access research.

Uber's dynamic pricing worked as intended on New Year's Eve, but not everyone is happy about that. Elsewhere, Hadoop reaches the 1.0 milestone and proposed legislation seeks to repeal an open-access research policy.

Read Full Post | Comment |