ENTRIES TAGGED "Hadoop"
A look at the components and functions of the Hadoop ecosystem.
Apache Hadoop has been the driving force behind the growth of the big data industry. But what does it do, and why do you need all its strangely-named friends, such as Oozie, Zookeeper and Flume?
Doug Cutting on Hadoop's rise and why he's surprised at its growth.
Doug Cutting discusses Hadoop's current and near-term role, and the factors that made it a central part of data processing.
Where to store all that genome data? Also, clarifying the work of digital humanities scholars.
We take a look at the big data obstacles and opportunities for genomics, digital humanities scholars respond to Stanley Fish's mischaracterization of what they do with data, and Hadoop World and the Strata Conference merge.
Hadoop is a central part of Microsoft's data strategy.
Strata conference chair Edd Dumbill takes a look at Microsoft's plans for big data. By embracing Hadoop, the company aims to keep Windows and Azure as a standards-friendly option for data developers.
A survey of the Hadoop big data marketplace.
In this survey, Edd Dumbill explores the Hadoop-based big data solutions available on the market, contrasts the approaches of EMC Greenplum, IBM, Microsoft and Oracle and provides an overview of Hadoop distributions.
A proposal for a .data TLD, flavors of Hadoop, and a vote for pseudonymous commenters.
In this week's data news, Stephen Wolfram calls for a .data top-level domain and Cloudera responds to Hadoop version 1.0.
Dynamic pricing angers some Uber users, Hadoop hits 1.0, a possible set back for open-access research.
Uber's dynamic pricing worked as intended on New Year's Eve, but not everyone is happy about that. Elsewhere, Hadoop reaches the 1.0 milestone and proposed legislation seeks to repeal an open-access research policy.