- Hadoop Hits 1.0 — open source distributed computation engine, heavily used in big data analysis, hits 1.0.
- Sparse and Low-Rank Approximation Wiki — interesting technique: instead of sampling at 2x the rate you need to discriminate then compressing to trade noise for space, use these sampling algorithms to (intelligently) noisily sample at the lower bit rate to begin with. Promises interesting applications particularly in for sensors (e.g., the Rice single pixel camera). (via siah)
- Rise of Printer Malware — firmware attacks embedded in printed documents. Another reminder that not only is it hard to write safe software, your mistakes can be epically bad. (via Cory Doctorow)
- Electric Circuits and Transistors Made From Cotton — To make it conductive, the researchers coated cotton threads in a variety of other materials. To make conductive “wires,” the team coated the threads with gold nanoparticles, and then a conductive polymer. To turn a cotton wire into a semiconductor, it was dipped in another polymer, and then a further glycol coating to make it waterproof. Neat materials hack that might lend a new twist to wearables.
ENTRIES TAGGED "Hadoop"
Doug Cutting on Hadoop's rise and why he's surprised at its growth.
Doug Cutting discusses Hadoop's current and near-term role, and the factors that made it a central part of data processing.
Where to store all that genome data? Also, clarifying the work of digital humanities scholars.
We take a look at the big data obstacles and opportunities for genomics, digital humanities scholars respond to Stanley Fish's mischaracterization of what they do with data, and Hadoop World and the Strata Conference merge.
Hadoop is a central part of Microsoft's data strategy.
Strata conference chair Edd Dumbill takes a look at Microsoft's plans for big data. By embracing Hadoop, the company aims to keep Windows and Azure as a standards-friendly option for data developers.
A survey of the Hadoop big data marketplace.
In this survey, Edd Dumbill explores the Hadoop-based big data solutions available on the market, contrasts the approaches of EMC Greenplum, IBM, Microsoft and Oracle and provides an overview of Hadoop distributions.
A proposal for a .data TLD, flavors of Hadoop, and a vote for pseudonymous commenters.
In this week's data news, Stephen Wolfram calls for a .data top-level domain and Cloudera responds to Hadoop version 1.0.
Dynamic pricing angers some Uber users, Hadoop hits 1.0, a possible set back for open-access research.
Uber's dynamic pricing worked as intended on New Year's Eve, but not everyone is happy about that. Elsewhere, Hadoop reaches the 1.0 milestone and proposed legislation seeks to repeal an open-access research policy.