"data tools" entries

When was the last time you mined your site's search data?

Lou Rosenfeld on the benefits of parsing and refining site search.

A gold mine is hiding in the data generated by website search engines, yet many site owners pay little attention to the analytics those engines yield. Author Lou Rosenfeld explains why site search is worth your time.

The Daily Dot wants to tell the web’s story with social data journalism

A new media startup tries to mine the social web for stories.

The newly launched Daily Dot is trying an experiment in community journalism, where the community is the Internet. To support their goal, they’re applying the lens of data journalism to the social web.

Strata Week: Twitter's coming Storm, data and maps from the London riots

Twitter plans to open source its Hadoop-like data processing tool, Storm.

This week's data news includes Twitter's plans to open-source its Hadoop-like data processing tool and some of the various mapping and real-time data efforts tracking the London riots.

Strata Week: When does data access become data theft?

Questions surround the Aaron Swartz case and Microsoft wants to help scholars with big data.

Aaron Swartz faces felony charges for downloading "big data" (more than 4 million academic journal articles) from the MIT library; Microsoft's new data tool is aimed at scholars; and David Eaves looks at open data efforts in Canada.

Top stories: July 4-8, 2011

Why Java matters, inside Node.js, predicting Android's ubiquity

This week on O'Reilly: We offered seven reasons why Java is worth your time, the utility of Node.js was duly noted, and Marko Gargenta offered three Android predictions that have nothing to do with mobile phones.

Get started with Hadoop: From evaluation to your first production cluster

Best practices for evaluating Hadoop and setting up an initial cluster (updated March 2012)

Focusing on the Hadoop Distributed File System (HDFS) and MapReduce, this in-depth piece — updated March 2012 — offers tips for organizations that are looking to evaluate Hadoop and deploy an initial cluster.

Strata Week: Data Without Borders

Work on data projects that matter, data journalism, and a social graph of the Marvel universe.

This week's big data news includes a call for Data Without Borders, the Knight Foundation's growing attention to data journalism, IBM's new big data appliance, and a social graph built around the Marvel universe.

Strata Week: Hadoop competition heats up

Vendors jockey for Hadoop positioning, Facebook visualizes PHP modules, Shaq's stats

Competition among Hadoop vendors heats up, Facebook visualizes its PHP code modules, and a Many Eyes tool visualizes the stats from Shaquille O'Neal's basketball career.