"streams" entries

Pipelining and Real-time Analytics with MapReduce Online

Pipelining and Real-time Analytics with MapReduce Online

Some organizations create their own real-time analysis tools, while others turn to specialized solutions. In a previous post, I highlighted SQL-based real-time analytic tools that can handle large amounts of data. I noted that other big data management systems such as MPP databases and MapReduce/Hadoop were too batch-oriented to deliver analysis in near real-time. At least for MapReduce/Hadoop systems things may have changed slightly. A group of researchers from UC Berkeley and Yahoo recently modified MapReduce to allow for pipelining between operators.

Comments: 2
Big Data and Real-time Structured Data Analytics

Big Data and Real-time Structured Data Analytics

The emergence of sensors as sources of Big Data highlights the need for real-time analytic tools. Popular web apps like Twitter, Facebook, and blogs are also faced with having to analyze (mostly unstructured) data in near real-time. But as Truviso founder and UC Berkeley CS Professor Michael Franklin recently noted, there are mountains of structured data generated by web apps that lend themselves to real-time analysis.

Comments: 10

Analytics: Are Streams the New Hits?

The definition of an online video stream can mean different things on different sites. This kind of ambiguity hurts everyone involved.

Comments Off