Four short links: 5 November 2010

Stream Processing, Semantic Web, Location Services, and PDF Extraction

  1. S4S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data. Open-sourced (Apache license) by Yahoo!.
  2. RDF and Semantic Web: Can We Reach Escape Velocity? (PDF) — spot-on presentation from the data.gov.uk linked data advisor. It nails, clearly and in only 12 slides, why there’s still resistance to linked data uptake and what should happen to change this. Amen! (via Simon St Laurent)
  3. Pew Internet Report on Location-based Services10% of online Hispanics use these services – significantly more than online whites (3%) or online blacks (5%).
  4. Slate — Python library for extracting text from PDFs easily.
tags: , , , , , ,