Four short links: 5 November 2010
Stream Processing, Semantic Web, Location Services, and PDF Extraction
- S4 — S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data. Open-sourced (Apache license) by Yahoo!.
- RDF and Semantic Web: Can We Reach Escape Velocity? (PDF) — spot-on presentation from the data.gov.uk linked data advisor. It nails, clearly and in only 12 slides, why there’s still resistance to linked data uptake and what should happen to change this. Amen! (via Simon St Laurent)
- Pew Internet Report on Location-based Services — 10% of online Hispanics use these services – significantly more than online whites (3%) or online blacks (5%).
- Slate — Python library for extracting text from PDFs easily.