- See the World as a Colour-Blind Person Would — filters that let you see images as protanopes, deuteranopes, and even tritanopes would see them. I am protanoptic (if that’s a word) and I can vouch that the “after” pix look the same as “before” to me. Care, because about 8% of men have some form of colourblindness and hate you and your “red is bad, green is good” visual cues. (via Flowing Data)
- Waffles — seeks to be the world’s most comprehensive collection of command-line tools for machine learning and data mining.
- LinkedIn Open Sources Index and Query Services — full-text index and retrieval engine, APIs, and a framework to manage indexes on infrastructure-as-a-service.
ENTRIES TAGGED "visualisation"
A BBC visualization maps every traffic casualty in the UK between 1999-2010.
More than a decade's worth of traffic accident data is plotted out in a BBC visualization. Locations with a history of accidents are hard to miss.
An artist blends sound and design to visualize the prelude to Bach's Cello Suites.
It's rare when a visualization sounds as good as it looks, but that's the case with Alexander Chen's sonic and visual rendering of Bach's Cello Suites.
Distributed Drug Money, Science Game, Beautiful Machine Learning, and Stream Event Processing
- Silk Road (Gawker) — Tor-delivered “web” site that is like an eBay for drugs, currency is Bitcoins. Jeff Garzik, a member of the Bitcoin core development team, says in an email that bitcoin is not as anonymous as the denizens of Silk Road would like to believe. He explains that because all Bitcoin transactions are recorded in a public log, though the identities of all the parties are anonymous, law enforcement could use sophisticated network analysis techniques to parse the transaction flow and track down individual Bitcoin users. “Attempting major illicit transactions with bitcoin, given existing statistical analysis techniques deployed in the field by law enforcement, is pretty damned dumb,” he says. The site is viewable here, and here’s a discussion of delivering hidden web sites with Tor. (via Nelson Minar)
- Dr Waller — a big game using DC Comics characters where players end up crowdsourcing science on GalaxyZoo. A nice variant on the captcha/ESP-style game that Luis von Ahn is known for. (via BoingBoing)
- Machine Learning Demos — hypnotically beautiful. Code for download.
- Esper — stream event processing engine, GPLv2-licensed Java. (via Stream Event Processing with Esper and Edd Dumbill)
PC in JS, Musical Visualization, S3 Parallel, and Tech-led Ed
- US Home Prices as Opera (Flowing Data) — reminded me of Douglas Adams’s “Dirk Gently’s Holistic Detective Agency” which has software that turns your company’s performance numbers into music. The yearly accounts of most British companies emerged sounding like the Dead March from “Saul”, but in Japan they went for it like a pack of rats. It produced lots of cheery company anthems that started well, but if you were going to criticise you’d probably say that they tended to get a bit loud and squeaky at the end.
- s3cmd Parallel — command-line tool with parallel uploads to s3. (via Nelson Minar)
- Eight of China’s Top Nine Government Officials are Scientists (Singularity Hub) — the article’s idiotic reduction to performance on standardised tests misses America’s primary strength against China, namely creative and flexible workforce. China will get there, but it’s not there yet.
MongoDB for Guardian, Visualization Book, Mobile CouchDB, and Fast Approximate String Retrieval
- Why We Chose MongoDB for Guardian.co.uk (SlideShare) — they’re using MongoDB’s flexible schema, as schema upgrades were pain in their previous system (Oracle). I think of these as the database equivalent of dynamic typing in languages like Perl and Ruby. (via Paul Rowe)
- Solving Problems with Visual Analytics — This book is the result of a community effort of the partners of the VisMaster Coordinated Action funded by the European Union. The overarching aim of VisMaster was to create a research roadmap that outlines the current state of visual analytics across many disciplines, and to describe the next steps that have to be taken to foster a strong visual analytics community, thus enabling the development of advanced visual analytic applications. (via Mark Madsen)
- iOS-Couchbase (GitHub) — a build of distributed key-value store CouchDB, which will keep your mobile data in sync with a remote store. No mean feat given CouchDB itself has Erlang as a dependency. (via Mike Olson)
- SimString — A fast and simple algorithm for approximate string retrieval in C++ with Python and Ruby bindings, opensourced with modified BSD license. (via Matt Biddulph)
A hidden file in iOS 4 is regularly recording the position of devices.
Pete Warden and Alasdair Allan have discovered that iPhones and 3G iPads running iOS 4 are regularly recording the location of devices into a hidden file.