- In Flawed, Epic Anonymous Book, the Abyss Gazes Back (Wired) — Quinn Norton’s review of a book about Anonymous is an excellent introduction to Anonymous. Anonymous made us, its mediafags, masters of hedging language. The bombastic claims and hyperbolic declarations must be reported from their mouths, not from our publications. And yet still we make mistakes and publish lies and assumptions that slip through. There is some of this in all of journalism, but in a world where nothing is true and everything is permitted, it’s a constant existential slog. It’s why there’s not many of us on this beat.
- Titan (GitHub) — Apache2-licensed distributed graph database optimized for storing and processing large-scale graphs within a multi-machine cluster. Cassandra and HBase backends, implements the Blueprints graph API. (via Hacker News)
- Extra Second This June — we’re getting a leap second this year: there’ll be 2012 June 30, 23h 59m 60s. Calendars are fun.
- On Creativity (Beta Knowledge) — I wanted to create a game where even the developers couldn’t see what was coming. Of course I wasn’t thinking about debugging at this point. The people who did the debugging asked me what was a bug. I could not answer that. — Keita Takahashi, game designer (Katamari Damacy, Noby Noby Boy). Awesome quote.
ENTRIES TAGGED "databases"
On Anonymous, Graph Database, Leap Second, and Debugging Creativity
How to think about choosing a database.
A relational database is no longer the default choice. Mike Loukides charts the rise of the NoSQL movement and explains how to choose the right database for your application.
Graph ORM, Graphic Computation, Web Intents, and Async RPC
- Bulbflow — a Python framework for graph databases: it’s like an ORM for graphs. (via Joshua Schachter)
- Nomograms — the lost art of graphical computing. (via John D Cook)
- Web Intents — adding Android-style Intents to the web. Services register their intention to be able to handle an action on the user’s behalf. Applications request to start an Action of a certain verb (share, edit, view, pick etc) and the system will find the appropriate Services for the user to use based on the user’s preference.
- Finagle (GitHub) — Twitter’s asynchronous network stack for the JVM that you can use to build asynchronous Remote Procedure Call (RPC) clients and servers in Java, Scala, or any JVM-hosted language. Finagle provides a rich set of tools that are protocol independent.
SQL Injection, Optical Stick, SQL for Crowdsourcing, and DIY Medical Records
- SQL Injection Pocket Reference (Google Docs) — just what it sounds like. (via ModSecurity SQL Injection Challenge: Lessons Learned)
- isostick: The Optical Drive in a Stick (KickStarter) — clever! A USB memory stick with drivers that emulate optical drives so you can boot off .iso files you’ve put on the memory stick. (via Extreme Tech)
- CrowdDB: Answering Queries with Crowdsourcing (Berkeley) — CrowdDB uses human input via crowdsourcing to process queries that neither database systems nor search engines can adequately answer. It uses SQL both as a language for posing complex queries and as a way to model data. (via Big Data)
- The DIY Electronic Medical Record (Bryce Roberts) — I had a record of my daily weight, my exercising (catalogued by type), my walking, my calories burned and now, with the addition of Zeo, my nightly sleep patterns. All of this data had been passively collected with little to no manual input required from me. Total investment in this personal sensor network was in the range of a couple hundred dollars. And, as I rummaged through my data it began to hit me that what I’ve really been doing is creating my own DIY Electronic Medical Record. The Quantified Self is about more than obsessively cataloguing your bowel movements in low-contrast infographics. I’m less enthused by the opportunities to publicly perform private data, a-la the wifi body scale, than I am by opportunities to gain personal insight.
Questions surround the Aaron Swartz case and Microsoft wants to help scholars with big data.
Aaron Swartz faces felony charges for downloading "big data" (more than 4 million academic journals) from the MIT library, Microsoft's new data tool is aimed at scholars, and David Eaves looks at open data efforts in Canada.
- Tilt — clever Firefox plugin that lets you view the DOM on your page in 3D. Excellent for visually understanding the structure and layout of your page. I can’t wait to see the applications of this in debugging and teaching.
- Improving Hadoop Efficiency on Graph Data — three techniques: clustering data instead of randomly partitioning across nodes; allowing different data to be replicated differently; graph-optimized storage. (via Big Data)
- Learnings from the Long View (PDF) — scenario planning lessons learned from the founder of the field. Most of the scenarios they talk about are near and dear to the O’Reilly heart: smart networks, augmented reality, synthetic biology, energy from bacteria, super macro- and micro-manufacturing. (via Rob Passarella)
Rare Visualization, Google+ Tech, Scala+Erlang, and In-Database Analytics
- Slopegraphs — a nifty Tufte visualization which conveys rank, value, and delta over time. Includes pointers to how to make them, and guidelines for when and how they work. (via Avi Bryant)
- scalang (github) — a Scala wrapper that makes it easy to interface with Erlang, so you can use two hipster-compliant built-to-scale technologies in the same project. (via Justin Sheehy)
- Madlib — an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data. (via Mike Loukides)
God Games, Digitised History, git Database, and App Framework
- Let There Be Smite (Pippin Barr) — simple diversion for the 4th of July. It won’t be easy for God to save America. (via Pippin’s blog)
- Basel Wear — to answer the question I know was burning on your lips: “what *did* the Swiss wear in 1634?” Impressively detailed pictures from a 1634 book that is now online. One of the reasons I’m in favour of digitizing cultural collections is that we’re more likely to encounter them on the net and so ask questions like “how did people dress in 1634?”, “why did everyone carry keys?”, and “what is a Sexton?”
- databranches: Using git as a Database — it’s important to approach your design for using git as a database from the perspective of automated merging. Get the merging right and the rest will follow. I’ve chosen to use the simplest possible merge, the union merge: When merging parent trees A and B, the result will have all files that are in either A or B, and files present in both will have their lines merged (and possibly reordered or uniqed).
- Joshfire — open source (dual-licensed GPLv2 and commercial) multiplatform development framework built on HTML5.