"nosql" entries

Four short links: 13 August 2010

Four short links: 13 August 2010

Scientific Literacy, Load Balancing, Indoors Geolocation, and iPhone Security

  1. The Myth of Scientific LiteracyI’d love it if there was a simple course we could send our elected officials on which would guarantee future science policy would be reliably high quality. Being educated in science (or even “about science”) isn’t going to do it. It’s social connections that will. We need to keep our elected officials honest, constantly check they are applying the evidence we want them to, in the ways we want them to. And if the scientific community want to be listened to, they need to work to build connections. Get political and scientific communities overlapping, embed scientists in policy institutions (and vice versa), get MP’s constituents onside to help foster the sorts of public pressure you want to see: build trust so scientists become people MPs want to be briefed by. (via foe on Twitter)
  2. Three Papers on Load Balancing (Alex Popescu) — three papers on distributed hash tables.
  3. Meridian — iPhone app that does in-building location, sample app is the AMNH Explorer which shows you maps of where you are. Uses wifi-based positioning. (via raffi on Twitter)
  4. Fixing What Apple Won’t — the jailbreakers are releasing security patches for systems that Apple have abandoned. (via ardgedee on Twitter)
Four short links: 5 August 2010

Four short links: 5 August 2010

Delicious Graphs, Charities and Data, Climate Psychology, Data Structure Portability

  1. Delicious Links Clustered and Stacked (Matt Biddulph) — six years of his delicious links, k-means clustered by tag and graphed. The clusters are interesting, but I wonder whether Matt can identify significant life/work events by the spikes in the graph.
  2. Open Data and the Voluntary Sector (OKFN) — Open data will give charities new ways to find and share information on the need of their beneficiaries – who needs their services most and where they are located. The sharing of information will be key to this – it’s not just about using data that the government has opened up, but also opening your own data.
  3. Cognitive and Behavioral Challenges in Responding to Climate ChangeAt the deepest level, large scale environmental problems such as global warming threaten people’s sense of the continuity of life – what sociologist Anthony Giddens calls ontological security. Ignoring the obvious can, however, be a lot of work. Both the reasons for and process of denial are socially organized; that is to say, both cognition and denial are socially structured. Denial is socially organized because societies develop and reinforce a whole repertoire of techniques or “tools” for ignoring disturbing problems. Fascinating paper. (via Jez)
  4. Blueprintsprovides a collection of interfaces and implementations to common, complex data structures. Blueprints contains a property graph model its implementations for TinkerGraph, Neo4j, and SAIL. Also, it contains an object document model and implementations for TinkerDoc, CouchDB, and MongoDB. In short, Blueprints provides a one stop shop for implemented interfaces to help developers create software without being tied to particular underlying data management systems.
Four short links: 25 June 2010

Four short links: 25 June 2010

More NoSQL, Data Medicine, Startups to Government, and Cake-and-eat-it Open Source

  1. Membasean open-source (Apache 2.0 license) distributed, key-value database management system optimized for storing data behind interactive web applications. These applications must service many concurrent users; creating, storing, retrieving, aggregating, manipulating and presenting data in real-time. Supporting these requirements, membase processes data operations with quasi-deterministic low latency and high sustained throughput. (via Hacker News)
  2. Sergey’s Search (Wired) — Sergey Brin, one of the Google founders, learned he had a gene allele that gave him much higher odds of getting Parkinson’s. His response has been to help medical research, both with money and through 23andme. Langston decided to see whether the 23andMe Research Initiative might be able to shed some insight on the correlation, so he rang up 23andMe’s Eriksson, and asked him to run a search. In a few minutes, Eriksson was able to identify 350 people who had the mutation responsible for Gaucher’s. A few clicks more and he was able to calculate that they were five times more likely to have Parkinson’s disease, a result practically identical to the NEJM study. All told, it took about 20 minutes. “It would’ve taken years to learn that in traditional epidemiology,” Langston says. “Even though we’re in the Wright brothers early days with this stuff, to get a result so strongly and so quickly is remarkable.”
  3. Startup.gov (YouTube) — Anil Dash talk at Personal Democracy Forum on applying insights from startups to government. I hope the more people say this, the greater the odds it’ll be acted on.
  4. Open Core Software — Marten Mickos (ex-MySQL) talks up “open core” (open source base, proprietary extensions) as a way to resolve the conflict of “change the world with open source” and “make money”. Brian Aker disagrees: There has been no successful launch of an open core company that has reached any significant size, especially of the size that Marten hints at in the article. My take: there are three reasons for open source (freedoms, price, and development scale) and if you close the source to part of your product then the whole product loses those benefits. If you open source enough that the open source bit has massive momentum, then you probably don’t have enough left proprietary to gain huge financial benefit.
Four short links: 28 April 2010

Four short links: 28 April 2010

Fair Use Economy, Deconstituted Appliances, 3D Vision, Redis for Fun and Profit

  1. Fair Use in the US Economy (PDF) — prepared by IT lobby in the US, it’s the counterpart to Big ©’s fictitious billions of dollars of losses due to file sharing. Take each with a grain of salt, but this is interesting because it talks about the industries and businesses that the fair use laws make possible.
  2. Disassembled Household Appliances — neat photos of the pieces in common equipment like waffle irons, sandwich makers, can openers, etc. (via evilmadscientist)
  3. GelSight — gel block on a sheet of glass, lit from below with lights and then scanned with cameras, lets you easily capture 3D qualities of the objects pressed into it. Very cool demo–you can see finger prints, pulse, and even make out designs on a $100 bill.
  4. Redis Tutorial (Simon Willison) — Redis is a very fast collection of useful behaviours wrapped around a distributed key-value store. You get locks, IDs, counters, sets, lists, queues, replication, and more.

Big data analytics: From data scientists to business analysts

The growing popularity of Big Data management tools (Hadoop; MPP, real-time SQL, NoSQL databases; and others) means many more companies can handle large amounts of data. But how do companies analyze and mine their vast amounts of data? For companies that already have large amounts of data in Hadoop, there's room for even simpler tools that would allow business users to directly interact with Big Data.

Brian Aker on post-Oracle MySQL

A deep look at Oracle's motivations and MySQL's future

In time for next week’s MySQL Conference & Expo, Brian Aker discussed a number of topics with us, including Oracle’s motivations for buying Sun and the rise of NoSQL.

Joe Stump on data, APIs, and why location is up for grabs

The SimpleGEO CTO and former Digg architect discusses NoSQL and location's future

I recently had a long conversation with Joe Stump, CTO of SimpleGeo, about location, geodata, and the NoSQL movement. Stump, who was formerly lead architect at Digg, had a lot to say. Here’s the highlights, you can find the full interview elsewhere on Radar.

Four short links: 17 March 2010

Four short links: 17 March 2010

MySQL, MySociety, NoSQL DB, and NoSQL Conference Notes

  1. Common MySQL Queries — a useful reference.
  2. MySociety’s Next 12 Months — two new projects, FixMyTransport and “Project Fosbury”. The latter is a more general tool to help people organise their own campaigns for change.
  3. riak — scalable key-value store with JSON interface. (via joshua on Delicious)
  4. Notes from NoSQL Live Boston — full of juicy nuggets of info from the NoSQL conference.
Four short links: 16 March 2010

Four short links: 16 March 2010

Platform Games, NoSQL Conf, Ratings, and How to Teach

  1. Government is an Elephant (Public Strategist) — if Government is to be a platform, it will end up competing with the members of its ecosystems (the same way Apple’s Dashboard competed with Konfabulator, and Google’s MyMaps competed with Platial). If you think people squawk when a company competes, just wait until the competition is taxpayer-funded ….
  2. Recordings from NoSQL Live Boston — also available in podcasts.
  3. Modeling Scale Usage Heterogeneity the Bayesian Way — people use 1-5 scales in different ways (some cluster around the middle, some choose extremes, etc.). This shows how to identify the types of users, compensate for their interpretation of the scale, and how it leads to more accurate results.
  4. Building a Better Teacher — fascinating discussion about classroom management that applies to parenting, training, leading a meeting, and many other activities that take place outside of the school classroom. (via Mind Hacks)

NoSQL conference coming to Boston

On March 11 Boston will join several other cities who have host conferences on the movement broadly known as NoSQL. Cassandra, CouchDB, HBase, HypergraphDB, Hypertable, Memcached, MongoDB, Neo4j, Riak, SimpleDB, Voldemort, and probably other projects as well will be represented at the one-day affair. The interviews I had with various projects leaders for this article turned up a recurring usage pattern for NoSQL. What connects the users is that they carry out web-related data crunching, searching, and other Web 2.0 related work. I think these companies use NoSQL tools because they’re the companies who understand leading-edge technologies and are willing to take risks in those areas. As the field gets better known, usage will spread.