"nosql" entries

Four short links: 11 January 2011

Four short links: 11 January 2011

Microsoft and the Web, URL Library, Optimism, and NoSQL Instruction

  1. Dive Into 2010 (Mark Pilgrim) — Mark wrote a hugely popular guide to HTML5 which was available online and published by O’Reilly. 6% of visitors used some version of Internet Explorer. That is not a typo. The site works fine in Internet Explorer — the site practices what it preaches, and the live examples use a variety of fallbacks for legacy browsers — so this is entirely due to the subject matter. Microsoft has completely lost the web development community.
  2. google-url — the Google URL-parsing library, designed to be embeddable.
  3. Reasons to be Cheerful (Charlie Stross) — if all we ever do is gripe about ways in which the world is not perfect, we will make ourselves miserable and fail to appreciate ways in which things are getting better. Important.
  4. NoSQL Tapes — videos of lectures on NoSQL topics. (via Hacker News)
Four short links: 26 October 2010

Four short links: 26 October 2010

NoSQL Experience, Connected Future, Hacktivism, and Mobile UI Guidelines

  1. 12 Months with MongoDB (Worknik) — every type of retrieval got faster than their old MySQL store, and there are some other benefits too. They note that the admin tools aren’t really there for MongoDB, so “there is a blurry hand-off between IT Ops and Engineering.” (via Hacker News)
  2. Dawn of a New Day — Ray Ozzie’s farewell note to Microsoft. Clear definition of the challenges to come: At first blush, this world of continuous services and connected devices doesn’t seem very different than today. But those who build, deploy and manage today’s websites understand viscerally that fielding a truly continuous service is incredibly difficult and is only achieved by the most sophisticated high-scale consumer websites. And those who build and deploy application fabrics targeting connected devices understand how challenging it can be to simply & reliably just ‘sync’ or ‘stream’. To achieve these seemingly simple objectives will require dramatic innovation in human interface, hardware, software and services. (via Tim O’Reilly on Twitter)
  3. A Civic Hacktivism Abecedary — good ideas matched with exquisite quotes and language. My favourite: Kick at the darkness until it bleeds daylight. (via Francis Irving on Twitter)
  4. UI Guidelines for Mobile and Web Programming — collection of pointers to official UI guidelines from Nokia, Apple, Microsoft, MeeGo, and more.
Four short links: 21 October 2010

Four short links: 21 October 2010

MySQL as NoSQL, Handmade SLR, Mac App Store, and Datamining Privacy Workshop

  1. Using MysQL as NoSQL750,000+ qps on a commodity MySQL/InnoDB 5.1 server from remote web clients.
  2. Making an SLR Camera from Scratch — amazing piece of hardware devotion. (via hackaday.com)
  3. Mac App Store Guidelines — Apple announce an app store for the Macintosh, similar to its app store for iPhones and iPads. “Mac App” no longer means generic “program”, it has a new and specific meaning, a program that must be installed through the App store and which has limited functionality (only one can run at a time, it’s full-screen, etc.). The list of guidelines for what kinds of programs you can’t sell through the App Store is interesting. Many have good reasons to be, but It creates a store inside itself for selling or distributing other software (i.e., an audio plug-in store in an audio app) is pure greed. Some are afeared that the next step is to make the App store the only way to install apps on a Mac, a move that would drive me away. It would be a sad day for Mac-lovers if Microsoft were to be the more open solution than Apple. cf the Owner’s Manifesto.
  4. Privacy Aspects of Data Mining — CFP for an IEEE workshop in December. (via jschneider on Twitter)
Four short links: 20 October 2010

Four short links: 20 October 2010

Bad Game Mechanics, Under NoSQL Covers, the LAN of Things, and the Smithsonian Commons

  1. Pwned: Gamification and its Discontents (Slideshare) — hear, hear! Video games are not fun because they’re video games, but if and only they are well-designed. Just adding something from games isn’t a guarantee for fun. (via jameshome on Twitter)
  2. Redis Under the Hood — explanation of the insides and mechanisms of this popular distributed key-value store. (via tlockney on delicious)
  3. The LAN of Things (Mike Kuniavsky) — Before we can have an Internet of Things, we will need to have a LAN of things.[…] Most of the utility of a LAN came from its local functionality. Thus, before we can build a useful (from a user perspective) Internet of Things, we need to learn to build useful LANs of Things. […] I think it’s important to start thinking about what the highly localized uses of sparsely distributed technology can be. What can we do when there are only a couple of things with RFIDs in our house? What totally great service can be built on having two light switches that report their telemetry in the house? What totally valuable information can you tell me if I only wear my motion sensor every once in a while? Love it. (via Matt Jones on Delicious)
  4. Mike Edson’s Talk at Powerhouse Museum — the Director of Web and New Media Technology at the Smithsonian is smart, articulate, and trying to do something cool with the Smithsonian Commons prototype. (via sebchan on Twitter)

The SMAQ stack for big data

Storage, MapReduce and Query are ushering in data-driven products and services.

We're at the beginning of a revolution in data-driven products and services, driven by a software stack that enables big data processing on commodity hardware. Learn about the SMAQ stack, and where today's big data tools fit in.

Four short links: 14 September 2010

Four short links: 14 September 2010

Facebook Bank, New in NoSQL, Twitter Numbers, and Open Source EEG Driver

  1. ASB Bank’s Facebook Virtual Branch — the world’s first Facebook branch of a bank, where you can live chat with tellers. (via Vaughn Davis)
  2. SciDB — GPLv3 NoSQL database. In addition to being multi-dimensional and offering array based scaling from megabytes to petabytes and running on tens of thousands clustered nodes, SciDB’s will be write once read many, allow bulk load rather than single road insert, provide parallel computation, be designed for automatic rather than manual administration, and work with R, Matlab, IDL, C++ and Python. (that from The Register) (via jsteeleeditor on Twitter)
  3. Twitter By The Numbers (Raffi Krikorian) — given to answer the question “what’s so hard about delivering 140 characters?”. They hit a peak of 3283 inbound tweets/second. Every time Lady Gaga tweets, 6.1M people have to get it. (via Alex Russell)
  4. EmoKit — an open source driver to the $300 Emotiv EPOC EEG headset. (via BoingBoing)
Four short links: 13 September 2010

Four short links: 13 September 2010

Community Deconstructed, Sparklines Explained, NoSQL Navigated, and Foxconn Surveyed

  1. Open Source Community Types (Simon Phipps) — draws a distinction between extenders and deployers to take away the “who do you mean?” confusion that comes with the term “community”.
  2. Sparklines — Tufte’s coverage of sparkline graphs in Beautiful Evidence. (via Hacker News)
  3. Why NoSQL Matters (Heroku blog) — a very nice precis of the use cases for various NoSQL systems. Frequently-written, rarely read statistical data (for example, a web hit counter) should use an in-memory key/value store like Redis, or an update-in-place document store like MongoDB. I’m sure there are as many opinions as there are people, but I’d welcome a “if you want to do X, look at Y” guide to the NoSQL space. If you know of such a beast, please leave pointers in the comments. Thanks!
  4. The Man Who Makes Your iPhone (BusinessWeek) — a fascinating survey of Foxconn’s CEO, history, operations, culture, and plans. This line resonated for me: “I never think I am successful,” he says. “If I am successful, then I should be retired. If I am not retired, then that means I should still be working hard, keeping the company running.”
Four short links: 9 September 2010

Four short links: 9 September 2010

Thumb Drives and the Cloud, FCC APIs, Mining on GFS, Check Your Prose with Scribe

  1. CloudUSBa USB key containing your operating environment and your data + a protected folder so nobody can access you data, even if you lost the key + a backup program which keeps a copy of your data on an online disk, with double password protection. (via ferrouswheel on Twitter)
  2. FCC APIs — for spectrum licenses, consumer broadband tests, census block search, and more. (via rjweeks70 on Twitter)
  3. Sibyl: A system for large scale machine learning (PDF) — paper from Google researchers on how to build machine learning on top of a system designed for batch processing. (via Greg Linden)
  4. The Surprisingness of What We Say About Ourselves (BERG London) — I made a chart of word-by-word surprisingness: given the statement so far, could Scribe predict what would come next?

Data Week: Becoming a data scientist

Data Pointed, CouchDB in the Cloud, Launching Strata

Data Week is a new series that brings together notable stories and developments from the data world. Links in this edition include: the connection between visualizations and art, advice on becoming a data scientist, BigCouch goes open source, and more.

Four short links: 1 September 2010

Four short links: 1 September 2010

Faces in R, Open Source Web Analytics, Small File Store, Building Mapper

  1. R Library for Chernoff Facesfaces represent the rows of a data matrix by faces. plot.faces plots faces into a scatterplot. Interesting emotional way to visualize data, which was used to good effect (though not with this library) by BERG in Schooloscope. (via the tutorial at Flowing Data)
  2. Piwik — GPLed web analytics package.
  3. Pomegranate — a data store for billions of tiny files. (via the High Scalability blog interview with the creator of Pomegranate)
  4. New Backpack Makes 3D Maps of Buildings — the backpack indoor equivalent of the Google Maps cars, from Berkeley researchers.