- Apertus — open source cinema camera. (via joshua on Delicious)
- A Survey of Collaborative Filtering Techniques — From basic techniques to the state-of-the-art, we attempt to present a comprehensive survey for CF techniques, which can be served as a roadmap for research and practice in this area. (via bos on Delicious)
- Drizzle Replication using RabbitMQ as Transport — we’re watching the growing use of message queues in web software, and here’s an interesting application. (via sogrady on Delicious)
- Facebook Data Team: Distributed Data Analysis at Facebook — job ad from Facebook gives numbers on company use of their Hive data warehouse tool built on top of Hadoop: Today, Facebook counts 29% of its employees (and growing!) as Hive users. More than half (51%) of those users are outside of Engineering. They come from distinct groups like User Operations, Sales, Human Resources, and Finance. Many of them had never used a database before working here. Thanks to Hive, they are now all data ninjas who are able to move fast and make great decisions with data. (via Simon Willison)
ENTRIES TAGGED "opensource"
Open Source Cinema Camera, Collaborative Filtering, Message Queue for Replication, Facebook Data Warehouse Numbers
Sexy HTTP Parser, 9/11 Pager Leaks, Open Source Science, GLAM and Newspapers
- http-parser — This is a parser for HTTP messages written in C. It parses both requests and responses. The parser is designed to be used in performance HTTP applications. It does not make any allocations, it does not buffer data, and it can be interrupted at anytime. It only requires about 128 bytes of data per message stream (in a web server that is per connection). Extremely sexy piece of coding. (via sungo on Twitter)
- Wikileaks to Release 9/11 Pager Intercepts — they’re trickling the half-million messages out in simulated real time. The archive is a completely objective record of the defining moment of our time. We hope that its revelation will lead to a more nuanced understanding of the event and its tragic consequences. (via cshirky on Twitter)
- Promoting Open Source Science — interesting interview with an open science practitioner, but also notable for what it is: he was interviewed and released the text of the interview himself because his responses had been abridged in the printed version. (via suze on Twitter)
- Copyright, Findability, and Other Ideas from NDF (Julie Starr) — a newspaper industry guru attended the National Digital Forum where Galleries, Libraries, Archives, and Museums talk about their digital issues, where she discovered that newspapers and GLAMs have a lot in common. We can build beautiful, rich websites till the cows come home but they’re no good to anyone if people can’t easily find all that lovely content lurking beneath the homepage. That’s as true for news websites as it is for cultural archives and exhibitions, and it’s a topic that arose often in conversation at the NDF conference. I’ve been cooling on destination websites for a while. You need to have a destination website, of course, but you need even more to have your content out where your audience is so they can trip over it often and usefully.
Chumby One, Gorgeous IE Debugger, Freer Than Free, and Phone-a-Friend for Government IT
- Chumby One (Bunnie Huang) — new Chumby product released. In addition to being about half the price of the original chumby, the new device added some features: it has an FM radio, and it has support for a rechargeable lithium ion battery (although it’s not included with the device, you have to buy one and install it yourself). There’s also a knob so you can easily/quickly adjust the volume. But I don’t think those are really the significant new features. What really gets me excited about this one is that it’s much more hackable.
- Deep Tracing of Internet Explorer (John Resig) — very sexy deep inspection of Internet Explorer. Finally, something IE does better than Firefox (other than exploits). dynaTrace Ajax works by sticking low-level instrumentation into Internet Explorer when it launches, capturing any activity that occurs – and I mean virtually any activity that you can imagine. (via Simon Willison)
- Less Than Free — begins by talking about Google giving away turn-by-turn directions on Android, and then analyses Google’s “less than free” business model: Additionally, because Google has created an open source version of Android, carriers believe they have an “out” if they part ways with Google in the future. I then asked my friend, “so why would they ever use the Google (non open source) license version.” Here was the big punch line – because Google will give you ad splits on search if you use that version! That’s right; Google will pay you to use their mobile OS. I like to call this the “less than free” business model. This is a remarkable card to play. Because of its dominance in search, Google has ad rates that blow away the competition. To compete at an equally “less than free” price point, Symbian or windows mobile would need to subsidize. Double ouch!!
- Expert Labs — a new independent initiative to help policy makers in our government take advantage of the expertise of their fellow citizens. How does it work? Simple: 1. We ask policy makers what questions they need answered to make better decisions. 2. We help the technology community create the tools that will get those answers. 3. We prompt the scientific & research communities to provide the answers that will make our country run better. New non-profit from Anil Dash.
The view from the eye of a recession isn't great. When companies are going bust, unemployment growing, and everyone's scouring their budgets for costs to cut, it can be hard to see opportunities. However, when Tim pointed to Stephen O'Grady's fine set of 2010 predictions I found myself popping with "oh, so naturally this will happen next …" thoughts. Think…
I’ve been holding my breath for so long waiting for this memo that I may not remember how to start breathing again, but here it is. The Department of Defense Deputy CIO Dave Wennergren has signed and released “Clarifying Guidance on Open Source Software.”
Yesterday, the new media team at the White House announced via the Associated Press that whitehouse.gov is now running on Drupal, the open source content management system. That Drupal implementation is in turn running on a Red Hat Linux system with Apache, MySQL and the rest of the LAMP stack. Apache Solr is the new White House search engine. This move is obviously a big win for open source. While open source is already widespread throughout the government, its adoption by the White House will almost certainly give permission for much wider uptake.
Open Source, Gov 2.0, Gaming, Education
- Our Open Source School — blog of Albany Senior High School in New Zealand, which only runs open source software.
- Behind The Scenes at What Do They Know — interesting post showing details behind the What Do They Know web site. In the last year there have been only seven significant cases where requests have been hidden from public view on the site due to concerns relating to potential libel and defamation. Three of those cases have involved groups of twenty or so requests made by the same one or two users. While actual number of requests we have had to hide is around 70 (0.4% of the total) even this small fraction overstates the situation due to the repetition of the same potentially libelous accusations comments in different requests. In all cases we have kept as much information up on the site as possible. Our policy with respect to all requests to remove information from the site is that we only take down information in exceptional circumstances; generally only when the law requires us to do so.
- The Complete History of Lemmings — a must-read for videogamers from the early 90s. Theres been much debate over the choice of colours as well, but the colours were selected, not because they were the easiest to choose, but because of the PC EGA palette. With the limited choice, it was decided the green hair was nicer than blue, and with that, the final Lemming was born. I was actually the next person to code up a demo on the Commodore 64, but I only got so far as having a single Lemming walking over the landscape before Dave put me onto another project.
- Google Replaces TeleAtlas — Tele Atlas confirms that Google has decided to stop using Tele Atlas map data for the U.S. Google will now use its own map data. Our relationship with Google for map coverage continues outside of the U.S. in dozens of geographies.
Involuntarily Opened Geodata, Sense Organ, Doc Vis, 3D Open Source Bodies
- Wikileaks Now Holds UK Postcode Database — the UK does not have open geodata in the way that we know it. A state-owned enterprise, Ordnance Survey, is responsible for maintaining all sorts of baseline data and they charge (through the nose) for that data. This is the release of 1,841,177 post codes, geographic boundaries, and more. Postcodes in the UK are far more useful than US ZIP codes–they identify a handful of houses, rather than a few thousand houses.
- My New Sense Organ — a strap with buzzers and a compass, so you always have physical reminder of orientation. For people like me who can get lost putting on pants in the morning, this would be a godsend. (via Slashdot)
- Saving is Obsolete — EtherPad adds a Wave-like replay feature to help you see the history of a document.
- Open Source 3D People — incredible software to design realistic 3D faces and bodies. (via glynmoody on Twitter)