- School of Data — free online courses around data science and visualization.
- libshorttext — classify and analyse short-text of things like titles, questions, sentences, and short messages. MIT-style open source license, Python and C++ source.
- Letterboxd — a site for movie lovers from Kiwi Foo alums. I love people who build experiences to help people express their love of things.
- RadioBlocks and SimpleMesh — mesh networking for Arduino.
ENTRIES TAGGED "machine learning"
Four short links: 26 Feb 2013
Data Classes, Short Text Classification, Movie Lovers, and Mesh Networking for Hardware Hackers
Four short links: 12 February 2013
Handmade Hardware, Tab Silencer, Surprise and Models, and Sciencey GIFs
- Your USB Sticks Are Made With Chopsticks (Bunnie Huang) — behind-the-scenes on how USB sticks are made.
- mutetab — find and kill the Chrome tab making all the damn noise! (via Nelson Minar)
- Visualization, Modeling, and Surprises (John D Cook) — paraphrases Hadley Wickham: Visualization can surprise you, but it doesn’t scale well. Modelling scales well, but it can’t surprise you.
- Head Like an Orange — science animated GIFs, assembled from nature documentaries. (via Ed Yong)
Four short links: 24 January 2013
Google's Autonomous Cars, DIY BioPrinter, Forms Validation, and Machine Learning Workflow
- Google’s Driverless Car is Worth Trillions (Forbes) — Much of the reporting about Google’s driverless car has mistakenly focused on its science-fiction feel. [...] In fact, the driverless car has broad implications for society, for the economy and for individual businesses. Just in the U.S., the car puts up for grab some $2 trillion a year in revenue and even more market cap. It creates business opportunities that dwarf Google’s current search-based business and unleashes existential challenges to market leaders across numerous industries, including car makers, auto insurers, energy companies and others that share in car-related revenue.
- DIY BioPrinter (Instructables) — Think of it as 3D printing, but with squishier ingredients! How to piggyback on inkjet printer technology to print with your own biomaterials. It’s an exciting time for biohackery: FOO Ewan Birney is kicking ass and taking names, he was just involved in a project storing and retrieving data from DNA.
- Parsley — open-sourced forms validation library in Javascript.
- ADAMS — open sourced workflow tool for machine learning, from the excellent people at Waikato who brought you WEKA. ADAMS = Advanced Data mining And Machine learning System.
Four short links: 18 December 2012
Tweet Cred, C64 History, Performance Articles, Return of Manufacturing
- Credibility Ranking of Tweets During High Impact Events (PDF) — interesting research. Situational awareness information is information that leads to gain in the knowledge or update about details of the event, like the location, people affected, causes, etc. We found that on average, 30% content about an event, provides situational awareness information about the event, while 14% was spam. (via BoingBoing)
- The Commodore 64 — interesting that Chuck Peddle (who designed the 6502) and Bob Yannes (who designed the SID chip) are still alive. This article safely qualifies as Far More Than You Ever Thought You Wanted To Know About The C64 but it is fascinating. The BASIC housed in its ROM (“BASIC 2.0″) was painfully antiquated. It was actually the same BASIC that Tramiel had bought from Microsoft for the original PET back in 1977. Bill Gates, in a rare display of naivete, sold him the software outright for a flat fee of $10,000, figuring Commodore would have to come back soon for another, better version. He obviously didn’t know Jack Tramiel very well. Ironically, Commodore did have on hand a better BASIC 4.0 they had used in some of the later PET models, but Tramiel nixed using it in the Commodore 64 because it would require a more expensive 16 K rather than 8 K of ROM chips to house.
- The Performance Calendar — an article each day about speed. (via Steve Souders)
- Mr China Comes to America (The Atlantic) — long piece on the return of manufacturing to America, featuring Foo camper Liam Casey.
Software that keeps an eye on Grandma
Networked sensors and machine learning make it easy to see when things are out of the ordinary.
Much of health care — particularly for the elderly — is about detecting change, and, as the mobile health movement would have it, computers are very good at that. Given enough sensors, software can model an individual’s behavior patterns and then figure out when things are out of the ordinary — when gait slows, posture stoops or bedtime…
Two crucial questions for the smart grid
Who will own the data the industrial Internet generates, and how will users fare under an onslaught of optimization problems?
In a lively panel discussion at last week’s IEEE Industrial Electronics Society meeting in Montreal, two questions related to the smart grid (the prospective electrical distribution system that will set prices dynamically and let consumers sell electricity to other users easily) arose that I think we’ll hear much more about in coming years:
Who will own the data? One important…
Listening for tired machinery
Cheap sensors and sophisticated software keep expensive machines running smoothly
Software is making its way into places where it hasn’t usually been before, like the cutting surfaces of very fast, ultra-precise machine tools.
A high-speed milling machine can run at 42,000 RPM as it fabricates high-quality machine components within tolerances of a few microns. Excessive wear in that environment can lead to a failure that ruins an expensive part, but…
Four short links: 18 October 2012
Medical Data Commons, Verizon Sell You, Doctor Watson, and Weedkilling Drones
- Let’s Pool Our Medical Data (TED) — John Wilbanks (of Science Commons fame) gives a strong talk for creating an open, massive, mine-able database of data about health and genomics from many sources. Money quote: Facebook would never make a change to something as important as an advertising with a sample size as small as a Phase 3 clinical trial.
- Verizon Sells App Use, Browsing Habits, Location (CNet) — Verizon Wireless has begun selling information about its customers’ geographical locations, app usage, and Web browsing activities, a move that raises privacy questions and could brush up against federal wiretapping law. To Verizon, even when you do pay for it, you’re still the product. Carriers: they’re like graverobbing organ harvesters but without the strict ethical standards.
- IBM Watson About to Launch in Medicine (Fast Company) — This fall, after six months of teaching their treatment guidelines to Watson, the doctors at Sloan-Kettering will begin testing the IBM machine on real patients. [...] On the screen, a colorful globe spins. In a few seconds, Watson offers three possible courses of chemotherapy, charted as bars with varying levels of confidence–one choice above 90% and two above 80%. “Watson doesn’t give you the answer,” Kris says. “It gives you a range of answers.” Then it’s up to [the doctor] to make the call. (via Reddit)
- Robot Kills Weeds With 98% Accuracy — During tests, this automated system gathered over a million images as it moved through the fields. Its Computer Vision System was able to detect and segment individual plants – even those that were touching each other – with 98% accuracy.
Four short links: 9 October 2012
ID-based Democracy, Web Documentation, American Telco Gouging, and Stats Cookbook
- Finland Crowdsourcing New Laws (GigaOm) — online referenda. The Finnish government enabled something called a “citizens’ initiative”, through which registered voters can come up with new laws – if they can get 50,000 of their fellow citizens to back them up within six months, then the Eduskunta (the Finnish parliament) is forced to vote on the proposal. Now this crowdsourced law-making system is about to go online through a platform called the Open Ministry. Petitions and online voting are notoriously prone to fraud, so it will be interesting to see how well the online identity system behind this holds up.
- WebPlatform — wiki of information about developing for the open web. Joint production of many of the $BIGCOs of the web and the W3C, so will be interesting to see, as it develops, whether it has the best aspects of each or the worst.
- Why Your Phone, Cable, Internet Bills Cost So Much (Yahoo) — “The companies essentially have a business model that is antithetical to economic growth,” he says. “Profits go up if they can provide slow Internet at super high prices.” Excellent piece!
- Probability and Statistics Cookbook (Matthias Vallentin) — The cookbook contains a succinct representation of various topics in probability theory and statistics. It provides a comprehensive reference reduced to the mathematical essence, rather than aiming for elaborate explanations. CC-BY-NC-SA licensed, LaTeX source on github.
Four short links: 4 October 2012
Vannevar Bush, Topic Transparency, Ancient Maps, and Concussion Sensors
- As We May Think (Vannevar Bush) — incredibly prescient piece he wrote for The Atlantic in 1945.
- Transparency and Topic Models (YouTube) — a talk from DataGotham 2012, by Hanna Wallach. She uses latent Dirichlet allocation topic models to mine text data in declassified documents where the metadata are useless. She’s working on predicting classification durations (AWESOME!). (via Matt Biddulph)
- Slippy Map of the Ancient World — this. is. so. cool!
- Technology in the NFL — X2IMPACT’s Concussion Management System (CMS) is a great example of this trend. CMS, when combined with a digital mouth guard, also made by X2, enables coaches to see head impact data in real-time and asses concussions through monitoring the accelerometers in a players mouth guard. That data helps teams to decide whether to keep a player on the field or take them off for their own safety. Insert referee joke here.
Radar
Radar on
Radar on
Radar on
Radar on 