ENTRIES TAGGED "data"

Four short links: 29 November 2012

Four short links: 29 November 2012

Internet of Zings, Public Domain Alternate Universe, Web Engineers Tools, and Dashboards for All

  1. Is It The Internet of Things? — we’ve moved from “they ignore you” to “they laugh at you”. Next up, “they fight you”, then finally the earless RFID-enabled location-aware ambient-sensing Network of All wins. (via BERG London)
  2. The 2012 We Could Have Had — list of famous and interesting works which would have entered the public domain had we not had the 1976 extension of copyright law.
  3. Web Engineer’s Online Toolbox a list of online, Web-based tools that Web engineers can use for their work in development, testing, debugging and documentation.
  4. Indianapolis Museum of Art Dashboard — everyone should have a HUD showing the things they care about. (via Courtney Johnston)
Comment |
Four short links: 15 November 2012

Four short links: 15 November 2012

Video Effects, Old School, Data Set, and Games Numbers

  1. Atkinson Dithering in Real Time — a Processing app that renders what the video camera sees, as though it were an original Mac black and white image.
  2. Patching Binariesa patch for a crashing bug during import of account transactions or when changing a payee of a downloaded transaction in Microsoft Money Sunset Deluxe. Written with no source, simply by debugging the executable as it shipped for XP.
  3. Book Crossing DatasetContains 278,858 users (anonymized but with demographic information) providing 1,149,780 ratings (explicit / implicit) about 271,379 books.
  4. Network Games Market Update (Cartagena Capital) — The myth that players use mobile only ‘on the go’ has been shattered. Smartphones and tablets are now mainstream gaming platforms in their own right and a significant proportion of players play in stationary use case scenarios. Stats abound, including 38% of tablet gamers play more than five hours per week compared to 20% of mobile phone gamer.
Comment |
Four short links: 9 November 2012

Four short links: 9 November 2012

Civil Drones, Fencing the Public Domain, Quantified Spy, and Data Daemons for Fun and Metrics

  1. Helping Drones Play Nice With Other AviationThe U.S. airspace is quickly being filled with simultaneously flying drones. To such an extent, unmanned aircraft could soon become a nightmare for the ATC controllers. The ADS-B will improve Predator B’s crew situational awareness making the drone capable to operate more freely and safely in domestic and international airspace in accordance with civilian air traffic and airspace rules and regulations.
  2. Reclaiming NZ’s Digitised HeritageOut of a sample of 100 books: 50% of NZ Heritage Books (published before 1890) have been digitised; 90% of digitised texts are fully accessible; 98% of accessible texts are downloadable; Despite all works being in the public domain, only one did not have any licencing restrictions applied to its use. Most groups who digitise then go on to put restrictions around their use. [T]here are also many instances where arbitrary restrictions are being applied to the detriment of the public good.
  3. Self-Spy (GitHub) — Log everything you do on the computer, for statistics, future reference and all-around fun!
  4. statsd (GitHub) — Etsy’s data-gathering daemon, written up in an excellent blog post.
Comment |
Four short links: 6 November 2012

Four short links: 6 November 2012

Internet of Copters, Privacy Game, Visualizing Data, Secure Configs

  1. Tilt-to-Fly Controller and Copter (Kickstarter) — This looks totally awesome and hackable. The controller has a USB port, the protocol is documented, and you can even connect your own electronics payload, like an Arduino, camera, or homebrewed project to the auxiliary serial (UART + power) port.
  2. The Privacy Game (The Open University) — This game is designed to highlight how privacy and consent work online. Players make decisions about which information they reveal, who they reveal it to and why. For example, you may decide to trade some information for gifts when shopping on a website; or you may decide to keep other information secret when posting on a social networking site. (via BoingBoing)
  3. statwing — very easy analysis and visualization of data.
  4. duraconfa collection of hardened configuration files for SSL/TLS services. It’s easy to reduce crypto effectiveness with crappy choices and options, so it’s good to have solid configurations to go from.
Comment |
Four short links: 18 October 2012

Four short links: 18 October 2012

Medical Data Commons, Verizon Sell You, Doctor Watson, and Weedkilling Drones

  1. Let’s Pool Our Medical Data (TED) — John Wilbanks (of Science Commons fame) gives a strong talk for creating an open, massive, mine-able database of data about health and genomics from many sources. Money quote: Facebook would never make a change to something as important as an advertising with a sample size as small as a Phase 3 clinical trial.
  2. Verizon Sells App Use, Browsing Habits, Location (CNet) — Verizon Wireless has begun selling information about its customers’ geographical locations, app usage, and Web browsing activities, a move that raises privacy questions and could brush up against federal wiretapping law. To Verizon, even when you do pay for it, you’re still the product. Carriers: they’re like graverobbing organ harvesters but without the strict ethical standards.
  3. IBM Watson About to Launch in Medicine (Fast Company) — This fall, after six months of teaching their treatment guidelines to Watson, the doctors at Sloan-Kettering will begin testing the IBM machine on real patients. [...] On the screen, a colorful globe spins. In a few seconds, Watson offers three possible courses of chemotherapy, charted as bars with varying levels of confidence–one choice above 90% and two above 80%. “Watson doesn’t give you the answer,” Kris says. “It gives you a range of answers.” Then it’s up to [the doctor] to make the call. (via Reddit)
  4. Robot Kills Weeds With 98% AccuracyDuring tests, this automated system gathered over a million images as it moved through the fields. Its Computer Vision System was able to detect and segment individual plants – even those that were touching each other – with 98% accuracy.
Comment |

New ethics for a new world

The biggest threat that a data-driven world presents is an ethical one.

Since the first of our ancestors chipped stone into weapon, technology has divided us. Seldom more than today, however: a connected, always-on society promises health, wisdom, and efficiency even as it threatens an end to privacy and the rise of prejudice masked as science. On its surface, a data-driven society is more transparent, and makes better uses of its resources….
Read Full Post | Comments: 4 |

Open health data in practice: Increase your access to lab results

Voice your support for a proposed federal rule that expands patients' access to test results.

I’m convinced that there’s a wave of innovation coming in healthcare, driven by new kinds of data, new ways of extracting meaning from that data, and new business models that data can enable.  That’s one of the reasons why we launched our StrataRx Conference, which focuses on the importance of data science to the future of health care. Unfortunately, much…
Read Full Post | Comments: 4 |

Apple’s maps

Apple's maps problem isn't about software or design. It's about data.

I promise not to make any snarky remarks about Apple’s maps disaster, and the mistakes of letting a corporate vendetta get in the way of good business decisions. Oops, I lied. But it’s good to see that Tim Cook agrees, at…
Read Full Post | Comments: 13 |
Four short links: 24 September 2012

Four short links: 24 September 2012

Open Publishing, Theatre Sensing, Reddit First, and Math Podcasts

  1. Open Monograph Pressan open source software platform for managing the editorial workflow required to see monographs, edited volumes and, scholarly editions through internal and external review, editing, cataloguing, production, and publication. OMP will operate, as well, as a press website with catalog, distribution, and sales capacities. (via OKFN)
  2. Sensing Activity in Royal Shakespeare Theatre (NLTK) — sensing activity in the theatre, for graphing. Raw data available. (via Infovore)
  3. Why Journalists Love Reddit (GigaOM) — “Stories appear on Reddit, then half a day later they’re on Buzzfeed and Gawker, then they’re on the Washington Post, The Guardian and the New York Times. It’s a pretty established pattern.”
  4. Relatively Prime: The Toolbox — Kickstarted podcasts on mathematics. (via BoingBoing)
Comment |

Data is the real business model for social

IBM's Marie Wallace on the unrealized potential of social data.

As social media websites gather ever-growing data stores, they might be better served by finding ways to make profitable use of that data instead serving ads as their chief means of raising revenue. While the data might give them the information they need to serve more targeted ads — although in my experience they still have a ways to…
Read Full Post | Comment: 1 |