- DSPL: DataSet Publishing Language (Google Code) — a representation language for the data and metadata of datasets. Datasets described in this format can be processed by Google and visualized in the Google Public Data Explorer. XML metadata on CSV, geo-enabled, with linkable data. (via Michal Migurski on Delicious)
- Why is Evidence So Hard for Politicians — Ben Goldacre nails how politicians go about “evidence-based policy making”: So the Minister has cherry picked only the good findings, from only one report, while ignoring the peer-reviewed literature. Most crucially, he cherry-picks findings he likes whilst explicitly claiming that he is fairly citing the totality of the evidence from a thorough analysis. I can produce good evidence that I have a magical two-headed coin, if I simply disregard all the throws where it comes out tails.
- Celery: Distributed Task Queue — asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as well. MIT-style licensed, written in Python, RabbitMQ is the recommended message broker. (via Joshua Schachter on Delicious)
- pixelfari — Safari hacked to look like it’s running on an 8-bit computer. This sense of playfulness with the medium is something I love about the best coders. They think “ha, wouldn’t it be funny if …” and then can make it happen.
ENTRIES TAGGED "policy"
Data Sets, Data-driven Policy, Task Queues, and 8-Bit Browser
Stemming Demo, Mapping Service, Value of Data, and The Magic of the Valley
- Demo of Stemming Algorithms — type in text and see what it looks like when stemmed with different algorithms provided by NLTK. (via zelandiya on Twitter)
- Crowdmap — hosted Ushahidi. (via dvansickle on Twitter)
- Opinions vs Data — talks about the usability of a new gmail UI element, but notable for this quote from Jakob Nielsen: In my two examples, the probability of making the right design decision was vastly improved when given the tiniest amount of empirical data. (via mcannonbrookes on Twitter)
- The Next Silicon Valley — long and detailed list of the many forces contributing to Silicon Valley’s success as tech hub, arguing that the valley’s position is path-dependent and can simply be grown ab initio in some aspiring nation’s co-prosperity zone of policy whim. (via imran and timoreilly on Twitter)
Scientific Literacy, Load Balancing, Indoors Geolocation, and iPhone Security
- The Myth of Scientific Literacy — I’d love it if there was a simple course we could send our elected officials on which would guarantee future science policy would be reliably high quality. Being educated in science (or even “about science”) isn’t going to do it. It’s social connections that will. We need to keep our elected officials honest, constantly check they are applying the evidence we want them to, in the ways we want them to. And if the scientific community want to be listened to, they need to work to build connections. Get political and scientific communities overlapping, embed scientists in policy institutions (and vice versa), get MP’s constituents onside to help foster the sorts of public pressure you want to see: build trust so scientists become people MPs want to be briefed by. (via foe on Twitter)
- Three Papers on Load Balancing (Alex Popescu) — three papers on distributed hash tables.
- Meridian — iPhone app that does in-building location, sample app is the AMNH Explorer which shows you maps of where you are. Uses wifi-based positioning. (via raffi on Twitter)
- Fixing What Apple Won’t — the jailbreakers are releasing security patches for systems that Apple have abandoned. (via ardgedee on Twitter)
Network Neutrality, Open Data, Science Policy, and the Android Army
- A Review of Verizon and Google’s Net Neutrality Proposal (EFF) — a mixture of good and bad, is the verdict. I am ready to give Google credit for getting Network Neutrality back on the regulatory agenda, whether or not this proposal was a strawman.
- Ten Principles for Opening Up Government Information (Sunlight Foundation) — We have updated and expanded upon the Sebastopol list and identified ten principles that provide a lens to evaluate the extent to which government data is open and accessible to the public. The list is not exhaustive, and each principle exists along a continuum of openness. The principles are completeness, primacy, timeliness, ease of physical and electronic access, machine readability, non-discrimination, use of commonly owned standards, licensing, permanence and usage costs.
- What If the Web Really Worked for Science? Reimagining Data Policy and Intellectual Property (video) — a talk by James Boyle on IP and science policy.
- Winners of the Apps for Army Challenge — more Android apps than iPhone in the winners. (via Alex)
Health, Profit, Policy, and Semantic Web Software
- The Men Who Stare at Screens (NY Times) — What was unexpected was that many of the men who sat long hours and developed heart problems also exercised. Quite a few of them said they did so regularly and led active lifestyles. The men worked out, then sat in cars and in front of televisions for hours, and their risk of heart disease soared, despite the exercise. Their workouts did not counteract the ill effects of sitting. (via Andy Baio)
- Caring with Cash — describes a study where “pay however much you want” had high response rate but low average price, “half goes to charity” barely changed from the control (fixed price) response rate, but “half goes to charity and you can pay what you like” earned more money than either strategy.
- Behavioural Economics a Political Placebo? (NY Times) — As policymakers use it to devise programs, it’s becoming clear that behavioral economics is being asked to solve problems it wasn’t meant to address. Indeed, it seems in some cases that behavioral economics is being used as a political expedient, allowing policymakers to avoid painful but more effective solutions rooted in traditional economics. (via Mind Hacks)
- Protege — open source ontology editor and knowledge-base framework.
Being Wrong, Science Malfunding, Touch-screen Libraries, Mining Flickr Photos
- Ira Glass on Being Wrong (Slate) — fascinating interview with Ira Glass on the fundamental act of learning: being wrong. I had this experience a couple of years ago where I got to sit in on the editorial meeting at the Onion. Every Monday they have to come up with like 17 or 18 headlines, and to do that, they generate 600 headlines per week. I feel like that’s why it’s good: because they are willing to be wrong 583 times to be right 17. (via Hacker News)
- Real Lives and White Lies in the Funding of Scientific Research (PLoSBiology) — very clear presentation of the problems with the current funding models of scientific research, where the acknowledged best scientists spend most of their time writing funding proposals. K.’s plight (an authentic one) illustrates how the present funding system in science eats its own seed corn. To expect a young scientist to recruit and train students and postdocs as well as producing and publishing new and original work within two years (in order to fuel the next grant application) is preposterous.
- jQTouch Roadmap — interesting to me is the primary distinction between Sencha and jQTouch, namely that jQT is for small devices (phones) only, while Sencha handles small and large (tablet) touch-screen devices. (via Simon St Laurent)
- Travel Itineraries from Flickr Photo Trails (Greg Linden) — clever idea, to use metadata extracted from Flickr photos (location, time, etc.) to construct itineraries for travellers, saying where to go, how long to spend there, and how long to expect to spend getting from place to place. Another story of the surprise value that can be extracted from overlooked data.
Bioinformatics Myths, Internet Policy, Archivist Tools, Life Visualisations
- The Mythology of Bioinformatics — worth reading this (reprinted from 2002!) separate of hype from history.
- Policy and Internet — new journal, with articles such as The Case Against Mass E-mails: Perverse Incentives and Low Quality Public Participation in U.S. Federal Rulemaking: This paper situates a close examination of the 1000 longest modified MoveOn.org-generated e-mails sent to the Environmental Protection Agency (EPA) about its 2004 mercury rulemaking, in the broader context of online grassroots lobbying. The findings indicate that only a tiny portion of these public comments constitute potentially relevant new information for the EPA to consider. The vast majority of MoveOn comments are either exact duplicates of a two-sentence form letter, or they are variants of a small number of broad claims about the inadequacy of the proposed rule. This paper argues that norms, rules, and tools will emerge to deal with the burden imposed by these communications. More broadly, it raises doubts about the notion that online public participation is a harbinger of a more deliberative and democratic era. (via Jordan at InternetNZ)
- Xena — GPL-licensed Java software from National Archives of Australia, to detect the file formats of “digital objects” and then converting them into open formats for preservation.
- Nebul.us — startup that aggregates and visualises your online activity. In private beta, but there’s a screenshot and brief discussion on Flowing Data.
Finland makes broadband access a right, $7 billion US stimulus for rural broadband improvements
As our economy continues to lose mass in favor of information-based goods (U.S. exports lost 50% of their physical weight per dollar from 1993 to 1999*) and we continue to see the decoupling of workforce from workplace, connectivity is a critical factor in economic exchange and competitive advantage. Countries that build wide, fast networks to the last mile will have a huge leg up. This week gave us two reasons to reconsider the state of broadband connectivity in the US.
In the fall of 2005, the Authors Guild, which then had about 8000 members, and five publishers sued Google for copyright infringement. Many copyright professionals expected the Authors Guild v. Google case to be the most important fair use case of the 21st century. This column argues that the proposed settlement of this lawsuit is a privately negotiated compulsory license primarily designed to monetize millions of orphan works. It will benefit Google and certain authors and publishers, but it is questionable whether the authors of most books in the corpus (the “dead souls” to which the title refers) would agree that the settling authors and publishers will truly represent their interests when setting terms for access to the Book Search corpus.