<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>O&#039;Reilly Radar &#187; Alex Howard</title>
	<atom:link href="http://radar.oreilly.com/alexh/feed" rel="self" type="application/rss+xml" />
	<link>http://radar.oreilly.com</link>
	<description>Insight, analysis, and research about emerging technologies</description>
	<lastBuildDate>Fri, 24 May 2013 12:45:24 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Towards a more open world</title>
		<link>http://radar.oreilly.com/2013/05/towards-a-more-open-world.html</link>
		<comments>http://radar.oreilly.com/2013/05/towards-a-more-open-world.html#comments</comments>
		<pubDate>Wed, 01 May 2013 21:47:00 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Gov 2.0]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=57092</guid>
		<description><![CDATA[Last September, I gave a 5 minute Ignite talk at the tenth Ignite DC. The video just became available. My talk, embedded below, focused on what I&#8217;ve been writing about here at Radar for the past three years: open government, &#8230; ]]></description>
				<content:encoded><![CDATA[<p>Last September, I gave a 5 minute <a href="http://igniteshow.com">Ignite</a> talk at the tenth <a href="http://www.ignite-dc.com/">Ignite DC</a>. The video just became available. My talk, embedded below, focused on what I&#8217;ve been writing about here at Radar for the past three years: open government, journalism, media, mobile technology and more.</p>
<p><code><iframe src="http://blip.tv/play/g9M1g4eYZgI.x?p=1" height="350" width="600" frameborder="0"></iframe></code></p>
<p>The 20 slides that I used for the Ignite were a condensed version of a much longer presentation I&#8217;d created for a talk on open data and journalism in Moldova, also I&#8217;ve embedded below.</p>
<p><iframe src="http://www.slideshare.net/slideshow/embed_code/12921799" height="356" width="427" frameborder="0"></iframe></p>
<div style="margin-bottom: 5px"></div>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/05/towards-a-more-open-world.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linking open data to augmented intelligence and the economy</title>
		<link>http://radar.oreilly.com/2013/04/linking-open-data-to-augmented-intelligence-and-the-economy.html</link>
		<comments>http://radar.oreilly.com/2013/04/linking-open-data-to-augmented-intelligence-and-the-economy.html#comments</comments>
		<pubDate>Tue, 30 Apr 2013 17:33:53 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Gov 2.0]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[midata]]></category>
		<category><![CDATA[ODI]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[open data economy]]></category>
		<category><![CDATA[personal data]]></category>
		<category><![CDATA[smart disclosure]]></category>
		<category><![CDATA[Web science]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=57072</guid>
		<description><![CDATA[After years of steady growth, open data is now entering into public discourse, particularly in the public sector. If President Barack Obama decides to put the White House&#8217;s long-awaited new open data mandate before the nation this spring, it will &#8230; ]]></description>
				<content:encoded><![CDATA[<p>After years of steady growth, open data is now entering into public discourse, particularly in the public sector. If President Barack Obama decides to put the White House&#8217;s long-awaited <a href="http://www.federalnewsradio.com/519/3182982/White-House-to-mandate-machine-readable-open-data">new open data mandate</a> before the nation this spring, it will finally enter the mainstream.</p>
<p>As more governments, businesses, media organizations and institutions adopt open data initiatives, interest in the evidence behind  release and the outcomes from it is similarly increasing. High hopes abound in many sectors, from development to energy to health to safety to transportation.</p>
<p>&#8220;Today, the digital revolution fueled by open data is starting to do for the modern world of agriculture what the industrial revolution did for agricultural productivity over the past century,&#8221; said Secretary of Agriculture Tom Vilsack, speaking at the <a href="http://blogs.usda.gov/2013/04/30/open-agricultural-data-at-your-fingertips/">G-8 Open Data for Agriculture Conference</a>.</p>
<p>As other countries consider releasing their public sector information as data and machine-readable formats onto the Internet, they&#8217;ll need to consider and learn from <a href="http://strata.oreilly.com/2012/06/uk-cabinet-office-relaunches-d.html">years of effort at data.gov.uk</a>, <a href="http://strata.oreilly.com/2011/12/data-gov-open-source.html">data.gov</a> in the United States, and <a href="http://radar.oreilly.com/2013/04/developing-caribbean-jamaica-open-data.html">Kenya</a> in Africa.</p>
<p><a href="http://radar.oreilly.com/2013/04/linking-open-data-to-augmented-intelligence-and-the-economy.html/nigel_shadbolt" rel="attachment wp-att-57073"><img class="alignright size-full wp-image-57073" alt="nigel_shadbolt" src="http://s.radar.oreilly.com/wp-files/2/2013/04/nigel_shadbolt.jpg" width="380" height="270" /></a>One of the crucial sources of analysis for the success or failure of open data efforts will necessarily be research institutions and academics. That&#8217;s precisely why research from the <a href="http://radar.oreilly.com/2013/02/gavin-starks-open-data-institute.html">Open Data Institute</a> and Professor <a href="http://users.ecs.soton.ac.uk/nrs/">Nigel Shadbolt</a> (<a href="https://twitter.com/Nigel_Shadbolt">@Nigel_Shadbolt</a>) will matter in the months and years ahead.</p>
<p>In the following interview, Professor Shadbolt and I discuss what lies ahead. His responses were lightly edited for content and clarity.<br />
<span id="more-57072"></span></p>
<h2>How does your research on artificial intelligence (AI) relate to open data?</h2>
<p>AI has always fascinated me. The quest for understanding what makes us smart and how we can make computers smart has always engaged me. While we’re trying to understand the principles of human intelligence and build a &#8220;brain in a box, smarter robots&#8221; or better speech processing algorithms, the world’s gone and done a different kind of AI: augmented intelligence. The web, with billions of human brains, has a new kind of collective and distributive capability that we couldn’t even see coming in AI. A number of us have coined a phrase, &#8220;Web science,&#8221; to understand the Web at a systems level, much as we do when we think about human biology. We talk about &#8220;systems biology&#8221; because there are just so many elements: technical, organizational, cultural.</p>
<p>The Web really captured my attention ten years ago as this really new manifestation of collective problem-solving. If you think about the link into earlier work I’d done, in what was called &#8220;knowledge engineering&#8221; or knowledge-based systems, there the problem was that all of the knowledge resided on systems on people’s desks. What the web has done is finish this with something that looks a lot like a supremely distributed database. Now, that distributed knowledge base is one version of the Semantic Web. The way I got into open data was the notion of using linked data and semantic Web technologies to integrate data <em>at scale</em> across the web &#8212; and one really high value source of data is open government data.<em><br />
</em></p>
<h2>What was the reason behind the founding and funding of the <a href="http://radar.oreilly.com/2013/02/gavin-starks-open-data-institute.html">Open Data Institute</a> (ODI)?</h2>
<p>The open government data piece originated in work I did in 2003 and 2004. We were looking at this whole idea of putting new data-linking standards on the Web. I had a project in the United Kingdom that was working with government to show the opportunities to use these techniques to link data. As in all of these things, that work was reported to Parliament. There was real interest in it, but not really top-level heavy &#8220;political cover&#8221; interest. Tim Berners-Lee’s engagement with the previous prime minister led to Gordon Brown appointing Tim and I to look at setting up data.gov.uk, getting data released and then the current coalition government taking that forward.</p>
<p>Throughout this time, Tim and I have been arguing that we could really do with a central focus, an institute whose principal motivation was working out how we could find real value in this data. The ODI does exactly that. It’s got about $16 million of public money over five years to incubate companies, build capacity, train people, and ensure that the public sector is supplying high quality data that can be consumed. The fundamental idea is that you ensure high quality supply by generating a strong demand side. The good demand side isn’t just public sector, it’s also the private sector.</p>
<h2>What have we learned so far about what works and what doesn’t? What are the strategies or approaches that have some evidence behind them?</h2>
<p>I think there are some clear learnings. One that I’ve been banging on about recently has been that yes, it really does matter to turn the dial so that governments have a presumption to publish non-personal public data. If you would publish it anyway, under a Freedom of Information request or whatever your local legislative equivalent is, why aren’t you publishing it anyway as open data? That, as a behavioral change. is a big one for many administrations where either the existing workflow or culture is, “Okay, we collect it. We sit on it. We do some analysis on it, and we might give it away piecemeal if people ask for it.” We should construct publication process from the outset to presume to publish openly. That’s still something that we are two or three years away from, working hard with the public sector to work out how to do and how to do properly.</p>
<p>We’ve also learned that in many jurisdictions, the amount of [open data] expertise within administrations and within departments is slight. There just isn’t really the skillset, in many cases. for people to know what it is to publish using technology platforms. So there’s a capability-building piece, too.</p>
<p>One of the most important things is it’s not enough to just put lots and lots of datasets out there. It would be great if the &#8220;presumption to publish&#8221; meant they were all out there anyway &#8212; but when you haven’t got any datasets out there and you’re thinking about where to start, the tough question is to say, “How can I publish data that matters to people?”</p>
<p>The data that matters is revealed in the fact that if we look at the download stats on these various UK, US and other [open data] sites. There&#8217;s a very, very distinctive parallel curve. Some datasets are very, very heavily utilized. You suspect they have high utility to many, many people. Many of the others, if they can be found at all, aren’t being used particularly much. That’s not to say that, under that long tail, there isn’t large amounts of use. A particularly arcane open dataset may have exquisite use to a small number of people.</p>
<p>The real truth is that it’s easy to republish your national statistics. It’s much harder to do a serious job on publishing your spending data in detail, publishing police and crime data, publishing educational data, publishing actual overall health performance indicators. These are tough datasets to release. As people are fond of saying, it holds politicians’ feet to the fire. It’s easy to build a site that’s full of stuff &#8212; but does the stuff actually <em>matter</em>? And does it have any economic utility?</p>
<h2>Page views and traffic aren&#8217;t ideal metrics for measuring success for an open data platform. What should people measure, in terms of actual outcomes in citizens’ lives? Improved services or money saved? Performance or corrupt politicians held accountable? Companies started or new markets created?</h2>
<p>You’ve enumerated some of them. It’s certainly true that one of the challenges is to instrument the effect or the impact. Actually, it’s the last thing that governments, nation states, regions or cities who are enthused to do this thing do. It’s quite hard.</p>
<p>Datasets, once downloaded, may then be virally reproduced all over the place, so that you don’t notice it from a government site. One of the requirements in most of the open licensing which is so essential to this effort is usually has a requirement for essential attribution. Those licenses should be embedded in the machine readable datasets themselves. Not enough attention is paid to that piece of process, to actually noticing when you’re looking at other applications, other data and publishing efforts, that attribution is there. We should be smarter about getting better sense from the attribution data.</p>
<p>The other sources of impact, though: How do you evidence actual internal efficiencies and internal government-wide benefits of open data? I had an interesting discussion recently, where the department of IT had said, “You know, I thought this was all stick and no carrot. I thought this was all in overhead, to get my data out there for other people’s benefits, but we’re now finding it so much easier to re-consume our own data and repurpose it in other contexts that it’s taken a huge amount of friction out of our own publication efforts.”</p>
<p>Quantified measures would really help, if we had standard methods to notice those kinds of impacts. Our economists, people whose impact is around understanding where value is created, really haven’t embraced open markets, particularly open data markets, in a very substantial way. I think we need a good number of capable economists pilling into this, trying to understand new forms of value and what the values are that are created.</p>
<p>I think a lot of the traditional models don’t stand up here. Bizarrely, it’s much easier to measure impact when information scarcity exists and you have something that I don’t, and I have to pay you a certain fee for that stuff. I can measure that value. When you’ve taken that asymmetry out, when you’ve made open data available more widely, what are the new things that flourish? In some respects, you’ll take some value out of the market, but you’re going to replace it by wider, more distributed, capable services. This is a key issue.</p>
<p>The ODI will certainly be commissioning and is undertaking work in this area. We <a href="http://radar.oreilly.com/2013/02/roi-open-data-economy-value.html">published a piece of work jointly with Deloitte</a> in London, looking at evidence-linked methodology.</p>
<h2>You mentioned the demand-side of open data. What are you learning in that area &#8212; and what&#8217;s being done?</h2>
<p>There’s an interesting tension here. If we turn the dial in the governmental mindset to the &#8220;presumption to publish&#8221; &#8212; and in the UK, our public data principles actually <em>embrace</em> that as government policy &#8212; you are meant to publish unless there’s an issue in personal information or national security why you would not. In a sense, you say, “Well, we just publish everything out there? That’s what we’ll do. Some of it will have utility, and some of it won’t.”</p>
<p>When the Web took off, and you offered pages as a business or an individual, you didn’t foresee the link-making that would occur. You didn’t foresee that PageRank would ultimately give you a measure of your importance and relevance in the world and could even be monetized after the fact. You didn’t foresee that those pages have their own essential network effect, that the more pages there are that interconnect, that there’s value being created out of it and so there&#8217;s is a strong argument [for publishing them].</p>
<p>So, you know, just publish. In truth, the demand side is an absolutely great and essential test of whether actually [publishing data] does matter.</p>
<p>Again, to take the Web as an analogy, large amounts of the Web are unattended to, neglected, and rot. It’s just stuff nobody cares about, actually. What we’re seeing in the open data effort in the UK is that it’s clear that some data is very privileged. It’s at the center of lots of other datasets.</p>
<p>In particular, [data about] location, occurrence, and when things occurred, and stable ways of identifying those things which are occurring. Then, of course, the data space that relates to companies, their identifications, the contracts they call, and the spending they engage in. That is the meat and drink of business intelligence apps all across the planet. If you started to turn off an ability for any business intelligence to access legal identifiers or business identifiers, all sorts of oversight would fall apart, apart from anything else.</p>
<p>The demand side [of open data] can be characterized. It’s not just economic. It will have to do with transparency, accountability and regulatory action. The economic side of open data gives you huge room for maneuver and substantial credibility when you can say, “Look, this dataset of spending data in the UK, published by local authorities, is the subject of detailed analytics from companies who look at all data about how local authorities and governments are spending their data. They sell procurement analysis insights back to business and on to third parties and other parts of the business world, saying &#8216;This is the shape of how the UK PLC is buying.&#8217;&#8221;</p>
<h2>What are some of the lessons we can learn from how the World Wide Web grew and the value that it’s delivered around the world?</h2>
<p>That’s always a worry, that, in some sense, the empowered get more powerful. What we do see is that, in open data in particular, new sorts of players couldn&#8217;t enter the game at all.</p>
<p>My favorite example is in mass transportation. In the UK, we have to fight quite hard to get some of the data from bus, rail and other forms of transportation made openly available. Until that was done, there was a pretty small number of supplies from this market.</p>
<p>In London, where all of it was made available from the Transport for London Authority, there’s just been an explosion of apps and businesses who are giving you subtly and distinct experiences as users of that data. I’ve got about eight or nine apps on my phone that give me interestingly distinctive views of moving about the city of London. I couldn’t have predicted or anticipated many of those exist.</p>
<p>I’m sure the companies who held that data could’ve spent large amounts of money and still not given me anything like the experience I now have. The flood of innovation around the data has really been significant and many, many more players and stakeholders in that space.</p>
<p>The Web taught us that serendipitous reuse, where you can’t anticipate where the bright idea comes from, is what is so empowering. The flipside of that is that it also reveals that, in some cases, the data isn’t necessarily of a quality that you might’ve thought. This effort might allow for civic improvement or indeed, business improvement in some cases, where businesses come and improve the data the state holds.</p>
<h2>What’s happening in the UK with the so-called &#8220;<a href="http://www.guardian.co.uk/money/2012/nov/16/midata-consumer-power-see-spending-pattern">MiData Initiative</a>,&#8221; which posits that people have a right to access and use <a href="http://radar.oreilly.com/2012/04/what-is-smart-disclosure.html">personal data disclosed</a> to them?</h2>
<p>I think this is every bit as potentially disruptive and important as open government data. We’re starting to see the emergence of what we might think of as a new class of important data, &#8220;<a href="http://www.weforum.org/reports/personal-data-emergence-new-asset-class">personal assets</a>.&#8221;</p>
<p>People have talked about &#8220;<a href="http://en.wikipedia.org/wiki/Personal_information_management">personal information management</a> systems&#8221; for a long time now. Frequently, it’s revolved around managing your calendar or your contact list, but it’s much deeper. Imagine that you, the consumer, or you, the citizen, had a central locus of authority around data that was relevant to you: consumer data from retail, from the banks that you deal with, from the telcos you interact with, from the utilities you get your gas, water and electricity from. Imagine if that data infosphere was something that you could access easily, with a right to reuse and redistribute it as you saw fit.</p>
<p>The canonical example, of course, is <a href="http://strata.oreilly.com/2012/06/mhealth-healthdata-ehealth-innovation-opendata.html">health data</a>. It isn’t all data that business holds, it’s also data the <em>state</em> holds, like your health records, educational transcript, welfare, tax, or any number of areas.</p>
<p>In the UK, we’ve been working towards empowering consumers, in particular through this MiData program. We’re trying to get to a place where consumers have a right to data held about their transactions by businesses, [released] back to them in a reusable and flexible way. We’ve been working on a voluntary program in this area for the last year. We have a consultation on taking up power to require large companies to give that information back. There is a commitment to the UK, for the first time, to get health records back to patients as data they control, but I think it has to go much more widely.</p>
<p><a href="http://radar.oreilly.com/2013/02/smart-disclosure-personal-data-ownership.html">Personal data is a natural complement to open data</a>. Some of the most interesting applications I’m sure we’re going to see in this area are where you take your personal data and enrich it with open data relating to businesses, the services of government, or the actual trading environment you’re in. In the UK, we’ve got six large energy companies that compete to sell energy to you.</p>
<p>Why shouldn’t groups and individuals be able to get together and collectively purchase in the same way that corporations can purchase and get their discounts? Why can’t individuals be in a spot market, effectively, where it’s easy to move from one supplier to another? Along with those efficiencies in the market and improvements in service delivery, it’s about <a href="http://hbr.org/2013/01/smarter-information-smarter-consumers/ar/1">empowering consumers</a> at the end of the day.</p>
<p><em>This post is part of our ongoing series on the <a href="http://radar.oreilly.com/tag/data-economy">open data economy</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/04/linking-open-data-to-augmented-intelligence-and-the-economy.html/feed</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>White House Science Fair praises future scientists and makers</title>
		<link>http://radar.oreilly.com/2013/04/white-house-science-fair-makers.html</link>
		<comments>http://radar.oreilly.com/2013/04/white-house-science-fair-makers.html#comments</comments>
		<pubDate>Mon, 29 Apr 2013 20:06:14 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[citizen science]]></category>
		<category><![CDATA[education]]></category>
		<category><![CDATA[learning]]></category>
		<category><![CDATA[MakerFaire]]></category>
		<category><![CDATA[makers]]></category>
		<category><![CDATA[making]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[Science fair]]></category>
		<category><![CDATA[white house]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=57043</guid>
		<description><![CDATA[There are few ways to better judge a nation&#8217;s character than to look at how its children are educated. What values do their parents, teachers and mentors demonstrate? What accomplishments are celebrated? In a world where championship sports teams are &#8230; ]]></description>
				<content:encoded><![CDATA[<p>There are few ways to better judge a nation&#8217;s character than to look at how its children are educated. What values do their parents, teachers and mentors demonstrate? What accomplishments are celebrated? In a world where championship sports teams are idolized and superstar athletes are feted by the media, it was gratifying to see science, students and teachers get their moment in the sun at the White House last week. </p>
<p>&#8220;&#8230;one of the things that I’m concerned about is that, as a culture, we’re great consumers of technology, but we’re not always properly respecting the people who are in the labs and behind the scenes creating the stuff that we now take for granted,&#8221; said President Barack Obama, &#8220;and we’ve got to give the millions of Americans who work in science and technology not only the kind of respect they deserve but also new ways to engage young people.&#8221;</p>
<div id="attachment_57045" class="wp-caption aligncenter" style="width: 610px"><img src="http://s.radar.oreilly.com/wp-files/2/2013/04/white-house-science-fair.jpg" alt="President Obama at White House Science Fair" width="600" height="450" class="size-full wp-image-57045" />
<p class="wp-caption-text">President Barack Obama talks with Evan Jackson, 10, Alec Jackson, 8, and Caleb Robinson, 8, from McDonough, Ga., at the 2013 White House Science Fair in the State Dining Room. (Official White House Photo by Chuck Kennedy)</p>
</div>
<p>An increasingly fierce global competition for talent and natural resources has put a premium on developing scientists and engineers in the nation&#8217;s schools. (On that count, last week, the President announced a <a>plan</a> to <a href="http://www.nytimes.com/2013/04/23/us/politics/obama-promotes-science-careers-at-white-house-fair.html">promote careers in the sciences</a> and expand federal and private-sector initiatives to encourage students to study STEM. </p>
<p>&#8220;America has always been about discovery, and invention, and engineering, and science and evidence,&#8221; said the President, last week. &#8220;That&#8217;s who we are.  That&#8217;s in our DNA.  That&#8217;s how this country became the greatest economic power in the history of the world.  That&#8217;s how we’re able to provide so many contributions to people all around the world with our scientific and medical and technological discoveries.&#8221;</p>
<p><span id="more-57043"></span></p>
<p>Unfortunately, the role models that far too much of the media hold up for young people are all too frequently pulled from the stage, screen and playing fields, as opposed to laboratories, universities and schools. </p>
<p>In recent years, the success of technology entrepreneurs has shifted that dynamic, but in the American academy, <a href="http://www.nytimes.com/2012/01/22/education/edlife/how-big-time-sports-ate-college-life.html?pagewanted=all&amp;_r=0">big time sports have been eating college life</a>, with huge stadiums and rallies for stars and comparatively little notice given to National Merit awards or fellowship winners.  When the President said in 2012 that science, technology, engineering and math (STEM) education and young people&#8217;s scientific achievements don&#8217;t belong in the back pages of newspapers, his media criticism was quietly scathing: a culture of celebrity is not geared to the more quiet, sustained achievement required to attain a graduate degree or patent, though both may have more enduring value to society than a pop album.</p>
<p>This is a dynamic that clearly troubles President Obama, and one that he has used the bully pulpit and the platform of the White House to drawn national attention towards over the past five years.</p>
<p>&#8220;If you win the NCAA championship, you come to the White House,&#8221; <a href="http://www.whitehouse.gov/the-press-office/remarks-president-education-innovate-campaign">said the President</a>, in 2009. &#8220;Well, if you&#8217;re a young person and you&#8217;ve produced the best experiment or design, the best hardware or software, you ought to be recognized for that achievement, too.  Scientists and engineers ought to stand side by side with athletes and entertainers as role models, and here at the White House we&#8217;re going to lead by example.&#8221;</p>
<p>In the years since, the White House has tried to carry through on that pledge, hosting three science fairs and involving national leaders in science education, including Bill Nye,&#8221;The Science Guy,&#8221; Reading Rainbow host Levar Burton, and astrophysicist Neil deGrasse Tyson. </p>
<p><a href="http://radar.oreilly.com/2013/04/white-house-science-fair-makers.html/neil-tyson" rel="attachment wp-att-57046"><img src="http://s.radar.oreilly.com/wp-files/2/2013/04/neil-tyson.jpg" alt="neil tyson" width="612" height="612" class="alignright size-full wp-image-57046" /></a></p>
<p>&#8220;The White House Science Fair is a way of showing everyone that science is cool,&#8221; said Tyson, the <a href="http://www.haydenplanetarium.org/tyson/">director of the Hayden Planetarium</a> in New York City, in an interview. Engaging the public about the wonders of the universe and encouraging kids to be curious about how our world works has been a core part of his career, driven by his infectious good humor. </p>
<h2>Making science and technology education more fun</h2>
<p>As Greg Ferenstein noted at TechCrunch, there were some pretty <a href="http://techcrunch.com/2013/04/22/3-awesome-and-inspiring-inventions-from-the-white-house-science-fair/">awesome inventions at the White House science fair</a>, from mind-controlled prosthetics to improved cancer detection methodologies to a bicycle-powered water purification system. You can see a list of the <a href="http://www.whitehouse.gov/the-press-office/2013/04/22/new-details-president-obama-host-white-house-science-fair">White House science fair projects</a> at WhiteHouse.gov and <a href="http://www.youtube.com/watch?v=2HfgGDOZPCQ">watch President Obama tour the exhibits</a> on YouTube.</p>
<p>One of the notable components of the science fairs has been the involvement of kids from the <a href="http://radar.oreilly.com/2011/11/dale-dougherty-make-white-house.html">maker movement</a>. In the summer of 2013, the <a href="http://makered.org/">Maker Education Initiative</a> will host a season-long <a href="https://www.webmaker.org/en-US/party/">Maker Party</a> where students can learn, design and create.</p>
<p>&#8220;We&#8217;re a nation of tinkerers and dreamers and believers in a better tomorrow,&#8221; said President Obama at the 2012 White House Science Fair, recognizing the long-history of creative innovation in American garages, basements and barns.</p>
<blockquote class="twitter-tweet"><p>Photo: President Obama demos @<a href="https://twitter.com/joey_hudy">joey_hudy</a>&#8216;s Extreme Marshmallow Cannon @ the <a href="https://twitter.com/search/%23WHScienceFair">#WHScienceFair</a> <a href="http://t.co/qOPidsnA" title="http://wh.gov/0qf">wh.gov/0qf</a> <a href="http://t.co/a7sxWGLi" title="http://twitter.com/whitehouse/status/167019841998438400/photo/1">twitter.com/whitehouse/sta…</a></p>
<p>&mdash; The White House (@whitehouse) <a href="https://twitter.com/whitehouse/status/167019841998438400">February 7, 2012</a></p></blockquote>
<p>In 2012, <a href="http://oreillyradar.tumblr.com/post/17233197256/raw-video-marshmallow-launch-at-the-white-house">President Obama famously helped young maker Joey Hudy</a> to fire his &#8220;extreme marshmallow cannon.&#8221; 14-year old Ben Hylack, the maker of a telepresence robot, said that <a href="http://whatsnext.blogs.cnn.com/2012/02/08/how-maker-faire-changed-my-life/">Makerfaire changed his life</a>. In 2013, &#8220;Super Awesome Sylvia&#8221; <a href="http://blog.makezine.com/2013/04/21/super-awesome-sylvia-goes-to-white-house-science-fair/">represented Maker Faire at the White House Science Fair</a>, showing the President her watercolor drawbot.</p>
<p>Experimenting with more independent projects that let kids tinker are important but only part of a puzzle that includes parents, teachers and libraries.</p>
<p><a href="http://radar.oreilly.com/2013/04/white-house-science-fair-makers.html/neil-tyson-2" rel="attachment wp-att-57047"><img src="http://s.radar.oreilly.com/wp-files/2/2013/04/neil-tyson1.jpg" alt="neil-tyson" width="612" height="612" class="alignright size-full wp-image-57047" /></a></p>
<p>&#8220;We should be focused on getting kids &#8216;making&#8217;, yes, but that misses a tacit recognition that kids are by nature scientists,&#8221; said Tyson. &#8220;What we should be talking about is how to keep kids interested and get out of their way as they learn.&#8221;</p>
<p>For that to happen, we&#8217;ll need to encourage children to keep asking questions, teach them how to learn to answer them, and praise inquisitive students.</p>
<p>&#8220;Acts of curiosity are what make up acts of science,&#8221; he said. &#8220;Adult scientists are just kids who never grew up.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/04/white-house-science-fair-makers.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sprinting toward the future of Jamaica</title>
		<link>http://radar.oreilly.com/2013/04/developing-caribbean-jamaica-open-data.html</link>
		<comments>http://radar.oreilly.com/2013/04/developing-caribbean-jamaica-open-data.html#comments</comments>
		<pubDate>Thu, 18 Apr 2013 13:00:51 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data journalism]]></category>
		<category><![CDATA[Gov 2.0]]></category>
		<category><![CDATA[government as a platform]]></category>
		<category><![CDATA[hackathon]]></category>
		<category><![CDATA[Jamaica]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[open government]]></category>
		<category><![CDATA[software development]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56884</guid>
		<description><![CDATA[Creating the conditions for startups to form is now a policy imperative for governments around the world, as Julian Jay Robinson, minister of state in Jamaica&#8217;s Ministry of Science, Technology, Energy and Mining, reminded the attendees at the &#8220;Developing the &#8230; ]]></description>
				<content:encoded><![CDATA[<p><a href="http://radar.oreilly.com/2011/07/tech-startups-idea-conditions.html">Creating the conditions for startups</a> to form is now a policy imperative for governments around the world, as Julian Jay Robinson, minister of state in Jamaica&#8217;s Ministry of Science, Technology, Energy and Mining, reminded the attendees at the &#8220;<a href="http://developingcaribbean.org/">Developing the Caribbean</a>&#8221; conference last week in Kingston, Jamaica.</p>
<p><img src="http://s.radar.oreilly.com/wp-files/2/2013/04/photo-221.jpg" alt="photo-22" width="600" height="450" class="aligncenter size-full wp-image-56886" /></p>
<p>Robinson said Jamaica is working on deploying wireless broadband access, securing networks and stimulating tech entrepreneurship around the island, a set of priorities that would have sounded of the moment in Washington, Paris, Hong Kong or Bangalore. He also described open access and <a href="http://www.guardian.co.uk/global-development-professionals-network/2013/apr/16/open-data-governance-jay-naidoo">open data as fundamental parts of democratic governance</a>, explicitly aligning the release of public data with economic development and anti-corruption efforts. Robinson also pledged to help ensure that Jamaica&#8217;s open data efforts would be successful, offering a key ally within government to members of civil society.</p>
<p>The interest in adding technical ability and capacity around the Caribbean was sparked by other efforts around the world, particularly <a href="http://strata.oreilly.com/2011/07/open-kenya-government-data.html">Kenya&#8217;s open government data efforts</a>. That&#8217;s what led the organizers to invite Paul Kukubo to speak about Kenya&#8217;s experience, which Robinson noted might be more relevant to Jamaica than that of the global north. <span id="more-56884"></span></p>
<p>Kukubo, the head of Kenya&#8217;s Information, Communication and Technology Board, was a key player in getting the country&#8217;s open data initiative off the ground and evangelizing it to developers in Nairobi. At the conference, Kukubo gave Jamaicans two key pieces of advice. First, open data efforts must be aligned with national priorities, from reducing corruption to improving digital services to economic development.</p>
<p>&#8220;You can&#8217;t do your open data initiative outside of what you&#8217;re trying to do for your country,&#8221; said Kukubo.</p>
<p>Second, political leadership is essential to success. In Kenya, the president was personally involved in open data, Kukubo said. Now that a new president has been officially elected, however, there are new questions about what happens next, particularly given that pickup in Kenya&#8217;s development community hasn&#8217;t been as dynamic as officials might have hoped. There&#8217;s also a significant issue on the demand-side of open data, with respect to the absence of a Freedom of Information Law in Kenya.</p>
<p>When I asked Kukubo about these issues, he said he expects a Freedom of Information law will be passed this year in Kenya. He also replied that the momentum on open data wasn&#8217;t just about the supply side.</p>
<p>&#8220;We feel that in the usage side, especially with respect to the developer ecosystem, we haven&#8217;t necessarily gotten as much traction from developers using data and interpreting cleverly as we might have wanted to have,&#8221; he said. &#8220;We&#8217;re putting putting more into that area.&#8221;</p>
<p>With respect to leadership, Kukubo pointed out that newly elected Kenyan President Uhuru Kenyatta drove open data release and policy when he was the minister of finance. Kukubo expects him to be very supportive of open data in office.</p>
<p>The development of open data in Jamaica, by way of contrast, has been driven by academia, said professor Maurice McNaughton, director of the Center of Excellence at the Mona School of Business at the University of the West Indies (UWI). The Caribbean Open Institute, for instance, has been working closely with Jamaica&#8217;s Rural Agriculture Development Authority (RADA). There are high hopes that releases of more data from RADA and other Jamaican institutions will improve Jamaica&#8217;s economy and the effectiveness of its government.</p>
<p>Open data could add $35 million annually to the Jamaican economy, said Damian Cox, director of the Access to Information Unit in the Office of the Prime Minister, citing a United Nations estimate. Cox also explicitly aligned open data with measuring progress toward <a href="http://www.un.org/millenniumgoals/">Millennium Development Goals</a>, positing that increasing the availability of data will enable the civil society, government agencies and the UN to more accurately assess success.</p>
<h2>The development of (open) data-driven journalism</h2>
<p>Developing the Caribbean focused on the demand side of open data as well, particularly the role of intermediaries in collecting, cleaning, fact checking, and presenting data, matched with necessary narrative and context. That kind of work is precisely what data-driven journalism does, which is why it was one of the major themes of the conference. I was invited to give an <a href="http://www.slideshare.net/digiphile/data-journalism-overview">overview of data-driven journalism</a> that connected some trends and highlighted the best work in the field.</p>
<p>I&#8217;ve written quite a bit about <a href="http://radar.oreilly.com/2012/12/six-ways-data-journalism-is-making-sense-of-the-world-around-the-world.html">how data-driven journalism is making sense of the world</a> elsewhere, with a <a href="http://radar.oreilly.com/2012/11/investigating-data-journalism.html">report</a> yet to come. What I found in Jamaica is that media there have long since begun experimenting in the field, from the investigative journalism at <a href="http://panoscaribbean.org/home">Panos Caribbean</a> to the relatively recent launch of <a href="http://digjamaica.com/faq">diGJamaica</a> by the <a href="http://jamaica-gleaner.com/">Gleaner Company</a>.</p>
<p>diGJamaica is modeled upon <a href="http://archive.org/stream/handbookjamaica00cundgoog/handbookjamaica00cundgoog_djvu.txt">the Jamaican Handbook</a> and includes more than a million pages from <em>The Gleaner</em> newspaper, going back to 1834. The site publishes <a href="http://digjamaica.com/directories">directories</a> of public entities and <a href="http://digjamaica.com/data">public data</a>, including visualizations. It charges for access to the archives.</p>
<h2>Legends and legacies</h2>
<div id="attachment_56897" class="wp-caption aligncenter" style="width: 610px"><img src="http://s.radar.oreilly.com/wp-files/2/2013/04/photo-21.jpg" alt="Usain Bolt in Jamaica" width="600" height="450" class="size-full wp-image-56897" /><p class="wp-caption-text">Olympic champion <a href="http://en.wikipedia.org/wiki/Usain_Bolt">Usain Bolt</a>, photographed in his (fast) car at the UWI/Usain Bolt Track in Mona, Jamaica.</p></div>
<p>Normally, meeting the fastest man on earth would be the most memorable part of any trip. The moment that left the deepest impression from my journey to the Caribbean, however, came not from encountering Usain Bolt on a run but from within a seminar room on a university campus.</p>
<p>As a member of a panel of judges, I saw dozens of young people present after working for 30 hours at a hackathon at the University of the West Indies. While even the most mature of the working apps was still a prototype, the best of them were squarely focused on issues that affect real Jamaicans: scoring the risk of farmers that needed banking loans and collecting and sharing data about produce.</p>
<p>The winning team created a working mobile app that would enable government officials to collect data at farms. While none of the apps are likely to be adopted by the agricultural agency in its current form, or show up in the Google Play store this week, the experience the teams gained will help them in the future.</p>
<p>As I left the island, the perspective that I&#8217;d taken away from trips to Brazil, Moldova and Africa last year was further confirmed: technical talent and creativity can be found everywhere in the world, along with considerable passion to apply design thinking, data and mobile technology to improve the societies people live within. This is innovation that matters, not just clones of popular social networking apps &mdash; though the judges saw more than a couple of those ideas flow by as well.</p>
<p>In the years ahead, Jamaican developers will play an important role in media, commerce and government on the island. If attracting young people to engineering and teaching them to code is the long-term legacy of efforts like Developing the Caribbean, it will deserve its own thumbs up from Mr. Bolt. The track to that future looks wide open.</p>
<p><img src="http://s.radar.oreilly.com/wp-files/2/2013/04/photo-23.jpg" alt="photo-23" width="600" height="450" class="aligncenter size-full wp-image-56888" /></p>
<p><em>Disclosure: the cost of my travel to Jamaica was paid for by the organizers of the Developing the Caribbean conference.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/04/developing-caribbean-jamaica-open-data.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Predictive analytics and data sharing raise civil liberties concerns</title>
		<link>http://radar.oreilly.com/2013/04/predictive-big-data-analytics-privacy.html</link>
		<comments>http://radar.oreilly.com/2013/04/predictive-big-data-analytics-privacy.html#comments</comments>
		<pubDate>Thu, 11 Apr 2013 15:00:42 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[CISPA]]></category>
		<category><![CDATA[data sharing]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[terrorism]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56803</guid>
		<description><![CDATA[Last winter, around the same time there was a huge row in Congress over the Cyber Intelligence Sharing and Protection Act (CISPA), U.S. Attorney General Holder quietly signed off on expanded rules on government data sharing. The rules allowed the National &#8230; ]]></description>
				<content:encoded><![CDATA[<p>Last winter, around the same time there was a huge row in Congress over the Cyber Intelligence Sharing and Protection Act (<a href="http://radar.oreilly.com/2012/04/cispa-legislation-cyber-security.html">CISPA</a>), U.S. Attorney General Holder quietly signed off on <a href="http://online.wsj.com/article/SB10001424127887324478304578171623040640006.html">expanded rules on government data sharing</a>. The rules allowed the National Counterterrorism Center (NCTC), housed within the Department of Homeland Security, to analyze the regulatory data collected during the business of government for patterns relevant to domestic terrorist threats.</p>
<p>Julia Angwin, <a href="http://online.wsj.com/article/SB10001424127887324478304578171623040640006.html">who reported the story for the Wall Street Journal</a>, highlighted the key tension: the rules allow the NCTC to &#8220;examine the government files of U.S. citizens for possible criminal behavior, even if there is no reason to suspect them.&#8221; </p>
<p>On the one hand, this is a natural application of big data: search existing government records collected about citizens for suspicious patterns of behavior. The action can be justified for counter-terrorism purposes: there are advanced persistent threats. (When national security is invoked, privacy concerns are often deprecated.) The failure to &quot;connect the dots&quot; using existing data across government on Christmas Day 2009 (remember the so-called &quot;underwear bomber?&quot;) added impetus to getting more data in the NCTC&#8217;s hands. It&#8217;s possible that the rules on data retention were extended five years because the agency didn&#8217;t have the capabilities  it needed. Data mining existing records offers unprecedented opportunities to find and detect terrorism plots before they happen. </p>
<p>On the other hand, the changes at the NCTC that were authorized back in March 2012 represent a massive data grab with far-reaching consequences. The changes received little public discussion prior to the WSJ breaking the story, and they seem to substantially override the purpose of the Federal Privacy Act that Congress passed in 1974. Extension of the rules happened without public debate because of what effectively amounts to a legal loophole. Post proposed changes to the Federal Register, voila. Effectively, this looks like an end run around the Federal Privacy Act. <span id="more-56803"></span></p>
<p>Here&#8217;s the rub: according to Angwin, DoJ Chief Privacy Officer Nancy Libin:</p>
<blockquote><p>&#8220;&#8230; raised concerns about whether the guidelines could unfairly target innocent people, these people said. Some research suggests that, statistically speaking, there are too few terror attacks for predictive patterns to emerge. The risk, then, is that innocent behavior gets misunderstood &mdash; say, a man buying chemicals (for a child&#8217;s science fair) and a timer (for the sprinkler) sets off false alarms. An August government report indicates that, as of last year, NCTC wasn&#8217;t doing predictive pattern-matching.&#8221;</p>
</blockquote>
<p>It&#8217;s hard to say whether predictive data analytics are now in use at NCTC. It would be surprising if there isn&#8217;t pressure to experiment, given the expansion of &#8220;predictive policing&#8221; in cities around the U.S.. There stand to be significant, long-lasting repercussions if the center builds capacity to apply that capability at large scale without great care and informed Congressional oversight. </p>
<p>One outcome is a dystopian scenario straight out of science fiction, from &#8220;thoughtcrime&#8221; to presumptions of guilt.  Alistair Croll highlighted some of the associated issues involved with <a href="http://radar.oreilly.com/2012/08/follow-up-on-big-data-and-civil-rights.html">big data and civil rights</a> last year.</p>
<p>As Angwin pointed out, the likelihood of a terrorist attack in the U.S. remains low as compared to other risks Americans face every day from traffic, bees or lifestyle decisions. After 9/11, however, public officials and Congress have had little risk tolerance. As a result, vast, expensive intelligence and surveillance infrastructure in the U.S. has been massively expanded, with limited oversight and very little accountability, as documented in &#8220;<a href="http://projects.washingtonpost.com/top-secret-america/">Top Secret America</a>.&#8221;</p>
<p>When intelligence officials have gone public to the press as whistle-blowers regarding overspending, they have been prosecuted. Former National Security Agency staffer <a href="http://www.newyorker.com/reporting/2011/05/23/110523fa_fact_mayer?currentPage=all">Thomas Drake</a> spoke at the 2011 Web 2.0 Summit about his experience. We talked about it in a subsequent interview, below:</p>
<p><iframe width="620" height="349" src="http://www.youtube.com/embed/AgtPE_X_GhQ?feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>The new rules have been in place now for months, with little public comment upon the changes. (Even after it was relaunched, the nation doesn&#8217;t seem to be reading the Federal Register. These days, I&#8217;m not sure how many members of the DC media do, either.) I&#8217;m unsure whether it&#8217;s fair to blame the press, though I do wonder how media resources were allocated during the &#8220;horse race&#8221; of the presidential campaigns last year. Now, the public is left to hope that the government oversees itself effectively behind closed doors. </p>
<p>I would find a recent &#8220;<a href="http://www.dhs.gov/blog/2013/04/02/securing-cyberspace-while-protecting-privacy-and-civil-liberties">commitment to privacy and civil liberties</a>&#8221; by the Department of Homeland Security more convincing if the agency wasn&#8217;t <a href="http://www.aclu.org/blog/technology-and-liberty-immigrants-rights-national-security/aclu-files-foia-request-unreleased">confiscating and searching electronic devices at the border</a> without a warrant. </p>
<p>Does anyone think that the privacy officers whose objections were overruled in the internal debates will provide the effective counterweight <a href="http://radar.oreilly.com/2012/07/digital-bill-of-rights-internet-freedom.html">protecting the Bill of Rights</a> will require in the years to come?</p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/04/predictive-big-data-analytics-privacy.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Sensoring the news</title>
		<link>http://radar.oreilly.com/2013/03/sensor-journalism-data-journalism.html</link>
		<comments>http://radar.oreilly.com/2013/03/sensor-journalism-data-journalism.html#comments</comments>
		<pubDate>Fri, 22 Mar 2013 13:00:54 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[civic media]]></category>
		<category><![CDATA[data journalism]]></category>
		<category><![CDATA[journalism]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[sensor journalism]]></category>
		<category><![CDATA[sxswi]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56375</guid>
		<description><![CDATA[When I went to the 2013 SXSW Interactive Festival to host a conversation with NPR&#8217;s Javaun Moradi about sensors, society and the media, I thought we would be talking about the future of data journalism. By the time I left &#8230; ]]></description>
				<content:encoded><![CDATA[<p>When I went to the 2013 SXSW Interactive Festival to host a conversation with NPR&#8217;s <a href="http://twitter.com/javaun">Javaun Moradi</a> about sensors, society and the media, I thought we would be talking about the future of data journalism. By the time I left the event, I&#8217;d learned that <a href="http://tumblr.thefjp.org/post/34704405146/hurricane-sandy-and-the-possibilities-of-sensor">sensor journalism</a> had long since arrived and been applied. Today, inexpensive, easy-to-use <a href="http://www.niemanlab.org/2013/03/the-cicadas-are-coming-wnycs-tracker-is-the-latest-sign-of-the-rise-of-sensor-news-networks/">open source hardware is making it easier for media outlets to create data</a> themselves.</p>
<p>&#8220;Interest in sensor data has grown dramatically over the last year,&#8221; said Moradi. &#8220;Groups are experimenting in the areas of environmental monitoring, journalism, human rights activism, and civic accountability.&#8221; His post on what <a href="http://javaunmoradi.com/blog/2011/12/16/what-do-open-sensor-networks-mean-for-journalism/">sensor networks mean for journalism</a> sparked our collaboration after we connected in December 2011 about how data was being used in the media. </p>
<div id="attachment_56376" class="wp-caption aligncenter" style="width: 510px"><a href="http://hosted.ap.org/specials/interactives/_international/oly_fea_pollution/index.html"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/beijing-air-quality.jpg" alt="AP Beijing Air Quality graphic" width="500" height="373" class="size-full wp-image-56376" /></a><p class="wp-caption-text">Associated Press visualization of Beijing air quality. <a href="http://hosted.ap.org/specials/interactives/_international/oly_fea_pollution/index.html">See related feature</a>.</p></div>
<p>At a SXSW panel on &#8220;<a href="http://schedule.sxsw.com/2013/events/event_IAP4679">sensoring the news</a>,&#8221; Sarah Williams, an assistant professor at MIT, described how the Spatial Information Design Lab At Columbia University* had partnered with the Associated Press to independently measure air quality in Beijing.</p>
<p>Prior to the 2008 Olympics, the coaches of the Olympic teams had expressed serious concern about the impact of air pollution on the athletes. That, in turn, put pressure on the Chinese government to take substantive steps to improve those conditions. While the Chinese government released an index of air quality, explained Williams, they didn&#8217;t explain what went into it, nor did they provide the raw data.</p>
<p>The <a href="http://www.civicdatadesignproject.org/BEIJING-AIR-TRACKS">Beijing Air Tracks project</a> arose from the need to determine what the conditions on the ground really were. AP reporters carried sensors connected to their cellphones to detect particulate and carbon monoxide levels, enabling them to report air quality conditions back in real-time as they moved around the Olympic venues and city. <span id="more-56375"></span></p>
<p>The sensor data helped the AP measure the effect of policy decisions that the Chinese government made, said Williams, from closing down factories to widespread shutdowns of different kinds of industries. The results from the sensor journalism project, which showed a decrease in particulates but conditions 12 to 25 times worse than New York City on certain days, were published as an <a href="http://hosted.ap.org/specials/interactives/_international/oly_fea_pollution/index.html">interactive data visualization</a>.</p>
<div id="attachment_56380" class="wp-caption aligncenter" style="width: 610px"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/beijing-air-quality-days-compared.jpg" alt="AP Beijing mash-up of particulate levels and photography in Beijing." width="600" height="307" class="size-full wp-image-56380" /><p class="wp-caption-text">Associated Press mashup of particulate levels and photography at the Olympic stadium in Beijing over time.</p></div>
<p>This AP project is a prime example of how sensors, data journalism, and old-fashioned, on-the-ground reporting can be combined to shine a new level of accountability on official reports. It won&#8217;t be the last time this happens, either. Around the world, from the Amazon to Los Angeles to Japan, sensor data is now being put to use by civic media and journalists.</p>
<h2>Sensing civic media</h2>
<p>There are an increasing number of sensors in our lives, said John Keefe, a <a href="http://strata.oreilly.com/2012/05/profile-of-the-data-journalist-10.html">data news editor</a> for WNYC, speaking at his SXSW panel in Austin. From the physical sensors in smartphones to new possibilities built with Arduino or Raspberry Pi hardware, Keefe highlighted how journalists could seize hold of new possibilities.</p>
<p>&#8220;Google takes data from maps and Android phones and creates traffic data,&#8221; Keefe said. &#8220;In a sense, that&#8217;s sensor data being used live in a public service. What are we doing in journalism like that? What could we do?&#8221;</p>
<p>The evolution of <a href="http://safecast.org/">Safecast</a> offers a glimpse of networked accountability, collecting and publishing radiation data through sensors, citizen science and the Internet. The project, which won last year&#8217;s <a href="http://strata.oreilly.com/2012/09/knight-news-challenge-data-winners.html">Knight News Challenge on data</a>, is now building the infrastructure to enable people to help monitor air quality in Los Angeles.</p>
<p>Sensor journalism is also being applied to <a href="http://radar.oreilly.com/2012/12/six-ways-data-journalism-is-making-sense-of-the-world-around-the-world.html">make sense of the world</a> in using remote sensing data and satellite imagery. The director of that project, Gustavo Faleiros, recently described how <a href="https://ijnet.org/stories/journalists-should-crowdsource-environmental-coverage">environmental reporting</a> can be combined with civic media to collect data, with relevant projects in Asia, Africa and the Americas. For instance, Faleiros cited an <a href="http://www.globalproblems-globalsolutions-files.org/unf_website/PDF/vodafone/tech_social_change/Environmental_Conservation_case3.pdf">environmental monitoring project</a> led by Eric Paulos of the University of California at Berkeley&#8217;s Center for New Media, where sensors on taxis were used to gather data in Accra, Ghana.</p>
<p>Another direction that sensor data could be applied lies in social justice and education. At SXSW, Sarah Williams described [<a href="https://dl.dropbox.com/u/466610/SXSWbeijingextra.pdf">slides</a>] how the <a href="http://airqualityegg.com/">Air Quality Egg</a>, an <a href="http://javaunmoradi.com/blog/2012/09/10/lessons-from-the-chicago-airqualityegg-hackathon/">open source hardware device</a>, is being used to make an argument for public improvements. At the <a href="http://www.civicdatadesignproject.org/CYPRESS-HILLS-AIR-QUALITY-CHAQ">Cypress Hills Community School</a>, kids are bringing the eggs home, measuring air quality and putting data online, said Williams.</p>
<div id="attachment_56379" class="wp-caption aligncenter" style="width: 610px"><a href="http://www.civicdatadesignproject.org/CYPRESS-HILLS-AIR-QUALITY-CHAQ"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/air-quality-eggs.jpg" alt="Air Quality Eggs at Cypress Hill Community School" width="600" height="416" class="size-full wp-image-56379" /></a><p class="wp-caption-text">Air Quality Eggs at <a href="http://www.civicdatadesignproject.org/CYPRESS-HILLS-AIR-QUALITY-CHAQ">Cypress Hill Community School</a>.</p></div>
<p>&#8220;Health sensors are useful when they can compare personal real-time data against population-wide data,&#8221; said Nadav Aharony, who also spoke on our panel in Austin.</p>
<p>Aharony talked about how <a href="http://www.behav.io/">Behavio</a>, a startup based upon his <a href="http://radar.oreilly.com/2012/07/social-mri-smartphone-data.html">research on smartphones and data</a> at MIT, has created <a href="http://funf.org/">funf</a>, an open source sensing toolkit for Android devices. Aharony&#8217;s team has now deployed an <a href="http://funf.org/inabox">integration with Dropbox</a> that requires no coding ability to use.</p>
<p>According to Aharony, the One Laptop Per Child project is using funf in tablets deployed in Africa, in areas where there are no schools. Researchers will use funf as a behavioral tool to sense how children are interacting with the devices, including whether tablets are next to one another.</p>
<blockquote class="twitter-tweet"><p>Just set up @<a href="https://twitter.com/openfunf">openfunf</a>&#8216;s Funf in a Box on a phone to send motion+geo data to my @<a href="https://twitter.com/dropbox">dropbox</a> Couldn&#8217;t have been easier. <a href="http://t.co/JhY83yWoTi" title="http://funf.org/">funf.org</a></p>
<p>&mdash; John Keefe (@jkeefe) <a href="https://twitter.com/jkeefe/status/314955049778429952">March 22, 2013</a></p></blockquote>
<h2>Sensing citizen science</h2>
<p>While challenges lie ahead, it&#8217;s clear that sensors will be used to create data where there was none before. At SXSW, Williams described a project in Nairobi, Kenya, where cellphones are being used to map informal bus systems. </p>
<p>The <a href="http://www.civicdatadesignproject.org/Digital-Matatus-1">Digital Matatus project</a> is publishing the data into the General Transit Feed Standard, one of the most promising emerging <a href="http://www.xconomy.com/san-francisco/2012/02/21/google-transit-a-search-giant-remaps-public-transportation/?single_page=true">global standards</a> for transit data. &#8220;Hopefully, a year from now [we] will have all the bus routes from Nairobi,&#8221; Williams said.</p>
<div id="attachment_56377" class="wp-caption aligncenter" style="width: 610px"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/ghana-bus-data-map.jpg" alt="Map of Matatus stops in Nairobi, Kenya" width="600" height="340" class="size-full wp-image-56377" /><p class="wp-caption-text">Map of Matatus stops in Nairobi, Kenya</p></div>
<p>Data journalism has long depended upon official data released by agencies. In recent years, data journalists have begun scraping data. Sensors allow another step in that evolution to take place, where civic media can create data to inform the public interest.</p>
<p>Matt Waite, a <a href="http://journalism.unl.edu/cojmc/about/bios/waite.shtml">professor of practice</a> and head of the Drone Journalism Lab at the University of Nebraska-Lincoln, joined the panel in Austin using a Google Hangout and shared how he and his students are experimenting with sensors to gather data for projects.</p>
<p>Journalists are going to run up against stories where no one has data, he said. &#8220;The old way was to give up,&#8221; said Waite. &#8220;I don&#8217;t think that&#8217;s the way to do it.&#8221;</p>
<p>Sensors give journalists a new, interesting way to enlist a distributed audience in gathering needed data, he explained. &#8220;Is it &#8216;capital N&#8217; news? Probably not,&#8221; said Waite. &#8220;But it&#8217;s something people are really interested in. The easy part is getting a parts list together and writing software. The hard part is the creative process it takes to figure out what we are going to measure and what it means.&#8221;</p>
<p>In an interview with the Nieman Journalism Lab on <a href="http://www.niemanlab.org/2013/03/the-cicadas-are-coming-wnycs-tracker-is-the-latest-sign-of-the-rise-of-sensor-news-networks">sensor journalism</a>, Waite also raised practical concerns with the quality of data collection that can be gathered with inexpensive hardware. &#8220;One legitimate concern about doing this is, you&#8217;re talking about doing it with the cheapest software you can find,&#8221; Waite told the Nieman Lab&#8217;s Caroline O&#8217;Donovan. &#8220;It&#8217;s not expertly calibrated. It&#8217;s not as sensitive as it possibly could be.&#8221;</p>
<p>Those are questions that will be explored practically in New York in the months ahead, when New York City&#8217;s public radio station will be collaborating with the Columbia School of Public Health to collect data about New York&#8217;s environmental conditions. They&#8217;ll put particulate detectors, carbon dioxide monitors, leg motion sensors, audio monitors, cameras and GPS trackers on bicycles and ride around the city collecting pollution data.</p>
<p>&#8220;At WNYC, we already do crowdsourcing, where we ask our audience to do something,&#8221; said Keefe. &#8220;What if we could get our audience to do something with this? What if you could get an audience to work with you to solve a problem?&#8221;</p>
<p>Keefe also announced the <a href="http://project.wnyc.org/cicadas/">Cicada Project</a>, where WNYC is inviting its listeners to build homemade sensors and track the emergence of cicadas this spring across New Jersey, New York and the Northeast region.</p>
<div id="attachment_56378" class="wp-caption aligncenter" style="width: 610px"><a href="http://project.wnyc.org/cicadas/"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/wnyc-bugs.jpg" alt="WNYC Cicada Project" width="600" height="544" class="size-full wp-image-56378" /></a><p class="wp-caption-text"><a href="http://project.wnyc.org/cicadas/">WNYC Cicada Project.</a></p></div>
<p>This <a href="http://www.scientificamerican.com/citizen-science/project.cfm?id=cicada-tracker-wnyc-magicicada">cicada tracker project</a> is a 21st century parallel to the role that birders have played for decades in the annual <a href="http://birds.audubon.org/christmas-bird-count">Christmas Bird Count</a>, creating new horizons for citizen science and public media.</p>
<p><em>Update</em>: WNYC&#8217;s public is responding in interesting ways that go beyond donations. On Twitter, <a href="https://twitter.com/jkeefe/status/314924670329712642">Keefe highlighted</a> the work of a NYC-based hacker, Guan, who was able to make a cicada tracker for $20, 1/4 the cost of WNYC&#8217;s kit. </p>
<blockquote class="twitter-tweet"><p>@<a href="https://twitter.com/jkeefe">jkeefe</a> &lt;$20 cicada tracker (using crappy uncharacterized thermistor I had lying around). @<a href="https://twitter.com/johnmyleswhite">johnmyleswhite</a> <a href="http://t.co/E6xQ6EgIIR" title="http://twitter.com/guan/status/314920424322498561/photo/1">twitter.com/guan/status/31…</a></p>
<p>&mdash; guan (@guan) <a href="https://twitter.com/guan/status/314920424322498561">March 22, 2013</a></p></blockquote>
<h2>Sensing challenges ahead</h2>
<p>Just as civic technologists need to be mindful of &#8220;<a href="http://davidsasaki.name/2012/12/on-hackathons-and-solutionism/">solutionism</a>,&#8221; so too will data journalists need to be aware of the &#8220;<a href="http://del-fi.org/post/44793351571/solutionism-and-sensorism">sensorism</a>&#8221; that exists in the health care world, <a href="http://del-fi.org/post/44793351571/solutionism-and-sensorism">as John Wilbanks pointed out this winter</a>.</p>
<p>&#8220;Sensorism is rife in the sciences,&#8221; Wilbanks wrote. &#8220;Pick a data generation task that used to be human centric and odds are someone is trying to automate and parallelize it (often via solutionism, oddly &mdash; there&#8217;s an app to generate that data). What&#8217;s missing is the epistemic transformation that makes the data emerging from sensors actually useful to make a scientific conclusion &mdash; or a policy decision supposedly based on a scientific consensus.&#8221;</p>
<p>Anyone looking to practice sensor journalism will face interesting challenges, from incorrect conclusions based upon faulty data to increased risks to journalists carrying the sensors, to gaming or misreporting.</p>
<p>&#8220;Data accuracy is both a real and a perceived problem,&#8221; said Moradi at SXSW. &#8220;Third-party verification by journalists or other non-aligned groups may be needed.&#8221;</p>
<p>Much as in the cases of &#8220;drone journalism&#8221; and data journalism, context, usage and ethics have to be considered before you launch a quadcopter, fire up a scraper or embed sensors around your city. The question you come back to is whether you&#8217;re facing a new ethical problem or an old ethical problem with new technology, suggested Waite at SXSW.  &#8220;The truth is, most ethical issues you can find with a new analogue.&#8221;</p>
<p>It may be, however, that sensor data, applied to taking a &#8220;<a href="http://radar.oreilly.com/2012/07/social-mri-smartphone-data.html">social MRI</a>&#8221; or other uses, may present us with novel challenges. For instance, who owns the data? Who can access or use it? Under what conditions?</p>
<p>A GPS device is a form of sensor, after all, and one that&#8217;s quite useful to law enforcement. While the <a href="http://www.nytimes.com/2012/01/24/us/police-use-of-gps-is-ruled-unconstitutional.html?pagewanted=all&amp;_r=0">Supreme Court ruled</a> that the use of a GPS device for tracking a person without a warrant was unconstitutional, sensor data from cellphones may provide law enforcement with equal or greater insight into a target&#8217;s movements. Journalists may well face unexpected questions about protecting sources if their sensor data captures the movements or actions of a person of interest.</p>
<p>&#8220;There&#8217;s a lot of concern around privacy,&#8221; said Moradi. &#8220;What data can the government request? Will private companies abuse personal data for marketing or sales? Do citizens have the right to personal data held by companies and government?&#8221;</p>
<p>Aharony outlined many of the issues in a 2011 paper on <a href="http://web.media.mit.edu/~cebrian/IS-26-06-Altsh-5.pdf">stealing reality</a>, exploring what happens when criminals become data scientists.</p>
<p>&#8220;It&#8217;s like a slow-moving attack if you attach yourself to someone&#8217;s communication,&#8221; said Aharony, in a follow-up interview in Austin. &#8220;&#8216;<a href="http://radar.oreilly.com/2011/04/iphone-tracking-followup.html">iPhonegate</a>&#8216; didn&#8217;t surprise people who know about mobile app data or how the cellular network is architected. Look at <a href="http://bits.blogs.nytimes.com/2012/02/12/disruptions-so-many-apologies-so-much-data-mining/">what happened to Path</a>. You can make mistakes without meaning to. You have to think about this and encrypt the data.&#8221;</p>
<p><em>This post is part of our series <a href="http://radar.oreilly.com/2012/11/investigating-data-journalism.html">investigating data journalism</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/03/sensor-journalism-data-journalism.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The City of Chicago wants you to fork its data on GitHub</title>
		<link>http://radar.oreilly.com/2013/03/the-city-of-chicago-wants-you-to-fork-its-data-on-github.html</link>
		<comments>http://radar.oreilly.com/2013/03/the-city-of-chicago-wants-you-to-fork-its-data-on-github.html#comments</comments>
		<pubDate>Tue, 19 Mar 2013 13:00:35 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[chicago]]></category>
		<category><![CDATA[github]]></category>
		<category><![CDATA[goss]]></category>
		<category><![CDATA[government as a platform]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[open government]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56255</guid>
		<description><![CDATA[GitHub has been gaining new prominence as the use of open source software in government grows. Earlier this month, I included a few thoughts from Chicago&#8217;s chief information officer, Brett Goldstein, about the city&#8217;s use of GitHub, in a piece &#8230; ]]></description>
				<content:encoded><![CDATA[<p>GitHub has been gaining <a href="http://radar.oreilly.com/2013/03/github-government-bureaucat-open-source.html">new prominence</a> as the use of open source software in government grows. </p>
<p>Earlier this month, I included a few thoughts from Chicago&#8217;s chief information officer, Brett Goldstein, about the city&#8217;s use of GitHub, in a piece <a href="http://radar.oreilly.com/2013/03/github-government-bureaucat-open-source.html">exploring GitHub&#8217;s role in government</a>.</p>
<p>While Goldstein says that Chicago&#8217;s open data portal will remain the primary means through which Chicago releases public sector data, publishing open data on GitHub is an experiment that will be interesting to watch, in terms of whether it affects reuse or collaboration around it.</p>
<p>In a followup email, Goldstein, who also serves as Chicago&#8217;s chief data officer, shared more about why the city is on GitHub and what they&#8217;re learning. Our discussion follows.</p>
<div id="attachment_56345" class="wp-caption aligncenter" style="width: 610px"><a href="https://github.com/Chicago"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/0313-github-chicago.png" alt="Chicago&#039;s presence on GitHub" width="600" height="330" class="size-full wp-image-56345" /></a><p class="wp-caption-text">The City of Chicago is on GitHub.</p></div>
<p><span id="more-56255"></span></p>
<h2>What has your experience on GitHub been like to date?</h2>
<p><strong>Brett Goldstein:</strong> It has been a positive experience so far. Our local developer community is very excited by the <a href="http://opensource.org/licenses/MIT">MIT License</a> on <a href="https://github.com/Chicago">these datasets</a>, and we have received positive reactions from outside of Chicago as well.</p>
<p>This is a new experiment for us, so we are learning along with the community. For instance, GitHub was not built to be a data portal, so it was difficult to upload our <a href="https://github.com/Chicago/osd-building-footprints">buildings dataset</a>, which was over 2GB. We are rethinking how to deploy that data more efficiently.</p>
<h2>Why use GitHub, as opposed to some other data repository?</h2>
<p><strong>Brett Goldstein:</strong> GitHub provides the ability to download, fork, make pull requests, and merge changes back to the original data. This is a new experiment, where we can see if it&#8217;s possible to crowdsource better data. GitHub provides the necessary functionality. We already had a presence on GitHub, so it was a natural extension to that as a complement to our <a href="http://data.cityofchicago.org">existing data portal</a>.</p>
<h2>Why does it make sense for the city to use or publish open source code?</h2>
<p><strong>Brett Goldstein:</strong> Three reasons. First, it solves issues with incorporating data in open source and proprietary projects. The city&#8217;s data is available to be used publicly, and this step removes any remaining licensing barriers. These datasets were targeted because they are incredibly useful in the daily life of residents and visitors to Chicago. They are the most likely to be used in outside projects. We hope this data can be incorporated into existing projects. We also hope that developers will feel more comfortable developing applications or services based on an open source license.</p>
<p>Second, it fits within the city&#8217;s ethos and vision for data. These datasets are items that are visible in daily life &mdash; transportation and buildings. It is not proprietary data and should be open, editable, and usable by the public.</p>
<p>Third, we engage in projects like this because they ultimately benefit the people of Chicago. Not only do our residents get better apps when we do what we can to support a more creative and vibrant developer community, they also will get a smarter and more nimble government using tools that are created by sharing data. </p>
<p>We open source many of our projects because we feel the methodology and data will benefit other municipalities.</p>
<h2>Is anyone pulling it or collaborating with you? Have you used that code? Would you, if it happened?</h2>
<p><strong>Brett Goldstein:</strong> We collaborated with <a href="https://github.com/iandees">Ian Dees</a>, who is a significant contributor to OpenStreetMaps, to launch this idea. We anticipate that buildings data will be integrated in OpenStreetMaps now that it&#8217;s available with a compatible license.</p>
<p>We have had 21 forks and a handful of pull requests fixing some issues in our README. We have not had a pull request fixing the actual data.</p>
<p>We do intend to merge requests to fix the data and are working on our internal process to review, reject, and merge requests. This is an exciting experiment for us, really at the forefront of what governments are doing, and we are learning along with the community as well.</p>
<h2>Is anyone using the open data that wasn&#8217;t before, now that it&#8217;s JSON?</h2>
<p><strong>Brett Goldstein:</strong> We seem to be reaching a new audience with posting data on GitHub, working in tandem with our heavily trafficked data portal. A core goal of this administration is to make data open and available. We have one of the most ambitious open data programs in the country. Our portal has over 400 datasets that are machine readable, downloadable and searchable. Since it&#8217;s <a href="http://data.cityofchicago.org">hosted on Socrata</a>, basic analysis of the data is possible as well. </p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/03/the-city-of-chicago-wants-you-to-fork-its-data-on-github.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>GitHub gains new prominence as the use of open source within governments grows</title>
		<link>http://radar.oreilly.com/2013/03/github-government-bureaucat-open-source.html</link>
		<comments>http://radar.oreilly.com/2013/03/github-government-bureaucat-open-source.html#comments</comments>
		<pubDate>Fri, 08 Mar 2013 21:12:53 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[github]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[open government]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[social coding]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56158</guid>
		<description><![CDATA[When it comes to government IT in 2013, GitHub may have surpassed Twitter and Facebook as the most interesting social network.  GitHub&#8217;s profile has been rising recently, from a Wired article about open source in government, to its high profile &#8230; ]]></description>
				<content:encoded><![CDATA[<p><a href="http://radar.oreilly.com/2013/03/github-government-bureaucat-open-source.html/github-social-coding" rel="attachment wp-att-56161"><img class="alignright size-full wp-image-56161" alt="github-social-coding" src="http://s.radar.oreilly.com/wp-files/2/2013/03/github-social-coding.jpg" width="160" height="240" /></a>When it comes to government IT in 2013, GitHub may have surpassed Twitter and Facebook as the most interesting social network. </p>
<p>GitHub&#8217;s profile has been rising recently, from a <a href="http://www.wired.com/wiredenterprise/2013/01/hack-the-government/">Wired article</a> about open source in government, to its high profile use by the <a href="http://oreillyradar.tumblr.com/post/30073717898/we-the-coders-white-house-commits-open-source-code">White House</a> and within the <a href="http://radar.oreilly.com/2012/04/open-source-government-cfpb.html">Consumer Financial Protection Bureau</a>. This March, after the <a href="http://civic.mit.edu/blog/kanarinka/reflections-on-the-first-hackathon-at-the-white-house">first White House hackathon</a> in February, the administration&#8217;s digital team posted its new <a href="https://github.com/WhiteHouse/api-standards">API standards</a> on GitHub. In addition to the U.S., code from the <a href="https://github.com/alphagov">United Kingdom</a>, <a href="https://github.com/pwgsc">Canada</a>, <a href="https://github.com/gcba">Argentina</a> and <a href="https://github.com/avoinministerio">Finland</a> is also on the platform.</p>
<p>&#8220;We&#8217;re reaching a tipping point where we&#8217;re seeing more collaboration not only within government agencies, but also between different agencies, and between the government and the public,&#8221; said GitHub head of communications Liz Clinkenbeard, when I asked her for comment.<span id="more-56158"></span></p>
<p>Overall, <a href="http://fedscoop.com/is-github-governments-next-big-thing/">2012</a> was a breakout year for the use of GitHub by government, with more than 350 government code repositories by year&#8217;s end. </p>
<div id="attachment_56211" class="wp-caption aligncenter" style="width: 460px"><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/total-gov-github-repositories.jpg" alt="Total government GitHub repositories" width="450" height="320" class="size-full wp-image-56211" /><p class="wp-caption-text">Total number of government  repositories on GitHub.</p></div>
<p>In January 2012, the British government committed the <a href="http://radar.oreilly.com/2012/01/with-govuk-british-government.html">code for GOV.UK to GitHub</a>.</p>
<p><a href="http://radar.oreilly.com/2011/04/nasa-open-source.html">NASA</a>, after its <a href="http://fedscoop.com/nasa-contributes-first-open-source-project-to-github/">first commit</a>, added <a href="http://open.nasa.gov/blog/2012/08/24/welcoming-the-white-house-to-github/">11 more code repositories</a> over the course of the year.</p>
<p>In September, the new Open Gov Foundation published the <a href="http://oreillyradar.tumblr.com/post/31407101909/opengov-foundation-open-sources-house-oversights">code for the MADISON legislative platform</a>. In December, the <a href="http://radar.oreilly.com/2012/12/the-united-states-code-is-on-github.html">U.S. Code went on GitHub</a>.</p>
<p>GitHub&#8217;s profile was raised further in Washington this week when <a href="http://twitter.com/benbalter">Ben Balter</a> was <a href="https://github.com/blog/1432-ben-balter-is-a-githubber">announced</a> as the company&#8217;s <a href="http://www.nextgov.com/emerging-tech/2013/03/open-source-government-code-sharing-site-hires-federal-liaison/61718/">federal liaison</a>. Balter made some open source history last year, when he was part of the <a href="http://fedscoop.com/is-this-the-federal-governments-first-agency-to-agency-accepted-pull-request/">federal government&#8217;s first agency-to-agency pull request</a>. He also was a big part of giving the White House some much-needed geek cred when he coded the administration&#8217;s <a href="http://radar.oreilly.com/2012/05/white-house-launches-new-digit.html">digital government strategy</a> in <a href="http://www.whitehouse.gov/sites/default/files/omb/egov/digital-government/digital-government.html">HTML5</a>.</p>
<p>Balter will be GitHub&#8217;s <a href="http://fedscoop.com/github-hires-first-government-focused-employee/">first government-focused employee</a>. He won&#8217;t, however, be saddled with an undecipherable title. In a sly dig at the slow-moving institutions of government, and in keeping with GitHub&#8217;s love for <a href="http://octodex.github.com/">octocats</a>, Balter will be the first &#8220;Government Bureaucat,&#8221; focused on &#8220;helping government to do all sorts of governmenty things, well, more awesomely,&#8221; <a href="https://github.com/blog/1432-ben-balter-is-a-githubber">wrote</a> GitHub CIO Scott Chacon. </p>
<p>Part of Balter&#8217;s job will be to evangelize the use of GitHub&#8217;s platform as well as open source in government, in general. The latter will come naturally to him, given how he and the other Presidential Innovation Fellows approached their work. </p>
<p>&#8220;Virtually everything the Presidential Innovation Fellows touched was open sourced,&#8221; said Balter when I interviewed him earlier this week. &#8220;That&#8217;s everything from better IT procurement software to internal tools that we used to streamline paperwork. Even more important, much of that development (particularly <a href="http://radar.oreilly.com/2013/01/rfp-ez-making-it-easier-for-small-companies-to-bid-on-government-contracts.html">RFPEZ</a>) happened entirely in the <a href="https://github.com/presidential-innovation-fellows/rfpez">open</a>. We were taking the open source ethos and applying it to how government solutions were developed, regardless whether or not the code was eventually public. That&#8217;s a big shift.&#8221;</p>
<p>Balter is a proponent of social coding in the open as a means of providing some transparency to interested citizens. &#8220;You can go back and see why an agency made a certain decision, especially when tools like these are used to aid formal decision making,&#8221; he said. &#8220;That can have an empowering effect on the public.&#8221;</p>
<h2>Forking code in city hall and beyond</h2>
<p>There&#8217;s notable government activity beyond the Beltway as well.</p>
<p>The <a href="http://thechangelog.com/the-city-of-chicago-is-on-github/">City of Chicago is now on GitHub</a>, where chief data officer and city CIO Brett Goldstein is <a href="https://github.com/chicago">releasing open data</a> as JSON files, along with open source code.</p>
<p>Both Goldstein and Philadelphia chief data officer Mark Headd are also laudably participating in conversations about code and data on <a href="http://news.ycombinator.com/item?id=5315889">Hacker News threads</a>.</p>
<p>&#8220;Chicago has released over 400 datasets using our data portal, which is located at <a href="http://data.cityofchicago.org">data.cityofchicago.org</a>,&#8221; Goldstein wrote on HackerNews. </p>
<p>While Goldstein says that the city&#8217;s portal will remain the primary way they release public sector data, publishing data on GitHub is an experiment that will be interesting to watch, in terms of whether it affects reuse.</p>
<p>&#8220;We hope [the datasets on GitHub] will be widely used by open source projects, businesses, or non-profits,&#8221; wrote Goldstein. &#8220;GitHub also allows an on-going collaboration with editing and improving data, unlike the typical portal technology. Because it&#8217;s an open source license, data can be hosted on other services, and we&#8217;d also like to see applications that could facilitate easier editing of geographic data by non-technical users.&#8221;</p>
<p>Headd is also <a href="https://github.com/PhillyCDO">on GitHub</a> in a professional capacity, where he and his colleagues have been publishing code to a <a href="https://github.com/cityofphiladelphia">City of Philadelphia repository</a>.</p>
<p>&#8220;We use [GitHub] to share some of our official city apps,&#8221; commented Headd on the same <a href="http://news.ycombinator.com/item?id=5315889">Hacker News thread</a>. &#8220;These are usually simple web apps built with tools like Bootstrap and jQuery. We&#8217;ll be open sourcing more of these going forward. Not only are we interested in sharing the code for these apps, we&#8217;re actively encouraging people to fork, improve and send pull requests.&#8221;</p>
<p>While there&#8217;s still a long road ahead for widespread code sharing between the public and government, the economic circumstances of cities and agencies could create the conditions for more code sharing <em>inside</em> government. In a TED Talk last year, Clay Shirky suggested that adopting open source methods for collaboration could even <a href="http://www.ted.com/talks/clay_shirky_how_the_internet_will_one_day_transform_government.html">transform government</a>.</p>
<p>A more modest (although still audacious) goal would be to simply change how government IT is done. </p>
<p>&#8220;I&#8217;ve often said, the hardest part of being a software developer is training yourself to Google the problem first and see if someone else has already solved it,&#8221; said Balter during our interview. &#8220;I think we&#8217;re going to see government begin to learn that lesson, especially as budgets begin to tighten. It&#8217;s a relative &#8216;app store&#8217; of technology solutions just waiting to be used or improved upon. That&#8217;s the first step: rather than going out to a contractor and reinventing the wheel each time, it&#8217;s training ourselves that we&#8217;re part of a larger ecosystem and to look for prior art. On the flip side, it&#8217;s about contributing back to that commons once the problem has been solved. It&#8217;s about realizing you&#8217;re part of a community. We&#8217;re quickly approaching a tipping point where it&#8217;s going to be easier for government to work together than alone. All this means that a taxpayer&#8217;s dollar can go further, do more with less, and ultimately deliver better citizen services.&#8221;</p>
<p>Some people may understandably bridle at including open source code and open data under the broader umbrella of &#8220;open government,&#8221; particularly if such efforts are not balanced by adherence to good government principles around transparency and accountability.</p>
<p>That said, there&#8217;s reason to hail collaboration around software and data as bonafide examples of 21st century civic participation, where better platforms for social coding enable improved outcomes. The commits and pulls of staff and residents on GitHub may feel like small steps, but they represent measurable progress toward more government not just of the people, but with the people.</p>
<p>&#8220;Open source in government is nothing new,&#8221; said Balter. &#8220;What&#8217;s new is that we&#8217;re finally approaching a tipping point at which, for federal employees, it&#8217;s going to be easier to work together, than work apart. Whereas before, &#8216;open source&#8217; often meant compiling, zipping, and uploading, when you fuse the internal development tools with the external publishing tools, and you make those tools incredibly easy to use, participating in the open source community becomes trivial. Often, it can be more painful for an agency to avoid it completely. I think we&#8217;re about to see a big uptick in the amount of open source participation, and not just in the traditional sense. Open source can be between business units within an agency. Often the left hand doesn&#8217;t know what the right is doing between agencies. The problems agencies face are not unique. Often the taxpayer is paying to solve the same problem multiple times. Ultimately, in a collaborative commons with the public, we&#8217;re working together to make our government better.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/03/github-government-bureaucat-open-source.html/feed</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>If followers can sponsor updates on Facebook, social advertising has a new horizon</title>
		<link>http://radar.oreilly.com/2013/03/facebook-sponsored-posts-followers-friends.html</link>
		<comments>http://radar.oreilly.com/2013/03/facebook-sponsored-posts-followers-friends.html#comments</comments>
		<pubDate>Thu, 07 Mar 2013 22:56:00 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[advertising]]></category>
		<category><![CDATA[behavioral advertising]]></category>
		<category><![CDATA[business models]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[social networking]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56193</guid>
		<description><![CDATA[This week, I found that one of my Facebook updates received significantly more attention that others I&#8217;ve posted. On the one hand, it was a share of an important New York Times story focusing on the first time a baby &#8230; ]]></description>
				<content:encoded><![CDATA[<p>This week, I found that one of my Facebook updates received significantly more attention that others I&#8217;ve posted. On the one hand, it was a share of an important New York Times story focusing on the <a href="http://www.nytimes.com/2013/03/04/health/for-first-time-baby-cured-of-hiv-doctors-say.html?pagewanted=all&amp;_r=0">first time a baby was cured of HIV</a>. But I discovered something that went beyond the story itself: <strong>someone who was not my friend had paid to sponsor one of my posts</strong>. </p>
<p><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/my-promoted-post.jpg" alt="Promoted post on Facebook." width="600" height="450" class="aligncenter size-full wp-image-56196" /></p>
<p>According to Facebook, the promoted post had 27 times as many views because it was sponsored this way, with 96% of the views coming through the sponsored version. </p>
<p>When I started to investigate what had happened, I learned that I&#8217;d missed some relevant news last month. Facebook had announced that <a href="http://allfacebook.com/users-can-now-promote-facebook-friends-posts_b110836">users would be able to promote the posts of friends</a>. My situation, however, was clearly different: Christine Harris, the sponsor of my post, is not my friend. </p>
<p>When I followed up with <a href="http://www.linkedin.com/pub/elisabeth-diana/2/24b/105">Elisabeth Diana</a>, Facebook&#8217;s advertising communications manager, she said this was part of the cross-promote feature that Facebook rolled out. If a reporter posts a public update to his followers on Facebook, Diana explained to me in an email, that update can be promoted and &#8220;boosted&#8221; to the reporter&#8217;s friends.</p>
<p>While I couldn&#8217;t find Harris on Facebook, Diana said with &#8220;some certainty&#8221; that she was my follower, &#8220;in order to have seen your content.&#8221; Harris definitely isn&#8217;t my friend, and while she may well be one of my <a href="https://www.facebook.com/alexhoward/followers">followers</a>, I have no way to search them to determine whether that&#8217;s so.<span id="more-56193"></span></p>
<p>In these situations, &#8220;sponsored&#8221; is the label you&#8217;ll see on promoted posts, Diana explained. She also confirmed to me that anyone can (or will be able to) sponsor/promote the public post of someone else, &#8220;if they are following them or are friends with them.&#8221;</p>
<p>If that happens, the sponsored post will then be boosted only to friends of its author, as opposed to an entire network of followers, said Diana. In the United States, she said that will cost about $7. If this is broadly rolled out, it will be interesting to see if PR companies or news outlets quietly opt to boost stories.</p>
<h2>The only constant on Facebook is change</h2>
<p>What this all seems to herald is a broader move where getting seen on Facebook will depend much more upon your willingness to pay for it. This is, of course, the dynamic that has long existed on radio and television, unless you can earn &#8220;free media&#8221; coverage by being newsworthy.</p>
<p>Given the recent kerfluffle over the <a href="http://bits.blogs.nytimes.com/2013/03/03/disruptions-when-sharing-on-facebook-comes-at-a-cost/">cost of sharing on Facebook</a> and <a href="http://bits.blogs.nytimes.com/2013/03/05/facebook-news-feed-draws-more-criticism/">criticism of the Facebook newsfeed</a>, issues around <a href="http://radar.oreilly.com/2012/12/14-trends-for-2013.html">algorithmic transparency</a> only seem to be growing. </p>
<p>While Facebook posted a &#8220;<a href="https://newsroom.fb.com/Fact-Check">fact check</a>&#8221; in response to Nick Bilton&#8217;s New York Times column, arguing that &#8220;overall engagement on posts from people with followers has gone up 34% year over year,&#8221; my experience on the platform matches his: even with nearly 100,000 subscribers, my updates aren&#8217;t receiving anywhere close to as much engagement as they did before last November.</p>
<p>Given the reactions I&#8217;ve seen to his column, I believe that Bilton speaks for many journalists and others who have turned on subscribers, along with quite a few Page owners. What we see on Facebook is now driven not just by what our friends and family share but how we and others respond to it, as interpreted by algorithms, along with our interests, expressed by Likes, and the social networking giant&#8217;s need to make money. </p>
<p>I remember quite clearly when this shift began, on November 3, 2012. WolframAlpha analytics told me that an <a href="https://www.facebook.com/photo.php?fbid=635931091985">update</a> with a screencap and annotation of Facebook&#8217;s prompt to &#8220;pay to promote&#8221; received the most comments of any picture in 2012. </p>
<p><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/pay-to-promote-on-FB1.jpg" alt="Pay to Promote on Facebook image" width="600" height="331" class="aligncenter size-full wp-image-56197" /></p>
<p>My feeling last November was that paid promotions would result in my updates becoming deprecated in the newsfeeds of others. Feelings, however, have to be balanced with data. Recent research suggests that, like most users, I have underestimated the audience size for my posts. </p>
<p>A new <a href="http://hci.stanford.edu/publications/2013/invisibleaudience/invisibleaudience.pdf">study</a> (PDF) by the human-computer interaction group at Stanford University’s computer science department and <a href="https://www.facebook.com/notes/facebook-data-science/estimating-audience-size-on-facebook/10151390940058859">Facebook&#8217;s data science team</a> found that a median Facebook user reaches 60% of his or her friends over the course of a month.</p>
<p><img src="http://s.radar.oreilly.com/wp-files/2/2013/03/fb-friends-who-saw-post.png" alt="Percentage of Facebook friends who saw a post" width="600" height="249" class="aligncenter size-full wp-image-56199" /></p>
<p>I&#8217;m not sure if making public updates sponsorable will fundamentally change how we use or experience the world&#8217;s biggest social network. Will having followers promote posts degrade your relationships with friends or your interactions with them? Does it create an incentive to be nicer to them? Perhaps the latter, but the rest of it seems uncertain. </p>
<p>What does seem clear is that, over the past five months, Facebook users have been seeing fewer updates from friends and more content targeted to their &#8220;Likes.&#8221; This now include updates containing links or ads regarding products, services, causes or politicians that their friends &#8220;<a href="https://developers.facebook.com/docs/reference/plugins/like/">Like</a>&#8221; elsewhere online.</p>
<p>Given that Facebook is a public company that provides a free, advertising-supported product and needs to grow its revenues, these changes aren&#8217;t surprising. That said, these changes feel like one more step away from the clean, uncluttered network I joined in 2007 to privately share details about my life with friends and family. </p>
<p>If this new pay-to-promote feature catches on with brands and corporations, they will have a quietly effective new means to influence us through our friends. I still find Facebook valuable, but my relationship status with Facebook is now set to &#8220;it&#8217;s complicated.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/03/facebook-sponsored-posts-followers-friends.html/feed</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Untangling algorithmic illusions from reality in big data</title>
		<link>http://radar.oreilly.com/2013/03/untangling-algorithmic-illusions-from-reality-in-big-data.html</link>
		<comments>http://radar.oreilly.com/2013/03/untangling-algorithmic-illusions-from-reality-in-big-data.html#comments</comments>
		<pubDate>Mon, 04 Mar 2013 20:19:57 +0000</pubDate>
		<dc:creator>Alex Howard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[data ethics]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=56125</guid>
		<description><![CDATA[Microsoft principal researcher Kate Crawford (@katecrawford) gave a strong talk at last week&#8217;s Strata Conference in Santa Clara, Calif. about the limits of big data. She pointed out potential biases in data collection, questioned who may be excluded from it, &#8230; ]]></description>
				<content:encoded><![CDATA[<p>Microsoft principal researcher Kate Crawford (<a href="http://twitter.com/katecrawford">@katecrawford</a>) gave a strong talk at last week&#8217;s Strata Conference in Santa Clara, Calif. about the limits of big data. She pointed out potential biases in data collection, questioned who may be excluded from it, and hammered home the constant need for context in conclusions. Video of her talk is embedded below:</p>
<p><iframe width="620" height="349" src="http://www.youtube.com/embed/irP5RCdpilc?feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>Crawford explored many of these same topics in our interview, which follows.</p>
<p><span id="more-56125"></span></p>
<h2>What research are you working on now, following up on your <a href="http://www.scribd.com/doc/98034770/Six-Provocations-for-Big-Data-Danah-Boyd-Kate-Crawford">paper on big data</a>?</h2>
<p><strong>Kate Crawford:</strong> I&#8217;m currently researching how big data practices are affecting different industries, from news to crisis recovery to urban design. This talk was based on that upcoming work, touching on questions of smartphones as sensors, on dealing with disasters (like Hurricane Sandy), and new epistemologies &mdash; or ways we understand knowledge &mdash; in an era of big data. </p>
<p>When &#8220;<a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1926431&amp;http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1926431">Six Provocations for Big Data</a>&#8221; came out in 2011, we were critiquing the very early stages of big data and social media. In the two years since, the issues we raised are even more prominent. </p>
<p>I&#8217;m now looking beyond social media to a range of other areas where big data is raising questions of social justice and privacy. I&#8217;m also editing a special issue on critiques of big data, which will be coming out later this year in the International Journal of Communications.</p>
<h2>As more nonprofits and governments look to data analysis in governing or services, what do they need to think about and avoid?</h2>
<p><strong>Kate Crawford:</strong> Governments have a responsibility to serve all citizens, so it&#8217;s important that big data doesn&#8217;t become a proxy for &#8220;data about everyone.&#8221; There are two problems here: first is the question of who is visible and who isn&#8217;t represented; the second is privacy, or what I call &#8220;privacy practices&#8221; &mdash; because privacy means different things depending on where and who you are. </p>
<p>For example, the <a href="http://streetbump.org/">Streetbump</a> app is brilliant. What city wouldn&#8217;t want to passively draw on data from all those smartphones out there, a constantly moving network of sensors? But, as we know, there are significant percentages of Americans who don&#8217;t have smartphones, particularly older citizens and those with lower disposable incomes. What happens to their neighborhoods if they generate no data? They fall off the map. To be invisible when governments make resource decisions is dangerous. </p>
<p>Then, of course, there&#8217;s the whole issue of people signing up to be passively tracked wherever they go. People may happily opt into it, but we&#8217;d want to be very careful about who gets that data, and how it is protected over the long term &mdash; not just five years, but 50 years and beyond. Governments might be tempted to use that data for other purposes, even civic ones, and this has significant implications for privacy and the expectations citizens have for the use of their data.</p>
<h2>Where else could such biases apply?</h2>
<p><strong>Kate Crawford:</strong> There are many areas where big data bias is a problem from a social equity perspective. One of the key ones at the moment is law enforcement. I&#8217;m concerned by some of the work that seeks to &#8220;profile&#8221; areas, and even people, as likely to be involved in crime. It&#8217;s called &#8220;predictive policing&#8221; (<a href="http://ctolabs.com/wpcontent/uploads/2012/06/120627HadoopForLawEnforcement.pdf">more here</a>). We&#8217;ve already seen some problematic outcomes when profiling was introduced for plane travel. Now, imagine what happens if you or your neighborhood falls on the wrong side of a predictive model. How do you even begin to correct the record? Which algorithm do you appeal to?</p>
<h2>What are the things, as <a href="http://www.nytimes.com/2013/02/19/opinion/brooks-what-data-cant-do.html">David Brooks listed</a> recently, that big data can&#8217;t do?</h2>
<p><strong>Kate Crawford:</strong> There are lots of things that big data can&#8217;t do. It&#8217;s useful to consider the history of knowledge, and then imagine what it would look like if we only used one set of tools, one methodology for getting answers. </p>
<p>This is why I find people like <a href="http://en.wikipedia.org/wiki/Gabriel_Tarde">Gabriel Tarde</a> so interesting &mdash; he was grappling with ideas of method, big data and small data, back in the late 1800s. </p>
<p>He reminds us of what we can lose sight of when we go up orders of magnitude and try to leave small-scale data behind &mdash; like interviewing people, or observing communities, or running limited experiments. Context is key, and it is much easier to be attentive to context when we are surrounded by it. When context is dissolved into so many aggregated datasets, we can start getting mistaken impressions. </p>
<p>When <a href="http://www.nature.com/news/when-google-got-flu-wrong-1.12413">Google Flu Analytics mistakenly predicted</a> that 11% of the US had flu this year, that points to how relying on a big data signal alone may give us an exaggerated or distorted result (in that case, more than double the actual figure, which was between 4.5-4.8%). Now, imagine how much worse it would be if that data was all that health agencies had to work with.</p>
<p>I&#8217;m really interested in how we might best combine computational social science with traditional qualitative and ethnographic methods. With a range of tools and perspectives, we&#8217;re much more likely to get a three-dimensional view of a problem and be less prone to serious error. This goes beyond tacking on a few focus groups to big datasets, but conjoining deep, ethnographically-informed research with rich data sources.</p>
<h2>What can the history of statistics in social science tell us about correlation vs causation? Does big data change that dynamic?</h2>
<p><strong>Kate Crawford:</strong> This is a gigantic question, and one that could be its own talk! With big datasets, it&#8217;s very tempting for researchers to engage in apophenia &mdash; seeing patterns where none actually exist &mdash; because massive quantities of data can point to a range of correlative possibilities.</p>
<p>For example, <a href="http://www.haas.berkeley.edu/faculty/leinweber">David Leinweber</a> showed back in 2007 that data mining techniques could show a strong but spurious correlation between the changes in the S&amp;P 500 stock index and butter production in Bangladesh. There&#8217;s<br />
another <a href="http://www.businessweek.com/magazine/correlation-or-causation-12012011-gfx.html">great correlation</a> between the use of Facebook and the rise of the Greek debt crisis.</p>
<p>With big data techniques, some people argue you can get much closer to being able to predict causal relations. But even here, big data tends to need several steps of preparation (data &#8220;cleaning&#8221; and pre-processing) and several steps in interpretation (deciding which of many analyses shows a positive result versus a null-result). </p>
<p>Basically, humans are still in the mix, and thus it&#8217;s very hard to escape false positives, strained correlations and cognitive bias.</p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2013/03/untangling-algorithmic-illusions-from-reality-in-big-data.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>