"data journalism" entries

Four short links: 29 October 2015

Cloud Passports, Better Python Notebooks, Slippery Telcos, and Python Data Journalism

by Nat Torkington | @gnat | +Nat Torkington | October 29, 2015

Australia Floating the Idea of Cloud Passports — Under a cloud passport, a traveller’s identity and biometrics data would be stored in a cloud, so passengers would no longer need to carry their passports and risk having them lost or stolen. That sound you hear is Taylor Swift on Security, quoting “Wildest Dreams” into her vodka and Tang: “I can see the end as it begins.” This article is also notable for the line “The idea of cloud passports is the result of a hipster-style-hackathon.”
Jupyter — Python notebooks that allow you to create and share documents that contain live code, equations, visualizations, and explanatory text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, machine learning, and much more.
Telcos $24B Business In Your Data — Under the radar, Verizon, Sprint, Telefonica, and other carriers have partnered with firms including SAP, IBM, HP, and AirSage to manage, package, and sell various levels of data to marketers and other clients. It’s all part of a push by the world’s largest phone operators to counteract diminishing subscriber growth through new business ventures that tap into the data that showers from consumers’ mobile Web surfing, text messaging, and phone calls. Even if you do pay for it, you’re still the product.
Introducing Agate — a Python data analysis library designed to be usable by non-data-scientists, which leads to readable and predictable code. Target market: data journalists.

Four short links: 1 August 2014

Data Storytelling Tools, Massive Dataset Mining, Failed Crowdsourcing, and IoT Networking

by Nat Torkington | @gnat | +Nat Torkington | August 1, 2014

Miso — Dataset, a JavaScript client-side data management and transformation library, Storyboard, a state and flow-control management library & d3.chart, a framework for creating reusable charts with d3.js. Open source designed to expedite the creation of high-quality interactive storytelling and data visualisation content.
Mining of Massive Datasets (PDF) — book by Stanford profs, focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web. Further, the book takes an algorithmic point of view: data mining is about applying algorithms to data, rather than using data to “train” a machine-learning engine of some sort.
Lessons from Iceland’s Failed Crowdsourced Constitution (Slate) — Though the crowdsourcing moment could have led to a virtuous deliberative feedback loop between the crowd and the Constitutional Council, the latter did not seem to have the time, tools, or training necessary to process carefully the crowd’s input, explain its use of it, let alone return consistent feedback on it to the public.
Thread a ZigBee Killer? — Thread is Nest’s home automation networking stack, which can use the same hardware components as ZigBee but is neither compatible with it nor open source. The Novell NetWare of Things. Nick Hunn makes the argument that Google (via Nest) is taking aim at ZigBee: it’s Google and Nest saying “ZigBee doesn’t work”.
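
The "does not fit in main memory" constraint mentioned for Mining of Massive Datasets can be made concrete with a small sketch. The example below is not taken from the book; it is a minimal illustration, assuming the data arrives as a stream that can only be read once, of reservoir sampling: keeping a uniform random sample of fixed size while never holding more than `k` items in memory. The function name `reservoir_sample` and the toy `numbers` generator are made up for this illustration.

```python
import random
from typing import Iterable, Iterator, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int, seed: Optional[int] = None) -> List[T]:
    """Return up to k items chosen uniformly at random from a one-pass stream."""
    rng = random.Random(seed)
    reservoir: List[T] = []
    for index, item in enumerate(stream):
        if index < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Keep the new item with probability k / (index + 1),
            # overwriting a uniformly chosen slot in the reservoir.
            slot = rng.randint(0, index)
            if slot < k:
                reservoir[slot] = item
    return reservoir


def numbers(n: int) -> Iterator[int]:
    """Hypothetical stand-in for a dataset too large to load at once."""
    for value in range(n):
        yield value


# Only five items are ever held in memory, however long the stream is.
sample = reservoir_sample(numbers(100_000), k=5, seed=42)
print(sample)
```

Passing a `seed` makes the sample reproducible, which helps when a result has to be checked by an editor or a reader.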

Four short links: 1 April 2014

Unimaginative Vehicular Connectivity, Data Journalism, VR and Gender, and Open Data Justice

by Nat Torkington | @gnat | +Nat Torkington | April 1, 2014

Connected for a Purpose (Jim Stogdill) — At a recent conference, an executive at a major auto manufacturer described his company’s efforts to digitize their line-up like this: “We’re basically wrapping a two-ton car around an iPad.” Eloquent critique of the Internet of Shallow Things.
Why Nate Silver Can’t Explain It All — Data extrapolation is a very impressive trick when performed with skill and grace, like ice sculpting or analytical philosophy, but it doesn’t come equipped with the humility we should demand from our writers. Would be a shame for Nate Silver to become Malcolm Gladwell: nice stories but they don’t really hold up.
Gender and VR (danah boyd) — Although there was variability across the board, biological men were significantly more likely to prioritize motion parallax. Biological women relied more heavily on shape-from-shading. In other words, men are more likely to use the cues that 3D virtual reality systems relied on. Great article, especially notable for the observation that there are more sex hormones on the retina than anywhere else in the body except for the gonads.
Even The Innocent Should Worry About Sex Offender Apps (Quartz) — And when data becomes compressed by third parties, when it gets flattened out into one single data stream, your present and your past collide with potentially huge ramifications for your future. When it comes to personal data—of any kind—we not only need to consider what it will be used for but how that data will be represented, and what such representation might mean for us and others. Data policies are like justice systems: either you suffer a few innocent people being wrongly condemned (bad uses of open data), or your system permits some wrongdoers to escape (mould grows in the dark).

Knight news winners, a journalist-summoning app, and an analysis of Forbes and new media.

by Janaya Williams | January 17, 2014

Knight news announced the seven newest winners of their news challenge grants this week. For this round, the challenge focused on health data. The projects include a personal monitor which will allow people to do their own chemical analysis of their environments, and an online portal where people can volunteer their personal health information to aid in medical research.

German journalists at the online news outlet Mittendrin, in partnership with Open Data City, have developed an app that allows members of the public to alert a journalist when they witness a newsworthy event, like a police action or spontaneous demonstration. The ‘Call A Journalist’ app will contact a journalist and deliver your GPS information along with your report. The best part is that after your information is relayed, the app will let you know that a journalist is on the way. Now, why didn’t I think of that?

Secure Reporting, a new life for EveryBlock, and predictions for 2014.

by Janaya Williams | December 20, 2013

According to the Committee to Protect Journalists, 2013 was the second worst year on record for imprisoning journalists around the world for doing their work.

Which makes this story from PBS Idea Lab all the more important: How Journalists Can Stay Secure Reporting from Android Devices. There are tips here on how to anonymize data flowing through your phone using Tor, an open network that helps protect against traffic analysis and network surveillance. Also, there is information about video publishing software that facilitates YouTube posting, even if the site is blocked in your country. Very cool.

The Nieman Lab is publishing an ongoing series of Predictions for Journalism in 2014, and, predictably, the idea of harnessing data looms large. Hassan Hodges, director of innovation for the MLive Media Group, says that in this new journalism landscape, content will start to look more like data and data will look more like content. Poderopedia founder Miguel Paz says that news organizations should fire the consultants and hire more nerds. There are 51 contributions so far, and counting. It’s good reading.

A new data working group from BBC News, a data library adds uploading capabilities, and a timeline of data journalism.

by Janaya Williams | December 6, 2013

BBC News is the latest media company to create a working group tasked with developing “innovative and experimental” journalism projects. The BBC ‘NewsLabs’ team will focus on data journalism and data visualization. The Guardian calls it a ‘back to the future’ move by the BBC’s new managing editor, James Harding.

After Washington Post owner Jeff Bezos announced this week that Amazon may soon be making customer deliveries by drone, USA TODAY wondered whether newspaper delivery boys in Bezos’ jurisdiction should be worried.

Pulling a Dick Cheney, context, and just getting started in data journalism.

by Janaya Williams | November 23, 2013

The New York Times is replacing Nate Silver’s FiveThirtyEight blog (which Silver took to ESPN back in July) with a brand new site intended to “produce clear analytical reporting and writing on opinion polls, economic indicators, politics, policy, education, and sports.” The venture will be headed by D.C. bureau chief David Leonhardt, who also helmed the search committee and selected himself for the job. Naturally, his colleagues are teasing Leonhardt for “pulling a Dick Cheney.” The new team will also include presidential historian Michael Beschloss, Nate Cohn of The New Republic, and economist Justin Wolfers.

Take it from me: if you are short on time, do not even attempt to play around on the new Spending Stories website. Developed by the folks at Open Knowledge Foundation and Journalism++, Spending Stories is intended to help journalists understand and contextualize spending data by making easy comparisons to other data. For example, using the site, I was able to see that US$15,000 is equal to 3% of private ambulance costs in Yorkshire, England; 0.02% of the cost of the contract awarded to IT company CGI for implementing healthcare.gov; and 90% of government spending per person per year in the UK in 2012. It’s a fun tool!
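
The comparisons described above come down to expressing one amount as a percentage of another. The post does not say how Spending Stories computes them, so the following is only a rough sketch of that arithmetic. The function name `share_of` and the reference totals are hypothetical placeholders, not figures from the site, and the sketch assumes both amounts are already in the same currency.

```python
from typing import Dict


def share_of(amount: float, reference: float) -> float:
    """Return amount expressed as a percentage of reference."""
    if reference <= 0:
        raise ValueError("reference must be positive")
    return 100 * amount / reference


# Hypothetical reference totals, chosen only to keep the arithmetic easy
# to follow; they are NOT real spending figures.
references: Dict[str, float] = {
    "hypothetical programme A": 500_000,
    "hypothetical programme B": 60_000,
}

amount = 15_000
for label, total in references.items():
    print(f"{amount:,} is {share_of(amount, total):.2f}% of {label}")
```

A real comparison across countries would also need a currency conversion step and a choice of reference year, both of which are left out here.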

Data journalism’s secrets, no more math-bashing, and a new way to create visualizations.

by Janaya Williams | November 9, 2013

The ProPublica Nerd Blog this week features an article by Hassel Fallas, a data journalist at La Nación in Costa Rica. Fallas was a 2013 Fellow at the International Center for Journalists, where she studied up on Data-Driven Journalism’s Secrets. Spoiler alert: The secret is…don’t keep secrets.

Over at the data-driven journalism blog, A Fundamental Way Data Repositories Must Change includes some fascinating examples of how data has been historically manipulated in Romania and Rwanda, including some examples from the present day.

Google Chrome’s new extension, Knoema, provides access to more than 500 data repositories and provides visualization tools for use with those databases. Knoema’s CTO says the platform can be used solely as a data source, but more importantly, it can be used as a tool for journalists to create embeddable visualisations. Pretty cool.

Data journalism and social media merge, a call-out for ‘crap’ data journalism, and tips for creating a data resume.

by Janaya Williams | October 31, 2013

I suppose it was only a matter of time before the worlds of data journalism and social media cozied up and got comfortable. The London office of the Trinity Mirror announced that their new initiative, Mysterious Project Y, will focus on creating data journalism that will be compelling to share on the social Web. The site will focus on visualizations: “charts, graphs, facts, and figures” that “people care passionately about.”

You may have heard the statistic floating around lately that it is more difficult to land a job at a new location of the supermarket chain Wegmans than it is to gain admittance to Harvard University. The dragonflyeye blog refutes the numbers, and says that the story is an example of “crap data journalism.” Ouch.

Journalism’s new scoop, learning from Harry Potter, and meeting designers more than half way.

by Janaya Williams | October 17, 2013

As an aspiring journalist, I worked in the Washington Post newsroom in an entry-level position that used to be known as “copy boy” (later updated to the more inclusive “copy aide”). I loved taking in the energy of the reporters, especially when they had pulled off a “scoop,” or a story that the other papers didn’t have yet. There was a pall over the newsroom when other papers “scooped” us and published a story that the Post reporters had been too slow to report.

Finnish data journalist Esa Makinen says that data visualizations are journalism’s “new scoop.” Text stories can be quickly re-published by competitors, Makinen told journalism.co.uk, but data visualizations cannot be copied. Makinen works on the data desk at Finland’s daily paper and website, Helsingin Sanomat, and spoke this week at the Digital Journalism Days conference in Warsaw.

A new tablet-first investigative publication is in the works from a team of data journalists around the world. Acuerdo (an old Spanish word for ‘agreement’) bills itself as “long-form journalism for pissed off readers.” The first edition will be published next month in three languages. If you self-identify as a pissed-off reader, consider making a contribution to Acuerdo’s Kickstarter campaign.
