- The Million Follower Fallacy (PDF) — We found that indegree represents a user’s popularity, but is not related to other important notions of inﬂuence such as engaging audience, i.e., retweets and mentions. Retweets are driven by the content value of a tweet, while mentions are driven by the name value of the user. Such subtle differences lead to dissimilar groups of the top Twitter users; users who have high indegree do not necessarily spawn many retweets or mentions. This ﬁnding suggests that indegree alone reveals very little about the inﬂuence of a user. Research confirms what we all knew, that idiots who chase follower numbers have the influence they deserve. (via Steve O’Grady on Twitter, indirectly)
- Geocoding Github: Visualizing Distributed Open-Source Development — work for the Stanford visualization class, plotting open source commits on maps over time. See this page for the interactive explorer. (via Michael Driscoll on Twitter)
- ArduPilotMega 1.0 Launched — autopilot built on the Arduino platform. (via Chris Anderson on Twitter)
- Lessons of the Gawker Security Mess (Forbes blog) — nice deconstruction of what happened. In the chat, Gawker’s Hamilton Nolan, after hearing that it is just Gawker users who have been compromised, remarks “oh, well. unimportant”. Gawker’s Richard Lawson wants to know if the breach is limited to “just the peasants?” Don’t trash talk about your users in company channels. The business that forgets it lives and dies on its customers is a business that will eventually be hated by its customers. (via Nahum Wild on Twitter)
ENTRIES TAGGED "visualizations"
Stack Exchange goes in-house, Netflix pays for platforms, survey data gets visualized, and Infochimps acquires Data Marketplace
In this edition of Strata Week: Stack Exchange takes their hardware and software in-house; Neflix explains their adoption of AWS and open source; the New York Times maps out survey and census data; and Infochimps acquires Data Marketplace.
Twitter Influence, Open Source Visualized, Arduino Autopilot, and Customer Respect
Use Gephi and Python to find your personal communities
Using a bit of Python and the Gephi graph tool, exploring your own Twitter network is a great way to learn about analyzing networks: and the results definitely have a "wow" factor.
Powerful open source graph manipulation
A Photoshop for data, Gephi is a powerful tool for exploring and presenting data as a graph. It's easy to get started with sample data sets, then import your own by generating files in a standard graph format.
Data geekery, visualization and journalism
From deep-diving startup founders to national newspapers, there's a rich vein of wisdom and information in blogs about data. Here's five to get your reading list started.
Easy-to-use timelines catch on with consumers and publishers.
Dipity is making it easier for businesses, media outlets and individual users to create interactive timelines. In the following interview, Dipity co-founder and CEO Derek Dukes discusses the company's business model and the opportunities that come when rich datasets are matched with user-friendly interfaces.
Make your charts more informative and interesting
You know bar and pie charts can be ineffective, but where do you turn next? Here are some great starting points to make a start with more interesting and effective charts.
The Processing language is an easy way to get started with graphics
Visualization is a powerful way to turn data into a story. But if you're not a "graphics person", where do you start?
Twitter Mapped, Bibliographic Data Released, Babies Engadgeted, and Nat's Christmas Present Sorted
- A Day in the Life of Twitter (Chris McDowall) — all geo-tagged tweets from 24h of the Twitter firehose, displayed. Interesting things can be seen, such as Jakarta glowing as brightly as San Francisco. (via Chris’s sciblogs post)
- British Library Release 3M Open Bibliographic Records) (OKFN) — This dataset consists of the entire British National Bibliography, describing new books published in the UK since 1950; this represents about 20% of the total BL catalogue, and we are working to add further releases.
- Gadgets for Babies (NY Times) — cry decoders, algorithmically enhanced rocking chairs, and (my favourite) “voice-activated crib light with womb sounds”. I can’t wait until babies can make womb sound playlists and share them on Twitter.
- GP2X Caanoo MAME/Console Emulator (ThinkGeek) — perfect Christmas present for, well, me. Emulates classic arcade machines and microcomputers, including my nostalgia fetish object, the Commodore 64. (via BoingBoing’s Gift Guide)
Syntax Highlighting, Forkability, Product Invention, Science Animations
- Fear of Forking — (Brian Aker) GitHub has begun to feel like the Sourceforge of the distributed revision control world. It feels like it is littered with half started, never completed, or just never merged trees. If you can easily takes changes from the main tree, the incentive to have your tree merged back into the canonical tree is low.
- Product Invention Workshops (BERG London) — Matt Webb explains what they do with customers. Output takes the form, generally, of these microbriefs. A microbrief is how we encapsulate recommendations: it’s a sketch and short description of a new product or effort that will easily test out some hypothesis or concept arrived at in the workshop. It’s sketched enough that people outside the workshop can understand it. And it’s a hook to communicate the more abstract principles which have emerged in the days. Their process isn’t their secret weapon, it’s their creativity, empathy, and communication skills that make them so valuable.
- OneMicron — Janet Isawa’s beautiful animations of biological science. (via BoingBoing who linked to this NYTimes piece)