Fairness in Machine Learning — read this fabulous presentation. Most ML objective functions create models accurate for the majority class at the expense of the protected class. One way to encode “fairness” might be to require similar/equal error rates for protected classes as for the majority population.
A Recent Discussion about DRM (Joi Ito) — strong arguments against including Digital Rights Management in W3C’s web standards (I can’t believe we’re still debating this; it’s such a self-evidently terrible idea to bake disempowerment into web standards).
MacroBase — Analytic monitoring for the Internet of Things. The code behind a research paper, written up in the morning paper where Adrian Colyer says, there is another story that also unfolds in the paper – one of careful system design based on analysis of properties of the problem space, of thinking deeply and taking the time to understand the prior art (aka “the literature”), and then building on those discoveries to advance and adapt them to the new situation. “That’s what research is all about!” you may say, but it’s also what we’d (I’d?) love to see more of in practitioner settings, too. The result of all this hard work is a system that comprises just 7,000 lines of code, and I’m sure, many, many hours of thinking!
Survey of Commenters and Comment Readers — Americans who leave news comments, who read news comments, and who do neither are demographically distinct. News commenters are more male, have lower levels of education, and have lower incomes compared to those who read news comments. (via Marginal Revolution)
Moneyball for Book Publishers: A Detailed Look at How We Read (NYT) — On average, fewer than half of the books tested were finished by a majority of readers. Most readers typically give up on a book in the early chapters. Women tend to quit after 50 to 100 pages, men after 30 to 50. Only 5% of the books Jellybooks tested were completed by more than 75% of readers. Sixty percent of books fell into a range where 25% to 50% of test readers finished them. Business books have surprisingly low completion rates. Not surprisingly low to anyone who has ever read a business book. They’re always a 20-page idea stretched to 150 pages because that’s how wide a book’s spine has to be to visible on the airport bookshelf. Fat paper stock and 14-point text with wide margins and 1.5 line spacing help, too. Don’t forget to leave pages after each chapter for the reader’s notes. And summary checklists. And … sorry, I need to take a moment.
Deep Visual Analogy-Making (PDF) — In this paper, we develop a novel deep network trained end-to-end to perform visual analogy making, which is the task of transforming a query image according to an example pair of related images. Open source code from the paper also available.
Samsung’s TV and Privacy Gets More Awkward — Samsung has now issued a new statement clarifying how the voice activation feature works. “If a consumer consents and uses the voice recognition feature, voice data is provided to a third party during a requested voice command search,” Samsung said in a statement. “At that time, the voice data is sent to a server, which searches for the requested content then returns the desired content to the TV.” It only seems creepy until you give in and nothing bad happens, then you normalise the creepy.
2015 Robot Numbers (RoboHub) — The Robotic Industries Association (RIA), representing North American robotics, reported […] 2015 set new records and showed a 14% increase in units and 11% in dollars over 2014. The automotive industry was the primary growth sector, with robot orders increasing 19% year over year. Non-automotive robot orders grew at 5%.
Mozilla, Caribou Digital Release Report Exploring the Global App Economy (Mark Surman) — The emerging markets are the 1% — meaning, they earn 1% of total app economy revenue. 95% of the estimated value in the app economy is captured by just 10 countries, and 69% of the value is captured by just the top three countries. Excluding China, the 19 countries considered low- or lower-income accounted for only 1% of total worldwide value. Developers in low-income countries struggle to export to the global stage. About one-third of developers in the sample appeared only in their domestic market.
Overview of the Civilian Drone Market (DIY Drones) — Six categories: toy quadcopters; FPV/racing; consumer camera drones; prosumer camera drones; consumers, industrial, agricultural, NGO and Research drones; winged and VTOL drones.
Bottom Up Computer Science — A free, online book designed to teach computer science from the bottom end up. Topics covered include binary and binary logic, operating systems internals, toolchain fundamentals, and system library fundamentals.
The Economics of Drone Delivery — The analysis is still mostly speculative. Keeney imagines that 6,000 operators who earn $50,000 per year will operate 30,000 to 40,000 drones. Each drone will make 30 deliveries per day. Her analysis ignores depreciation and questions like: ‘How will drones avoid airplanes and deliver packages in Manhattan?’ And there’s another core issue: $12.92 is the price UPS charges to consumers, but its actual marginal cost of delivering one more package along a route they are delivering to already is probably closer to $2. When push comes to shove, will drones be able to compete? (via Chris Anderson)
7 Ways Your Data is Telling You It’s a Graph — Network, tree, taxonomy, ancestry, structure – if people are using those words to talk about an organizational chart or reporting structure, they’re telling you that data and the relationships between that data are important.
Six Degrees of Francis Bacon — recreates the British early modern social network to trace the personal relationships among figures like Bacon, Shakespeare, Isaac Newton, and many others. (via CMU)
Last Bus Startup Standing (TechCrunch) — Vahabzadeh stressed that a key point of Chariot’s survival has been that the company has been above-board with the law from day one. “They haven’t cowboy-ed it,” said San Francisco supervisor Scott Wiener, a mass transit advocate who recently pushed for a master subway plan for the city. “They’ve been good about taking feedback and making sure they’re complying with the law. I’m a fan and think that private transportation options and rideshares have a significant role to play in making us a transit-first city.”
Mobile App Developers are Suffering — the top 20 app publishers, representing less than 0.005% of all apps, earn 60% of all app store revenue. The article posits causes of the particularly extreme power law.
How Big is the Gig Economy? (Medium) — this is one example in which the Labor Department and Bureau of Labor Statistics really have shirked their responsibility to try and assess the size and growth of this dynamic shift to our economy.
The Twelve Networking Truths — RFC1925 is channeling the epigram-leaking protagonist of Robert Heinlein’s Time Enough for Love. It is easier to move a problem around (for example, by moving the problem to a different part of the overall network architecture) than it is to solve it. This is true for most areas of life: generally easier to make it someone else’s problem than to solve it.
The Decay of Twitter (The Atlantic) — In other words, on Twitter, people say things that they think of as ephemeral and chatty. Their utterances are then treated as unequivocal political statements by people outside the conversation. Because there’s a kind of sensationalistic value in interpreting someone’s chattiness in partisan terms, tweets “are taken up as magnum opi to be leapt upon and eviscerated, not only by ideological opponents or threatened employers but by in-network peers.”
Power of Small Groups (Matt Webb) — Matt’s joined a small Slack community of like-minded friends. There’s a space where articles written or edited by members automatically show up. I like that. I caught myself thinking: it’d be nice to have Last.FM here, too, and Dopplr. Nothing that requires much effort. Let’s also pull in Instagram. Automatic stuff so I can see what people are doing, and people can see what I’m doing. Just for this group. Back to those original intentions. Ambient awareness, togetherness. cf Clay Shirky’s situated software. Everything useful from 2004 will be rebuilt once the fetish for scale passes.
Asymmetric Misperceptions (PDF) — research into the systematic mismatch between how politicians think their constituents feel on issues, and how the constituents actually feel. Our findings underscore doubts that policymakers perceive opinion accurately: politicians maintain systematic misperceptions about constituents’ views, typically erring by over 10 percentage points, and entire groups of politicians maintain even more severe collective misperceptions. A second, post-election survey finds the electoral process fails to ameliorate these misperceptions.
Homebrew Bioweapons Not Imminent Threat — you need a safe facility, lab instruments, base strain, design and execution skills, and testing. None of these are easy until the Amazon-Google cloud wars finally cause them to move into “bioweapons as a service.”
gitrob — a command line tool that can help organizations and security professionals find such sensitive information. The tool will iterate over all public organization and member repositories and match filenames against a range of patterns for files, that typically contain sensitive or dangerous information.
How Much is a Leader’s Integrity Worth? — Kiel found that high-integrity CEOs had a multi-year return of 9.4%, while low-integrity CEOs had a yield of just 1.9%. What’s more, employee engagement was 26% higher in organizations led by high-integrity CEOs. (via Neelan Choksi)