Nat Torkington

Nat has chaired the O'Reilly Open Source Convention and other O'Reilly conferences for over a decade. He ran the first web server in New Zealand, co-wrote the best-selling Perl Cookbook, and was one of the founding Radar bloggers. He lives in New Zealand and consults in the Asia-Pacific region.

Four short links: 16 February 2016

Full-on Maker, Robot Recap, Decoding Mandarin, and Sequencing Birds

by Nat Torkington | @gnat | +Nat Torkington | February 16, 2016

Washers and Screws (YouTube) — this chap is making his own clock from scratch, and here he is making his own washers and screws. Sometimes another person’s obsession can be calming. (via Greg Sadetsky)
ROScon 2015 Recap with Videos (Robohub) — Shuttleworth suggests that robotics developers really need two things at this point: a robust Internet of Things infrastructure, followed by the addition of dynamic mobility that robots represent. However, software is a much more realistic business proposition for a robotics startup, especially if you leverage open source to create a developer community around your product and let others innovate through what you’ve built.
Getting Deep Speech to Work in Mandarin (Baidu SVAIL) — TIL that some of the preprocessing traditionally used in speech-to-text systems throws away pitch information necessary to decode tonal languages like Mandarin. Deep Speech doesn’t use specialized features like MFCCs. We train directly from the spectrogram of the input audio signal. The spectrogram is a fairly general representation of an audio signal. The neural network is able to learn directly which information is relevant from the input, so we didn’t need to change anything about the features to move from English speech recognition to Mandarin speech recognition. Their model works better than humans at decoding short text such as queries.
Sequencing Genomes of All Known Kakapo — TIL there’s a project to sequence genomes of 10,000 bird species and that there’s this crowdfunded science project to sequence the kakapo genome. There are only 125 left, and conservationists expect to use the sequenced genomes to ensure rare genes are preserved. Every genome in this species could be sequenced … I’m boggling. (via Duke)

Four short links: 15 February 2016

Deep Learning Analogies, IoT Privacy, Robot Numbers, and App Economy

by Nat Torkington | @gnat | +Nat Torkington | February 15, 2016

Deep Visual Analogy-Making (PDF) — In this paper, we develop a novel deep network trained end-to-end to perform visual analogy making, which is the task of transforming a query image according to an example pair of related images. Open source code from the paper also available.
Samsung’s TV and Privacy Gets More Awkward — Samsung has now issued a new statement clarifying how the voice activation feature works. “If a consumer consents and uses the voice recognition feature, voice data is provided to a third party during a requested voice command search,” Samsung said in a statement. “At that time, the voice data is sent to a server, which searches for the requested content then returns the desired content to the TV.” It only seems creepy until you give in and nothing bad happens, then you normalise the creepy.
2015 Robot Numbers (RoboHub) — The Robotic Industries Association (RIA), representing North American robotics, reported […] 2015 set new records and showed a 14% increase in units and 11% in dollars over 2014. The automotive industry was the primary growth sector, with robot orders increasing 19% year over year. Non-automotive robot orders grew at 5%.
Mozilla, Caribou Digital Release Report Exploring the Global App Economy (Mark Surman) — The emerging markets are the 1% — meaning, they earn 1% of total app economy revenue. 95% of the estimated value in the app economy is captured by just 10 countries, and 69% of the value is captured by just the top three countries. Excluding China, the 19 countries considered low- or lower-income accounted for only 1% of total worldwide value. Developers in low-income countries struggle to export to the global stage. About one-third of developers in the sample appeared only in their domestic market.

Four short links: 12 February 2016

Slack's Voice, Accessible Experiences, IoT Design, and What I Learned

by Nat Torkington | @gnat | +Nat Torkington | February 12, 2016

Webstock: Bug Fixes & Minor Improvements — notes from Anna Pickard’s talk about being the voice of Slack, recorded by Luke Wroblewski.
Webstock: The Map & The Territory — notes from Ethan Marcotte’s talk about making accessible experiences, recorded by Luke Wroblewski.
Webstock: The Shape of Things — notes from Tom Coates’s talk about designing for the Internet of Things, recorded by Luke Wroblewski.
Today I Learned — A collection of concise write-ups on small things I learn day to day across a variety of languages and technologies. These are things that don’t really warrant a full blog post.

Four short links: 11 February 2016

Surviving Crashes, Thumbs-Up Thumbs-Down Learning, Faster Homomorphic Encryption, and Nerdy V-Day Cards

by Nat Torkington | @gnat | +Nat Torkington | February 11, 2016

All File Systems are Not Created Equal: On the Complexity of Crafting Crash Consistent Applications (Paper a Day) — an important subject for me. BOB, the Block Order Breaker, is used to find out what behaviours are exhibited by a number of modern file systems that are relevant to building crash consistent applications. ALICE, the Application Level Intelligent Crash Explorer, is then used to explore the crash recovery behaviour of a number of applications on top of these file systems.
BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 (Arxiv) — instead of complex positive/negative floating-point weights, this uses +1 and -1 (which I can’t help but think of as “thumbs up”, “thumbs down”) to get nearly state-of-the-art results because a run-time, BinaryNet drastically reduces memory usage and replaces most multiplications by 1-bit exclusive-not-or (XNOR) operations, which might have a big impact on both general-purpose and dedicated Deep Learning hardware. GPLv2 code available.
Microsoft Speeds Up Homomorphic Encryption (The Register) — homomorphic encryption lets databases crunch data without needing keys to decode it.
Nerdy Valentine Cards (Evil Mad Scientist) — for a nerd in your life. (via Cory Doctorow)

Four short links: 10 February 2016

Bitcoin Textbook, Brain Books, Post-Quantum Crypto, and Amazon's Game Engine

by Nat Torkington | @gnat | +Nat Torkington | February 10, 2016

Princeton Bitcoin Book (PDF) — The Coursera course accompanying this book had 30,000 students in its first version, and it was a success based on engagement and end-of-course feedback. Large introduction to Bitcoin from Princeton. (via Cory Doctorow)
A Quartet of Complementary Brain Books (Vaughan Bell) — The books have been chosen to complement each other and the idea is that if you read all four, you should have a solid grounding in modern cognitive neuroscience and beyond.
NIST Report on Post-Quantum Cryptography (PDF) — in case you missed it, “post-quantum crypto” is “existing crypto relies on how hard it is to find the prime factors of large numbers, of which we suspect quantum computers may make a mockery. Wut to do?” The goal of post-quantum cryptography (also called quantum-resistant cryptography) is to develop cryptographic systems that are secure against both quantum and classical computers, and can interoperate with existing communications protocols and networks.
Amazon Lumberyard — a free, cross-platform, 3D game engine for you to create the highest-quality games, connect your games to the vast compute and storage of the AWS Cloud, and engage fans on Twitch. From Amazon.

Four short links: 9 February 2016

Collaborative Mario Agents, ElasticSearch at Scale, Anomaly Detection, Robotics Experiment

by Nat Torkington | @gnat | +Nat Torkington | February 9, 2016

Social Intelligence in Mario Bros (YouTube) — collaborative agents built by cognitive AI researchers … they have drives, communicate, learn from each other, and solve problems. Oh, and the agents are Mario, Luigi, Yoshi, and Toad within a Super Mario Brothers clone. No code or papers about it on the research group’s website yet, just a YouTube video and a press release on the university’s website, so appropriately adjust your priors for imminent world destruction at the hands of a rampaging super-AI. (via gizmag)
How we Monitor and Run ElasticSearch at Scale (SignalFx) — sweet detail on metrics, dashboards, and alerting.
Simple Anomaly Detection for Weekly Patterns — Rule-based heuristics do not scale and do not adapt easily, especially if we have thousands of alarms to set up. Some statistical approach is needed that is generic enough to handle many different metric behaviours.
How to Design a Robotics Experiment (Robohub) — although there are many good experimental scientists in the robotic community, there has not been uniformly good experimental work and reporting within the community as a whole. This has advice such as “the five components of a well-designed experiment.”

Four short links: 8 February 2016

Experimental Support, Coding Books, Bad Decisions, and GitHub to Jupyter

by Nat Torkington | @gnat | +Nat Torkington | February 8, 2016

Elemental Machines — Boston startup fitting experiments & experimenters with sensors, deep learning to identify problems (vibration, humidity, etc.) that could trigger experimental failure. [C]rucial experiments are often delayed by things that seem trivial in retrospect. “I talked to my friends who worked in labs,” Iyengar says. “Everyone had a story to tell.” One scientist’s polymer was unstable because of ultraviolet light coming through a nearby window, he says; that took six months to debug. Another friend who worked at a pharmaceutical company was testing drug candidates in mice. The results were one failure after another, for months, until someone figured out that the lab next door was being renovated, and after-hours construction was keeping the mice awake and stressing them out. (that quote from Xconomy)
Usborne Computer and Coding Books — not only do they have sweet Scratch books for kids, they also have their nostalgia-dripping 1980s microcomputer books online. I still have a pile of my well-loved originals.
Powerful People are Terrible at Making Decisions Together — Researchers from the Haas School of Business at the University of California, Berkeley, undertook an experiment with a group of health care executives on a leadership retreat. They broke them into groups, presented them with a list of fictional job candidates, and asked them to recommend one to their CEO. The discussions were recorded and evaluated by independent reviewers. The higher the concentration of high-ranking executives, the more a group struggled to complete the task. They competed for status, were less focused on the assignment, and tended to share less information with each other.
MyBinder — turn a GitHub repo into a collection of interactive notebooks powered by Jupyter and Kubernetes.

Four short links: 5 February 2016

Signed Filesystem, Smart Mirror, Deep Learning Tuts, and CLI: Miami

by Nat Torkington | @gnat | +Nat Torkington | February 5, 2016

Introducing the Keybase Filesystem — love that crypto is making its way into the filesystem.

DIY Smart Bathroom Mirror — finally, someone is building this science-fiction future! (via BoingBoing)
tensorflow tutorials — for budding deep learners.
clmystery — a command-line murder mystery.

Four short links: 4 February 2016

Shmoocon Video, Smart Watchstrap, Generalizing Learning, and Dataflow vs Spark

by Nat Torkington | @gnat | +Nat Torkington | February 4, 2016

Shmoocon 2016 Videos (Internet Archive) — videos of the talks from an astonishingly good security conference.
TipTalk — Samsung watchstrap that is the smart device … put your finger in your ear to hear the call. You had me at put my finger in my ear. (via WaPo)
Ecorithms — Leslie Valiant at Harvard broadened the concept of an algorithm into an “ecorithm,” which is a learning algorithm that “runs” on any system capable of interacting with its physical environment. Algorithms apply to computational systems, but ecorithms can apply to biological organisms or entire species. The concept draws a computational equivalence between the way that individuals learn and the way that entire ecosystems evolve. In both cases, ecorithms describe adaptive behavior in a mechanistic way.
Dataflow/Beam vs Spark (Google Cloud) — To highlight the distinguishing features of the Dataflow model, we’ll be comparing code side-by-side with Spark code snippets. Spark has had a huge and positive impact on the industry thanks to doing a number of things much better than other systems had done before. But Dataflow holds distinct advantages in programming model flexibility, power, and expressiveness, particularly in the out-of-order processing and real-time session management arenas.

Four short links: 3 February 2016

Security Forecast, Machine Learning for Defence, Retro PC Fonts, and Cognitive Psych Research

by Nat Torkington | @gnat | +Nat Torkington | February 3, 2016

Software Security Ideas Ahead of Their Time — astonishing email exchange from 1995 presaged a hell of a lot of security work.
Doxxing Sherlock — Cory Doctorow’s ruminations on surveillance, Sherlock, and what he found in the Snowden papers. What he found included an outline of intelligence use of machine learning.
Old-School PC Fonts — definitive collection of ripped-from-the-BIOS fonts from the various types of PCs. Your eyes will ache with nostalgia. (Or, if you’re a young gun, wondering how anybody wrote code with fonts like that) (my terminal font is VT220 because it makes me happy and productive)
Cognitive Load: Brain Gems — We distill the latest behavioural economics & consumer psychology research down into helpful little brain gems.