- Hidden Technical Debt in Machine Learning Systems (PDF) — We explore several ML-specific risk factors to account for in system design. These include boundary erosion, entanglement, hidden feedback loops, undeclared consumers, data dependencies, configuration issues, changes in the external world, and a variety of system-level anti-patterns.
- Large-Scale Content-Based Matching of Midi and Audio Files — We present a system that can efficiently match and align MIDI files to entries in a large corpus of audio content based solely on content, i.e., without using any metadata.
- Critical Social Research on Self-Tracking — I am currently working on an article that is a comprehensive review of both literatures, in the attempt to outline what each can contribute to understanding self-tracking as an ethos and a practice, and its wider sociocultural implications. Here is a reading list of the work from critical social researchers that I am aware of. Trigger warning: phrases like “The discursive construction of student subjectivities.”
- Warp-CTC — Baidu’s open source deep learning code. Connectionist Temporal Classification is a loss function useful for performing supervised learning on sequence data, without needing an alignment between input data and labels.
Get Four Short Links in Your Inbox
Sign up to receive Nat’s eclectic collection of curated links every weekday.
Data science: Where are we going? - DJ Patil, the U.S. government's first Chief Data Scientist, looks at the future of data science at Strata + Hadoop World 2015 in San Jose.
Most Recently Discussed
Radar managing editor