<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>O&#039;Reilly Radar &#187; Roger Magoulas</title>
	<atom:link href="http://radar.oreilly.com/rogerm/feed" rel="self" type="application/rss+xml" />
	<link>http://radar.oreilly.com</link>
	<description>Insight, analysis, and research about emerging technologies</description>
	<lastBuildDate>Fri, 17 May 2013 16:29:56 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Open source won</title>
		<link>http://radar.oreilly.com/2012/07/open-source-won.html</link>
		<comments>http://radar.oreilly.com/2012/07/open-source-won.html#comments</comments>
		<pubDate>Mon, 30 Jul 2012 17:00:39 +0000</pubDate>
		<dc:creator>Roger Magoulas</dc:creator>
				<category><![CDATA[Programming]]></category>
		<category><![CDATA[@editpick]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[developers]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[open source community]]></category>
		<category><![CDATA[open source model]]></category>
		<category><![CDATA[OSCON]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://radar.oreilly.com/?p=49880</guid>
		<description><![CDATA[I heard the comments a few times at the 14th OSCON: The conference has lost its edge. The comments resonated with my own experience &#8212; a shift in demeanor, a more purposeful, optimistic attitude, less itching for a fight. Yes, &#8230; ]]></description>
				<content:encoded><![CDATA[<p>I heard the comments a few times at the <a href="http://www.oscon.com/oscon2012">14th OSCON</a>: The conference has lost its edge. The comments resonated with my own experience &mdash; a shift in demeanor, a more purposeful, optimistic attitude, less itching for a fight. Yes, the conference has lost its edge, it doesn&#8217;t need one anymore.</p>
<p>Open source won. It&#8217;s not that an enemy has been vanquished or that proprietary software is dead, there&#8217;s not much regarding adopting open source to argue about anymore. After more than a decade of the low-cost, lean startup culture successfully developing on open source tools, it&#8217;s clearly a legitimate, mainstream option for technology tools and innovation.</p>
<p>And open source is not just for hackers and startups. A new class of innovative, widely adopted technologies has emerged from the open source culture of collaboration and sharing &mdash; turning the old model of replicating proprietary software as open source projects on its head. Think <a href="http://en.wikipedia.org/wiki/Git_(software)">Git</a>, <a href="http://d3js.org">D3</a>, <a href="https://github.com/nathanmarz/storm/">Storm</a>, <a href="http://nodejs.org">Node.js</a>, <a href="http://rubyonrails.org">Rails</a>, <a href="http://www.mongodb.org">Mongo</a>, <a href="http://incubator.apache.org/mesos/">Mesos</a> or <a href="http://www.spark-project.org">Spark</a>.</p>
<p>We see more enterprise and government folks intermingling with the stalwart open source crowd  who have been attending OSCON for years. And, these large organizations are actively adopting many of the open source technologies we track, e.g., web development frameworks, programming languages, content management, data management and analysis tools.</p>
<p>We hear fewer concerns about support or needing geek-level technical competency to get started with open source. In the Small and Medium Business (SMB) market we see <a href="http://radar.oreilly.com/2012/07/open-source-small-business-report.html">mass adoption of open source for content management and ecommerce applications</a> &mdash; even for self-identified technology newbies.</p>
<p><span id="more-49880"></span>
<p><a href="http://www.mysql.com">MySQL</a> appears as popular as ever and remains open source after three years of Oracle control and <a href="http://radar.oreilly.com/2012/04/microsoft-open-source.html">Microsoft is pushing open source</a> JavaScript as a key part of its web development environment and more explicit support for other open source languages. Oracle and Microsoft are not likely to radically change their business models, but their recent efforts show that open source can work in many business contexts.</p>
<p>Even more telling:</p>
<ul>
<li> With so much of the consumer web undergirded with open source infrastructure, open source permeates most interactions on the web.</li>
<li>The massive, $100 million, <a href="http://www.theregister.co.uk/2012/07/10/github_100m_investment/">GitHub investment</a> validates the open collaboration model and culture &mdash; forking becomes normal.</li>
</ul>
<p>What does winning look like? Open source is mainstream and a new norm &mdash; for startups, small business, the enterprise and government. Innovative open source technologies creating new business sectors and ecosystems (e.g., the distribution options, tools and services companies building around Hadoop). And what&#8217;s most exciting is the notion that the collaborative, sharing culture that permeates the open source community spreads to the enterprise and government with the same impact on innovation and productivity.</p>
<p>So, thanks to all of you who made the open source community a sustainable movement, the ones who were there when &#8230; and all the new folks embracing the culture. I can&#8217;t wait to see the new technologies, business sectors and opportunities you create.</p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2012/07/open-source-small-business-report.html">Economic impact of open source on small business</a></li>
<li> <a href="http://radar.oreilly.com/2012/07/open-source-enterprise-strategies.html">Open source community collaboration strategies for the enterprise</a></li>
<li> <a href="http://radar.oreilly.com/2012/04/microsoft-open-source.html">Microsoft opens up</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2012/07/open-source-won.html/feed</wfw:commentRss>
		<slash:comments>30</slash:comments>
		</item>
		<item>
		<title>7 emergent themes from Webstock reveal a framework</title>
		<link>http://radar.oreilly.com/2011/03/7-webstock-themes.html</link>
		<comments>http://radar.oreilly.com/2011/03/7-webstock-themes.html#comments</comments>
		<pubDate>Thu, 17 Mar 2011 13:00:00 +0000</pubDate>
		<dc:creator>Roger Magoulas</dc:creator>
				<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[agility]]></category>
		<category><![CDATA[business]]></category>
		<category><![CDATA[development]]></category>
		<category><![CDATA[interface]]></category>
		<category><![CDATA[users]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2011/03/7-webstock-themes.html</guid>
		<description><![CDATA[The  speakers at the recent Webstock conference in New Zealand gravitated toward many of the same themes. Taken together, these themes create a framework for building the next generation of services, applications and devices. ]]></description>
				<content:encoded><![CDATA[<p><img src="http://s.radar.oreilly.com/2011/03/15/0311-webstock.png" border="0" alt="Webstock" style="float: right;margin: 3px 0 10px 10px" />At this year&#8217;s <a href="http://www.webstock.org.nz/11/programme/">Webstock</a> in Wellington, New Zealand, there were no reports of bad acid or rainstorms to muck things up, just a few sore tummies from newcomers (like me) eating too many Pineapple Lumps. </p>
<p>Webstock wasn&#8217;t a rock concert, but a gathering of the geek tribes that lived up to its reputation with accomplished speakers, an interesting mix of topics, and a scenic venue on the harbor.</p>
<p>The conference organizers sat the speakers together and they developed a nice camaraderie &mdash; unusual for a tech conference &mdash; that had them referencing and building on each others&#8217; presentations. While the presenters came from a variety of backgrounds &mdash; engineering, design and the arts &mdash; the talks were anchored in how how their topics related to web and mobile business processes.</p>
<p>A few useful and instructive themes emerged over the course of the conference (besides presenters using metaphors from Pixar&#8217;s &#8220;Wall-E&#8221;). Sessions by <a href="http://stopdesign.com">Doug Bowman</a>, <a href="http://www.marco.org/">Marco Arment</a>, <a href="http://www.braintraffic.com/company/">Karen Halvorson</a>, <a href="http://stevesouders.com/">Steve Souders</a>, <a href="http://www.frankchimero.com/">Frank Chimero</a>, <a href="http://www.perfettimedia.com/biography/">Christine Perfetti</a>, <a href="http://globalmoxie.com/index.shtml">Josh Clark</a> and <a href="http://www.randsinrepose.com/">Michael Lopp</a> most informed the following.</p>
</p>
<h2>Deliver value early and build fast with a purpose</h2>
</p>
<p>Best results come from agile-like processes and cultures that start small and iterate fast. Not surprisingly, this common theme was echoed most often by those with startup backgrounds. By limiting features, releasing fast, and learning, products and services can quickly become aligned to user needs. Resisting feature bloat is important to staying fast, especially after initial success and getting exposed to what may be a large number of users.</p>
<p>The agile approach also lets organization stay tuned into serendipity &mdash; uses and features that users exploit that were not part of planned use scenarios (Twitter becoming a communications tool for activism is one example). Corollaries to building fast include failing fast, i.e., quickly acknowledging what doesn&#8217;t work, and using the scientific method to test and learn.</p>
</p>
<h2>Scientific method</h2>
</p>
<p>Elements of the scientific method &mdash; hypothesis setting, testing, learning and adjusting &mdash; are just as valid for web performance tuning as for user interfaces and understanding comprehension. Embracing the scientific method can help an organization establish a learning culture. It&#8217;s not trivial. To gain the most from testing you need familiarity with using data and quantitative analysis, including visualization (charts), statistics, and machine learning.</p>
<div style="border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px"><a href="http://en.oreilly.com/webexsf2011/public/regwith/websf11rad?cmp=il-radar-wx11-webstock-themes"><img style="float: left;border: none;padding-right: 10px" src="http://s.radar.oreilly.com/w2e11sf.png" /></a><a href="http://en.oreilly.com/webexsf2011/public/regwith/websf11rad?cmp=il-radar-wx11-webstock-themes"><strong>Web 2.0 Expo San Francisco 2011</strong></a>, being held March 28-31, will examine key pieces of the digital economy and the ways you can use important ideas for your own success. </p>
<p><a href="http://en.oreilly.com/webexsf2011/public/regwith/websf11rad?cmp=il-radar-wx11-webstock-themes"><strong>Save 20% on registration with the code WEBSF11RAD</strong></a></div>
</p>
<h2>Keep everything human-scaled</h2>
</p>
<p>Acknowledge that the builders and users of products and services are people whose needs should be understood and accommodated. For user interfaces, human scale means keeping operations and processes simple, functionally consistent, cognitively easy, and intuitive. For engineers and managers that means developing strategies that help align efforts, listening to users and making fast build cycles possible. Communicating on a human scale is best done through narrative processes and storytelling, and establishing a communications strategy. Apple products were often offered as examples of simple-to-use, human-scaled designs that delight users.</p>
</p>
<h2>Communicating with stories</h2>
</p>
<p>To effectively communicate with users &mdash; to share, to guide and convince, to engage &mdash; use storytelling processes like narration and cause-and-effect processes. Stories are a way to cut through and make sense of the increasing volumes and ubiquity of data and noise we experience via the web and mobile devices. Humans are wired to understand and remember stories &mdash; they make content and interfaces visceral, memorable and viral (i.e., worth repeating). Different speakers evoked storytelling as a way to develop content strategies, to stay connected with others, as an interface metaphor, e.g., using scrolling to represent the passage of time, and, as a way to put data to use.</p>
<p>My background in data and analysis kept me keenly interested in <a href="http://www.davidmccandless.com/">David McCandless&#8217;</a> &#8220;Information is Beautiful&#8221; session, with its mix of analytics, visually rendered data and storytelling. David does beautiful, playful and insightful infographics that focus on expressing the relative magnitude/scale of large numbers &mdash; especially numbers so large they are abstractions to most people &mdash; and on telling the stories in the data. His <a href="http://www.informationisbeautiful.net/2010/debtris/">Debtris</a> visualizations show the size of the US and UK deficits using a visual metaphor from the game Tetris to explains the size of the US and UK debt. David&#8217;s work is always designed to tell a story, keeping users compelled to drill further and stay engaged. It&#8217;s worth noting that David shared some failures and what he learned from them &mdash; the scientific method in action. David, who has a journalism background, calls his mix of charts and narration a &#8220;charticle&#8221; &mdash; a concept worth remembering when teasing a story out of data.</p>
</p>
<h2>Users expect multi-touch and gestural interfaces</h2>
</p>
<p>Multi-touch and gestural functions should be treated as the primary user interface, not functionality tacked onto conventional interfaces. If touch and gestures can do the job, there&#8217;s no reason to keep a keyboard or mouse. Users see multi-touch and gestures as intuitive and the &#8220;right&#8221; interface for many tasks, expecting them on many devices, not just mobile phones and tablets. Vendors who fail to fully embrace multi-touch and gestures as the primary interface to devices and services do so at their own peril.</p>
</p>
<h2>Simple is hard</h2>
</p>
<p>It&#8217;s hard work building the right amount of functionality, providing the optimal amount and frequency of content, creating devices that feel right, and designing user interfaces that are natural and &#8220;disappear.&#8221; Making a simple experience for users, one that takes the cognitive load of how to use a device so they can focus on why they are using the device, takes effort, sweating the details, thinking, testing, and creativity. To succeed at simple, build fast and use scientific learning processes to refine and improve interfaces. Spend as much time taking out what does not work as building on what does.</p>
</p>
<h2>Find inspiration everywhere</h2>
</p>
<p>The presenters commonly described creative flashes inspired by their own needs and from their curiosity about topics outside their primary expertise. Creativity often gets sparked by making non-obvious connections.</p>
<hr />
<p>Build fast, learn, simplify, tell stories, stay curious &mdash; taken together, these themes provide a useful framework for quickly building the next generation of services, applications and devices that become the warp and woof of our increasingly digital and mobile lives.</p>
<p></p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://answers.oreilly.com/topic/1413-designing-for-the-ipad-how-its-different-from-the-iphone-and-what-you-need-to-consider/">Josh Clark on designing for the iPad</a></li>
<li> <a href="http://radar.oreilly.com/2010/08/geeks-at-work.html">Geeks at work: Interview with Michael Lopp</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2011/03/7-webstock-themes.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Need faster machine learning? Take a set-oriented approach</title>
		<link>http://radar.oreilly.com/2011/01/faster-machine-learning.html</link>
		<comments>http://radar.oreilly.com/2011/01/faster-machine-learning.html#comments</comments>
		<pubDate>Fri, 28 Jan 2011 21:00:00 +0000</pubDate>
		<dc:creator>Roger Magoulas</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[machine learning]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2011/01/faster-machine-learning.html</guid>
		<description><![CDATA[We recently faced the type of big data challenge we expect to become increasingly common: scaling up the performance of a machine learning classifier for a large set of unstructured data. In this post, we explain how a set-oriented approach led to huge performance gains. ]]></description>
				<content:encoded><![CDATA[<p>We recently faced the type of big data challenge we expect to become increasingly common: scaling up the performance of a machine learning classifier for a large set of unstructured data.</p>
<p>Machine learning algorithms can help make sense of data by classifying, clustering and summarizing items in a data set. In general, performance has limited the opportunities to apply machine learning to understanding big or messy data sets. Analysts need to factor in time for speeding up off-the-shelf algorithms or even whether a machine learning pass would complete in a timely manner. While using smaller random samples can help mitigate performance issues, some data sets lend themselves to improved results when applied to more data.</p>
<p>Here we share our experience implementing a set-oriented approach to machine learning that led to huge performance increases (more detail is available in a <a href="http://answers.oreilly.com/topic/2427-how-to-speed-up-machine-learning-using-a-set-oriented-approach/">related post at O&#8217;Reilly Answers</a>). Applying a set-oriented approach can help you expand the opportunities to gain the benefits of machine learning on larger, unstructured and complex data sets.</p>
<p>We are working with the US Department of Health and Human Services (HHS) on a project to look for trends in demand for jobs related to Electronic Medical Records (EMR) and Health Information Technology (HIT). The twist, and the reason we decided to build a classifier, is that we wanted to separate jobs for those using EMR systems from those building, implementing, running and selling EMR systems. While many jobs easily fit in one of the two buckets, plenty of job descriptions had duties and company descriptions that made classifying the jobs difficult even for humans with domain expertise.</p>
<p>Identifying the approximately 400,000 jobs with EMR and related references was achieved with high accuracy using a regular expression rule-base. All the job description data is stored on a multi-node Greenplum Massively Parallel Processing (MPP) database cluster, running a Postgres engine. Having an MPP database has been critical for analyzing the large, 1.4-billion-record data set we work with &mdash; we can generally run investigative queries against the full data set in minutes.</p>
<p>After some discussion, we decided a <a href="http://en.wikipedia.org/wiki/Naive_Bayes_classifier">Naive Bayes classifier</a> seemed appropriate for the task. While there are some Python open source naive bayes classifiers available, such as <a href="http://www.nltk.org/)">NLTK</a> and <a href="http://www.ailab.si/orange/doc/ofb/c_nb.htm">Orange</a>, I decided to use the algorithm in Toby Segaran&#8217;s &#8220;<a href="http://oreilly.com/catalog/9780596529321">Programming Collective Intelligence</a>&#8221; so I could tweak the code and play with different feature arrangements. Toby does a great job of tying the code to the principles behind the naive bayes algorithm, and I thought that would help with modding and tuning the classifier for our purposes.</p>
<p>We had a tricky data set with categories that could be only subtly different. We wanted the classifier to be fast enough to iterate through the data many times so we could spend enough time training and tuning the algorithm to optimize classifier accuracy. Starting with a training set of 1,800 categorized jobs (phew&#8230;) and a random sample of 1,850 jobs, we set to work trying and reviewing different sets of feature combinations.</p>
<p>
<div style="float:right;margin: 3px 0 10px 10px;width: 250px;padding: 10px 15px 10px 15px;border: 1px solid #999;background: #ddd">We ran into a Python related problem early on that I think worth sharing. Due to the large numbers of words in a job description, the probabilities used by the Naive Bayes algorithm get exceedingly small, so small that Python turned them into zero, making for suspiciously strange results. Luckily, I complained about this problem to a friend with a doctorate in math who suggested taking the log of the probabilities, since the logs of very small numbers are not so small. Worked like a charm.</div>
<p>That&#8217;s when it hit me, job descriptions have lots of words that are often not carefully entered. That creates a large set of words and probabilities to work with, slowing down the algorithm. And, the algorithm was written to explain how Naive Bayes works, not for maximum efficiency. With training and classifying the sample data running for more than six hours, we needed to do something to speed up the process to handle all 400,000 records we wanted to classify.</p>
<p>I contacted <a href="http://www.cs.berkeley.edu/~daisyw">Daisy Zhe Wang</a>, an EECS doctoral student at UC Berkeley and a consultant at Bayes Informatics, because of her focus on scaling in-database natural language and machine learning algorithms.</p>
<p>
Daisy, together with <a href="http://www.bayesinformatics.com">Bayes Informatics</a> founder Milenko Petrovic, developed a set-oriented approach to implementing the Naive Bayes algorithm that treats the data derived from the training set (features (words) and counts) as a single entity, and converting the Naive Bayes algorithm to <a href="http://python.computersci.org/Fundamentals/DefiningFunctions">Python User Defined Functions</a> (UDFs) that, since Greenplum is a distributed database platform, let us parallelize the classifier process.</p>
<p>The result: The training set was processed and the sample data set classified in six seconds. We were able to classify the entire 400,000-record data set in under six minutes &mdash; more than a four-orders-of-magnitude records processed per minute (26,000-fold) improvement. A process that would have run for days, in its initial implementation, now ran in minutes! The performance boost let us try out different feature options and thresholds to optimize the classifier. On the latest run, a random sample showed the classifier working with 92% accuracy.</p>
<p>My simple understanding of their algorithm is that training set results are treated like a model and stored as a single row/column in the database. They&#8217;re parsed into a permanent Python data structure once, while each job description is parsed into another temporary data structure. The Python UDFs compare the words in the temporary data structure to the words in the model. The result is one database read for each job description and a single write once the probabilities are compared and the classification assignment made. That&#8217;s quite a contrast from reading and writing each word in the training set and the unassigned job.</p>
<p>Why does the set-oriented approach to machine learning matter? Performance and scale issues have long been a problem when trying to fully apply machine learning to large or unruly unstructured data sets. Set-oriented machine learning provides a straightforward way to bypass performance roadblocks, making machine learning a viable option for categorizing, clustering or summarizing large data sets or data sets with big chunks of data (e.g., descriptions or items with large numbers of features).</p>
<p>With any data set, speeding up machine learning processes allows quicker iterations through the data. That creates room to run experiments that improve accuracy and more time to focus on and interpret results to gain insights about the data. Quicker processing reduces the risk of applying machine learning to new topics by reducing the time investment to determine if results are worthwhile.</p>
<p>In summary, the performance boost provided by set-oriented machine learning makes for:</p>
<ul>
<li> Handling larger and more diverse data sets</li>
<li> Applying machine learning to a larger set of problems</li>
<li> Faster turnarounds</li>
<li> Less risk</li>
<li> Better focus on a problem</li>
<li> Improved accuracy, greater understanding and more usable results</li>
</ul>
<div style="border-top: thin gray solid;padding: 20px;margin: 20px 2px"><a href="https://en.oreilly.com/strata2011/public/register?cmp=il-radar-st11-data-warehouse"><img style="float: left;border: none;padding-right: 10px" src="http://s.radar.oreilly.com/strata11-promo-radar.png" /></a><a href="http://strataconf.com/?cmp=il-radar-st11-data-warehouse"><strong>Strata: Making Data Work</strong></a>, being held Feb. 1-3, 2011 in Santa Clara, Calif.,  will cover many similar topics related to big data, machine learning, analytics and visualization.</p>
<p><!-- a href="https://en.oreilly.com/strata2011/public/register?cmp=il-radar-st11-data-warehouse"&gt;<strong>Save 30% off registration with the code STR111RAD</strong>&lt;/a &#8211;></div>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2011/01/faster-machine-learning.html/feed</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Ebooks and the threat from &quot;internal constituencies&quot;</title>
		<link>http://radar.oreilly.com/2010/11/ebooks-and-the-threat-from-int.html</link>
		<comments>http://radar.oreilly.com/2010/11/ebooks-and-the-threat-from-int.html#comments</comments>
		<pubDate>Tue, 02 Nov 2010 13:00:00 +0000</pubDate>
		<dc:creator>Roger Magoulas</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[business models]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[publishers]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2010/11/ebooks-and-the-threat-from-int.html</guid>
		<description><![CDATA[Will internal constituencies bias how publishers value print book and ebook business models? Roger Magoulas examines that question and looks at the complementary relationship between print and electronic forms. ]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.newyorker.com/talk/financial/2010/10/18/101018ta_talk_surowiecki">A recent New Yorker article by James Surowiecki</a> on the problem of &#8220;internal constituencies&#8221; and how organizations respond to technology and market changes seems relevant to the ebook conundrum publishers are facing.</p>
<p>Surowiecki highlights how Blockbuster was unable to correctly value the assets that created the company&#8217;s initial success, especially when faced with insurgents like Netflix and Redbox. He ends with a warning for Netflix, and a look at the uncertain world of digital distribution to come.</p>
<p>The summary of how Blockbuster overvalued their &#8220;clicks and mortar&#8221; strategy may provide a catalyst for publishers to look at the opportunities and threats from ebooks. Will internal constituencies bias how publishers value and contrast print book and ebook assets and business models?</p>
</p>
<h2>Subjective rambling on print and ebooks</h2>
</p>
<p>Lessons from music and movies need to be tempered by the nature of the media. Vinyl, VCR tapes, CDs, and DVDs do nothing but carry information and require a player. While some may wax nostalgic for the media of their youth (yes, the pun is intentional; note the resurgence in vinyl records sales), the media of yore are awkward and limiting, particular<br />
when compared to playing devices with enough built-in storage and connectivity to access everything users may ever want to hear or watch.</p>
<p>Print books and magazines &#8211;black print on white paper &#8212; create a uniquely effective reading platform that integrates both storage and a player in a convenient package. Reading material is consumed differently than music and movies. Books generally take longer to read and aren&#8217;t continually or frequently re-read, making the low storage density of books less of an issue than with music and movies. Ebooks and ebook readers are still maturing and may not be &#8220;good enough&#8221; yet to effectively replace the print book experience.</p>
<p>In books we may see a complementary relationship between print and electronic forms based on context, content, distribution and consumer usage. For example, students needing portable access to multiple textbooks may find the storage density of print a significant issue that pushes the adoption of ebooks (there&#8217;s a <a href="http://archives.newyorker.com/?i=2010-09-13">funny New Yorker cover</a> showing a young girl with a backpack leading a mule, laden with books and school supplies).</p>
<p>Likewise, technical folks who want random access to a broad range of reference material may find the storage density, search capability and instant distribution of ebooks an unassailable benefit. For many others, the comfort and familiarity with print, the ereader experience, slow consumption rates, etc., may all continue to create demand for print books.</p>
<p>The audio book market may provide guidance as an alternate media channel that complementarily coexists with print books. Audio books have the same distribution characteristics as ebooks, as they are almost entirely distributed online these days. There&#8217;s also a market maker in <a href="http://www.audible.com">Audible</a>, where prices likely provide reliable information about demand and price elasticity (Audible has alternate flat subscription pricing). Audible charges more for new books, best sellers and evergreen sellers. Audio books are generally priced higher than Amazon&#8217;s print and ebook prices, anecdotally 10-40 percent higher for bestsellers. (A digression: researching Amazon pricing for best sellers and evergreen books shows about a 10 percent (+/- 7 percent) discount for ebooks compared to print books on Amazon.) </p>
<p>Smart publishers can work to keep the price differential between print books and ebooks close, and learn how to segregate price insensitive consumers via temporal distribution strategies (e.g., early access), distribution channels (e.g., print on demand), content (e.g., colors and diagrams that don&#8217;t render as clearly electronically) and form<br />
factors that can provide extra margins from the commodity, low-margin mass market.</p>
<p>Or, maybe I&#8217;m just showing my age and internal constituencies.</p>
<p></p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2010/09/beyond-ebooks-publisher-as-api.html">The line between book and Internet will disappear</a></li>
<li> <a href="http://radar.oreilly.com/2010/08/i-am-a-veteran-of.html">View From the Trenches: Surviving Change</a></li>
<li> <a href="http://radar.oreilly.com/2010/10/the-ubiquitous-platform-play.html">Pandora&#8217;s  ubiquitous platform play</a></li>
<li> <a href="http://radar.oreilly.com/2010/08/what-publishers-can-and-should.html">What publishers can and should learn from &#8220;The Elements&#8221;</a></li>
</ul>
<p></p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2010/11/ebooks-and-the-threat-from-int.html/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>