<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>O&#039;Reilly Radar &#187; Liza Daly</title>
	<atom:link href="http://radar.oreilly.com/lizad/feed" rel="self" type="application/rss+xml" />
	<link>http://radar.oreilly.com</link>
	<description>Insight, analysis, and research about emerging technologies</description>
	<lastBuildDate>Fri, 24 May 2013 12:45:24 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>iPad and ebooks: Lots of unanswered questions</title>
		<link>http://radar.oreilly.com/2010/04/ipad-and-ebooks-lots-of-unansw.html</link>
		<comments>http://radar.oreilly.com/2010/04/ipad-and-ebooks-lots-of-unansw.html#comments</comments>
		<pubDate>Fri, 02 Apr 2010 13:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[iPad]]></category>
		<category><![CDATA[publishing]]></category>
		<category><![CDATA[web apps]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2010/04/ipad-and-ebooks-lots-of-unansw.html</guid>
		<description><![CDATA[Liza Daly says a host of unanswered questions about the iPad&apos;s ebook functionality coupled with the disconnect between simulators and hardware, will delay publishing innovation. But one upside: the iPad&apos;s hardware will ultimately benefit both native apps and web-based apps. ]]></description>
				<content:encoded><![CDATA[<p>Right now people are conflating the whiz-bang demos from <a href="http://radar.oreilly.com/2010/03/trapping-content-on-the-ipad-w.html">Wired</a>, <a href="http://paidcontent.co.uk/article/419-first-look-how-penguin-will-reinvent-books-with-ipad/">Penguin</a> and the <a href="http://mashable.com/2010/03/17/awesome-ipad-digital-magazine-demo-video/">magazine world</a> with the iBooks offering. In fact, no one can say anything right now other than that iBooks uses the <a href="http://en.wikipedia.org/wiki/EPUB">EPUB format</a>.  Will books from iBooks be able to include custom fonts? Video? Audio? All unknowns.</p>
<p><a href="http://oreilly.com/ipad"><img alt="iPad Coverage" src="http://s.radar.oreilly.com/ipad-promo-250.jpg" border="0" width="250" height="190" style="float: right;margin: 0 0 12px 12px" /></a>If it&#8217;s true that books in the <a href="http://en.wikipedia.org/wiki/IBookstore">iBookstore</a> can include video, for example, what kind of video? Adobe promotes Flash video for use in EPUB, but of course there&#8217;s no Flash in the whole iPad/iPhone ecosystem.  So yes, EPUB is a standard, but the kind of content that would really showcase a tablet versus an E-Ink reader &#8212; like video &#8212; seems to have already fragmented before the device even hits the marketplace.  That kind of fragmentation is going to deter publishers from doing much more than what they&#8217;re already doing: making text-only EPUB for wide distribution, and doing the occasional book-as-app experiment (at a premium price point).</p>
<p>We&#8217;re already seeing the tension between <a href="http://www.businessinsider.com/non-us-developers-angry-about-ipad-delays-aapl-2010-3">Apple&#8217;s own interests and their control of the App Store</a> with the announcement that many apps (notably the Kindle) <a href="http://techcrunch.com/2010/03/22/app-store-ipad/">won&#8217;t be available in the App Store on release</a> of the iPad because they haven&#8217;t been able to test on a real device.  And then &#8230; how long until they release? That&#8217;s totally up to the Apple approval process.</p>
<p>In our development of an HTML5-based web app, we&#8217;ve found that hardware profiles are key to making web apps really compete. <a href="http://ibisreader.com/">Ibis Reader</a> isn&#8217;t nearly as compelling on an older iPod Touch as it is on an iPhone 3GS. By providing what is apparently a really sexy tablet that&#8217;s capable of running video and games fluidly, Apple also widens the playing field to let web apps shine too.  As the hardware increases in capacity, web apps become increasingly attractive as an end product, not just as a way for web developers to play in the app space.</p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2010/04/ipad-and-ebooks-lots-of-unansw.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Slides from &quot;What Publishers Need to Know about Digitization&quot; Webcast</title>
		<link>http://radar.oreilly.com/2008/11/slides-from-the-webcast-what-p.html</link>
		<comments>http://radar.oreilly.com/2008/11/slides-from-the-webcast-what-p.html#comments</comments>
		<pubDate>Thu, 13 Nov 2008 22:18:10 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[ebook]]></category>
		<category><![CDATA[EPUB]]></category>
		<category><![CDATA[Google Book Search]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[scanning]]></category>
		<category><![CDATA[schema]]></category>
		<category><![CDATA[xml]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/11/slides-from-the-webcast-what-p.html</guid>
		<description><![CDATA[Slides from the &#34;What Publishers Need to Know about Digitization&#34; webcast. ]]></description>
				<content:encoded><![CDATA[<p>TOC will be posting a complete recording of the presentation, but in the meantime I&#8217;ve posted the slides from yesterday&#8217;s webcast, &#8220;<a href="http://toc.oreilly.com/2008/11/toc-webcast-tomorrow-what-publ.html">What publishers need to know about digitization</a>&#8221; on Slideshare.</p>
<p>Thanks to everyone who attended and especially to those who asked so many excellent questions.</p>
<p><div align="center">
<div style="width:425px;text-align:left">
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px">View SlideShare <a href="http://www.slideshare.net/lizadaly/what-publishers-need-to-know-about-digitization-presentation?type=powerpoint" title="View What publishers need to know about digitization on SlideShare">presentation</a> or <a href="http://www.slideshare.net/upload?type=powerpoint">Upload</a> your own. (tags: <a href="http://slideshare.net/tag/schema">schema</a> <a href="http://slideshare.net/tag/epub">epub</a>)</div>
</div>
<p><img style="width:0px;height:0px" border="0" width="0" height="0" src="http://counters.gigya.com/wildfire/IMP/CXNID=2000002.0NXC/bT*xJmx*PTEyMjY1OTYzNTY1NjAmcHQ9MTIyNjU5NjM3MzE2NSZwPTEwMTkxJmQ9Jmc9MiZ*PSZvPWRmOGM3MDgzOWYyYjQzOTliMmZlYWZkZDc1YWFkZDk3.gif" /></div>
</p>
<p><span id="more-42568"></span></p>
<h2>Upcoming TOC Webcasts</h2>
<p></p>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/11/slides-from-the-webcast-what-p.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Analog Hole: Another Argument Against DRM</title>
		<link>http://radar.oreilly.com/2008/10/the-analog-hole-in-digital-boo.html</link>
		<comments>http://radar.oreilly.com/2008/10/the-analog-hole-in-digital-boo.html#comments</comments>
		<pubDate>Thu, 23 Oct 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[analog hole]]></category>
		<category><![CDATA[digital books]]></category>
		<category><![CDATA[Digital Rights Management]]></category>
		<category><![CDATA[DRM]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[piracy]]></category>
		<category><![CDATA[scanning]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/10/the-analog-hole-in-digital-boo.html</guid>
		<description><![CDATA[No matter how strong the encryption, digital rights management can&apos;t block &#34;analog hole&#34; piracy. ]]></description>
				<content:encoded><![CDATA[<p><a href="http://toc.oreilly.com/resources/drm.html">Digital rights management</a> (DRM) might be unpopular with the public and<br />
plagued with social and technical challenges, but at least it&#8217;s a guarantee that digital books can&#8217;t be pirated &mdash; right?
</p>
<p>
Not so fast. Experienced computer crackers will find weaknesses in any encryption scheme, but regular folks with basic computer<br />
skills can exploit the one weakness found in all DRM&#8217;ed media: the analog hole.
</p>
<p>
<b>What is the Analog Hole?</b></p>
<p>
The &#8220;analog hole&#8221; reflects a basic principle of physics: before humans<br />
can consume any digital media, the ones and zeroes that computers<br />
understand  must be converted into an analog format<br />
that our senses can perceive.  For music, it&#8217;s sound waves; for<br />
video and for digital books, it&#8217;s patterns of light.
</p>
<p>
If you&#8217;ve ever visited a major metropolitan city you&#8217;ve probably<br />
seen the analog hole in action: street vendors selling pirated<br />
copies of popular movies, often months before they&#8217;re officially<br />
released on DVD.  Most of these are &#8220;cam&#8221; films, shot in real<br />
movie theaters using camcorders.  Even without access to a<br />
physical copy of the film, pirates are able to capture its analog<br />
expression: the sound and pictures as perceived by a theater-goer.
</p>
<p>
In music, the analog hole is often used to get around software<br />
preventing digital copying.  A user simply plays the the desired song<br />
on their computer using the legal DRM-enabled software, and records<br />
the audio coming out of their computer.  Now they have a copy of<br />
the sound recording, which can be re-imported into the computer<br />
and digitally-encoded, with the original DRM stripped out.  (A<br />
similar principle is at work when DRM systems go defunct and<br />
users are told to <a href="http://arstechnica.com/news.ars/post/20080928-wal-mart-latest-to-shut-down-drm-key-servers.html">pirate<br />
their own music</a>, although the industry uses the euphemism &#8220;making a backup.&#8221;)
</p>
<p>
Film and music companies are painfully aware of the analog hole<br />
and have taken steps to close it, either by<br />
monitoring patron behavior (as in movie theaters) or by<br />
petitioning to <a href="http://www.eff.org/issues/analog-hole">legally limit the recording features of consumer electronics</a>.
</p>
<p>
Because reading is a visual experience,<br />
there is the possibility of an analog hole exploit.  Unlike<br />
with camcorder copies or re-burned MP3s, there is a potential for no<br />
loss in quality.  And with a little ingenuity, the process can<br />
be completely automatic.
</p>
<p>
<b>One example: Ebooks and Optical Character Recognition (OCR)</b></p>
<p>Here&#8217;s a sample digital book as displayed in Adobe Digital Editions.  (This<br />
book is public domain and isn&#8217;t technically covered by DRM, but the principle<br />
is exactly the same.)</p>
<p><span class="mt-enclosure mt-enclosure-image"><a href="http://toc.oreilly.com/assets_c/2008/10/pride-chapter-one.html"><img src="http://toc.oreilly.com/assets_c/2008/10/pride-chapter-one-thumb-300x262.png" width="300" height="262" alt="pride-chapter-one.png" class="mt-image-center" style="text-align: center;margin: 0 auto 20px" /></a></span></p>
<p>
I hid as much of the Digital Editions menus as I could and took a screenshot of this first page of <i>Pride and Prejudice</i>.
</p>
<p>
Next I downloaded some free optical character recognition (OCR) software.  OCR<br />
programs can &#8220;read&#8221; images and output the words in them as plain text.  It&#8217;s a normal<br />
part of digitization projects, in which archival printed material is first scanned<br />
and its text is automatically extracted.  At the consumer level, OCR software is often bundled with commercial scanners and fax machines.
</p>
<p>
I took my screenshot and fed it to the OCR software. Here&#8217;s what I got without any special fine-tuning or spell-checking. <em>Note<br />
that all typos are from the OCR software.</em>
</p>
<blockquote><p>
<b>Chapter 1</b><br />
<br />
It is a truth universally acknowledged, that a single man in<br />
possession ofa large fortune must be in want of a wife, However little<br />
known the feelings or views of such a man may be on his first entering<br />
a neighbourhood, this truth is so well fixed in the minds of the<br />
surrounding families, that he is considered the rightful property of<br />
someone or other of their daughters. <br />
&#8220;My dear Mr. Bennet,&#8221; said his lady to him one day, &#8220;have you heard<br />
that Netherfield Park is let at last?&#8221; <br />
Mr. Bennet replied that he had not.
</p></blockquote>
<p>&#8230;and on through the entire first page.  This output was in HTML, ready to be<br />
posted  to the Web for anyone to read.
</p>
<p>
The OCR isn&#8217;t 100 percent accurate, of course, but neither are the widely-available<br />
pirated ebooks created by laborious scanning of physical books, page<br />
after page. I was also using free software that requires careful fine-tuning to get working optimally; commercial OCR software is much better, especially when combined with<br />
spell-checking.
</p>
<p>It wouldn&#8217;t be difficult to automate the process of advancing one page in Digital Editions, taking a screenshot, and passing that on to my OCR software.  Once the workflow was in place, I could strip hundreds or thousands of books of their DRM in a matter of minutes.
</p>
<p>
<b>Another Possibility: Speech Recognition</b></p>
<p>My local library is kind enough to allow me to check out digital audiobooks.  Naturally<br />
they&#8217;re also secured with DRM (so much so that I can&#8217;t actually play them, as they require Windows Media Player and I have only Mac and Linux computers).  But assuming I could play them, I&#8217;d have available to me a nice stream of professionally-produced audio.
</p>
<p>
You&#8217;re using <a href="http://en.wikipedia.org/wiki/Speech_recognition">speech recognition</a> software every time you call a customer service line and an<br />
automated voice prompts you to speak your credit card number.  If that&#8217;s happened<br />
to you, you also know that speech recognition isn&#8217;t 100 percent accurate yet, but under<br />
certain conditions it can be quite good.  Automatic speech-to-text transcription<br />
isn&#8217;t nearly as far along as optical character recognition, but it&#8217;s another<br />
analog hole exploit that will eventually become trivial to perform.
</p>
<p>
<b>Does This Mean Publishers Shouldn&#8217;t Produce Ebooks or Audiobooks?</b></p>
<p>No!  What I hope to convey is that DRM is not a true safeguard against ebook piracy.<br />
(It is, however, a known deterrent to ebook <i>adoption</i>.)  I&#8217;ve heard a lot of passing the buck on DRM: publishers claim authors want it, booksellers claim publishers<br />
insist on it.  These days it&#8217;s hard to find someone to publicly state that they&#8217;re<br />
actually for it.
</p>
<p>I think of DRM like this:  years ago my apartment was broken into and I<br />
called a locksmith to replace the door.  My landlord had authorized me to<br />
get &#8220;the best lock possible,&#8221; and the locksmith obliged with a four-foot steel<br />
bolt. It was almost too heavy to turn but made a very satisfying noise when it snapped shut.
</p>
<p>I asked the locksmith, &#8220;Is this really unbreakable?&#8221;</p>
<p>&#8220;The lock is, sure.&#8221;  He slapped the door frame.  &#8220;But this is made out of<br />
wood.  If I really wanted to get in I&#8217;d just kick out the door.  That&#8217;s why I&#8217;m honest<br />
about what I sell.&#8221; When I looked<br />
puzzled he handed me his business card.<br />
It contained his name, phone number, and company slogan: &#8220;A feeling of security.&#8221;
</p>
<p>
Authors and publishers should be compensated for their talent and their hard work,<br />
and the desire for DRM is understandable. Book lovers, too, want their<br />
favorite authors to succeed. But digital books are a form of technology as<br />
much as they are literature, and technologies that are successful adapt to people&#8217;s<br />
needs.  Is just a &#8220;feeling&#8221; of security worth the ire of good customers who<br />
want to read their books wherever and however they like?
</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://tim.oreilly.com/pub/a/p2p/2002/12/11/piracy.html">Tim O&#8217;Reilly: &#8220;Piracy is Progressive Taxation &#8230;&#8221;</a></li>
<li> <a href="http://toc.oreilly.com/2008/09/spore-backlash-is-drm-official.html">&#8220;Spore&#8221; Backlash: Is DRM Officially Bad for Business?</a></li>
<li> <a href="http://toc.oreilly.com/2008/08/pirates-convince-game-develope.html">Pirates Convince Game Developer to Drop DRM</a></li>
<li> <a href="http://toc.oreilly.com/2008/04/charting-the-pitfalls-of-drm.html">Charting the Pitfalls of DRM</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/putting-ebook-piracy-into-pers.html">Putting Ebook Piracy into Perspective</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/10/the-analog-hole-in-digital-boo.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Publishing Lessons from Web 2.0 Expo</title>
		<link>http://radar.oreilly.com/2008/09/web-20-expo-nyc-wrap-up-draft.html</link>
		<comments>http://radar.oreilly.com/2008/09/web-20-expo-nyc-wrap-up-draft.html#comments</comments>
		<pubDate>Fri, 26 Sep 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[social networking]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/09/web-20-expo-nyc-wrap-up-draft.html</guid>
		<description><![CDATA[A collection of publishing-centric takeaways from the recent Web 2.0 conference in New York City. ]]></description>
				<content:encoded><![CDATA[<p>
Last week I was in New York for the city&#8217;s first <a href="http://en.oreilly.com/Webexny2008/public/content/home">Web 2.0 Expo</a>. I was a member of the program committee and one of our goals was to make it a uniquely New York event. This meant a real focus on measurable outcomes and integrating Web 2.0 principles into established business, in contrast with the more startup-friendly atmosphere of the San Francisco event. The fact that the conference ran during the week of the Wall Street meltdown only reinforced the need for pragmatism in tough economic times.
</p>
<p>
Naturally I was interested in applying what I learned to the publishing world.  If you couldn&#8217;t make it to the event, here were my big take-aways:
</p>
<p>
<strong>Web 2.0 is social software</strong><br />
Consultant <a href="http://Web2.socialcomputingmagazine.com/">Dion Hinchcliffe&#8217;s</a> tutorial on the Web 2.0 landscape summed it up best: <em>Web 2.0 means software that gets better the more people use it</em>.  This is radically different from traditional software development, which gets better only when programmers add new features.  (In the case of Microsoft Word, it generally gets worse.)
</p>
<p>
The best example in the publishing space is <a href="http://www.librarything.com/">LibraryThing</a>, which has a more accurate book catalog than Amazon.com, but also content found nowhere else.  My favorites are the <a href="http://www.librarything.com/groups/iseedeadpeoplesbooks">Legacy Libraries</a>, which collect works associated with famous dead people.  The Legacy Library project illustrates a related principle of Web 2.0: <em>encourage unintended uses</em>.  LibraryThing was designed for individuals to catalog and rate their own books, but this user-driven initiative has added tremendous unexpected value.
</p>
<p>
<strong>Thinking outside the box</strong><br />
That is, outside of a single computer (geeks like to call them &#8220;boxes&#8221;).  More Web applications are either being built on top of other services, or make use of so-called cloud computing.  Amazon, Google and other providers now offer a wealth of ready-made software and infinite computing power to allow companies to leapfrog over problems of cost and scaling.
</p>
<p>
Only a few years ago when I was approached by a publisher to start a project, we would begin at the beginning: purchasing a computer, selecting a service provider, writing some HTML, crunching some data.  With services like Amazon&#8217;s <a href="http://aws.amazon.com/ec2/">Elastic Compute Cloud</a>, there&#8217;s no longer any need to buy hardware: instantly an application can be deployed on one computer, or a thousand, at very low cost.  This makes experimentation much more feasible: if no users come to a new product, no expensive hardware investment has been wasted.  If it&#8217;s successful, a few keystrokes can add 10X the computing power.
</p>
<p>
Cloud computing has also created tremendous benefit for offline processing tasks, as shown by <a href="http://toc.oreilly.com/2008/08/processing-the-deep-backlist-a.html">The New York Times</a> when converting their digitized archive for use on the Web.
</p>
<p>
<strong>It&#8217;s not just about people, it&#8217;s about data</strong><br />
Finally, Toby Segaran&#8217;s talk on &#8220;<a href="http://Webexny2008.crowdvine.com/talks/show/1050">The Ecosystem of Corporate and Social Data</a>&#8221; reminded me how much value publishers have.  Toby explored clever ways of finding usually-expensive data for free (for example, rather than paying for Yellow Page listings of restaurants, he scraped the New York City health department Web site, which includes ratings of every food-service facility).
</p>
<p>
Diving deeper, he emphasized how much value can be added to digital services if they are already full of content.  Wikipedia came preloaded with a public domain encyclopedia, as it&#8217;s much easier to correct or update old content than to enter it wholesale.  The more of your content that users can find and interact with (for example, by providing an extensive full-content backlist), the more engaged they&#8217;ll be.
 </p>
<p>
Speaker presentations for the conference are available here: <a href="http://en.oreilly.com/Webexny2008/public/schedule/proceedings">Web 2.0 NYC presentations</a>.</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://www.oreillynet.com/pub/a/oreilly/tim/news/2005/09/30/what-is-web-20.html">Tim O&#8217;Reilly: &#8220;What Is Web 2.0?&#8221;</a></li>
<li> <a href="http://toc.oreilly.com/2008/07/cloud-computings-potential-imp.html">Cloud Computing&#8217;s Potential Impact on Publishing</a></li>
<li> <a href="http://toc.oreilly.com/2008/09/qa-with-hadrien-gardeur-co-fou.html">Q&amp;A with Hadrien Gardeur, Co-Founder of Feedbooks</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/qa-with-susan-danziger-ceo-of.html">Q&amp;A with Susan Danziger, CEO of DailyLit</a></li>
<li> <a href="http://toc.oreilly.com/2008/04/qa-with-webook-president-sue-h.html">Q&amp;A with WEbook President Sue Heilbronner</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/09/web-20-expo-nyc-wrap-up-draft.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Read any Type of Document on the Kindle (Almost)</title>
		<link>http://radar.oreilly.com/2008/08/how-to-read-any-type-of-docume.html</link>
		<comments>http://radar.oreilly.com/2008/08/how-to-read-any-type-of-docume.html#comments</comments>
		<pubDate>Tue, 26 Aug 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[comics]]></category>
		<category><![CDATA[ebook readers]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[Kindle]]></category>
		<category><![CDATA[manga]]></category>
		<category><![CDATA[Mobi]]></category>
		<category><![CDATA[PDF]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/08/how-to-read-any-type-of-docume.html</guid>
		<description><![CDATA[With a little work and help from some undocumented features, it&apos;s possible to read PDFs with their native layout on the Kindle. ]]></description>
				<content:encoded><![CDATA[<p>
There are a few options for readers who want to convert PDFs or other non-supported files to the Kindle&#8217;s AZW format. Amazon&#8217;s <a href="http://www.amazon.com/gp/help/customer/display.html?nodeId=200140600&amp;#send">recommended method</a> is to email the file to your personal Kindle email address. It&#8217;s also possible for users to convert PDFs and other document types themselves using <a href="http://www.mobipocket.com/en/downloadSoft/ProductDetailsCreator.asp">Mobipocket Creator</a> or <a href="http://www.lexcycle.com/faq#4n46">Stanza</a>.
</p>
<p>All of the above methods have the same flaw: AZW does not support the kind of advanced layout available in formats like PDF, and non-Latin fonts aren&#8217;t easy to convert. What if you need to review a complex legal form, or read a graphic novel, or one in Chinese? A hidden feature can help.
</p>
<p>
<a href="http://toc.oreilly.com/2008/08/20/screen_shot-50087.html"><img src="http://toc.oreilly.com/2008/08/20/screen_shot-50087-thumb-200x266.gif" alt="screen_shot-50087.gif" style="margin: 0 0 10px 15px;float: right" width="200" height="266" /></a><br />
The Kindle has an undocumented picture-viewing mode that was first uncovered by <a href="http://igorsk.blogspot.com/2007/12/hacking-kindle-part-3-root-shell-and.html">Igor Skochinsky</a>. Although the black and white E Ink screen is not especially good at displaying actual photographs, it is quite good at rendering line art and text.</p>
<p>
Here&#8217;s how to do it, using PDF as an example. Note that unofficial features may be buggy and could damage your Kindle; proceed at your own risk.</p>
<ol>
<li> Convert the PDF to a series of images. Commercial versions of Acrobat should be able to do this in batch, but users of free readers may have to convert a page at a time. The Kindle can read JPEG, PNG and GIF; the latter two will work best. Because the picture-viewing application doesn&#8217;t support a table of contents, you&#8217;ll need to name the image files in ascending alphabetical or numeric order (e.g. &#8220;0001.jpg,&#8221; &#8220;0002.jpg,&#8221; etc.)  For best results, resize the image to 600 x 800, the resolution of the Kindle screen.</li>
<li> Connect the Kindle to your computer using the USB cable. Once connected, browse to the Kindle&#8217;s drive. If you have an SD card installed that will appear on your computer as well. The following procedure works on either the Kindle or the SD card. I prefer to do everything on the SD card &#8212; it feels safer.</li>
<li> Create a folder called &#8220;pictures,&#8221; and a folder inside of that with the name of your &#8220;document.&#8221; Put the images in the document folder. Disconnect the Kindle from the PC. When you go to the Kindle&#8217;s home screen, nothing will have changed. This is where the secret feature comes in:</li>
<li> Press Alt-Z from the home screen. Your book title should appear in the list.</li>
<li> Click on the book title.  It will open the first image.  Use the normal Kindle next/previous buttons to page through the &#8220;book.&#8221; The picture viewer has menu options of its own to control the size of the image and how it&#8217;s rendered.</li>
</ol>
<p>
<div style="float:right"><a href="http://toc.oreilly.com/2008/08/20/screen_shot-50096.html"><img src="http://toc.oreilly.com/2008/08/20/screen_shot-50096-thumb-200x266.gif" alt="screen_shot-50096.gif" style="margin: 0 0 10px 15px;float: right" /></a></p>
<p style="font-size:9px;text-align: right">Credit: <a href="http://www.octopuspie.com/">octopus pie</a></p>
</div>
<p>
Of course because the &#8220;PDF&#8221; is really an image it&#8217;s not possible to search the document or rescale the fonts. Text-heavy PDFs should be converted in one of the recommended ways.</p>
<p>This same technique can be used to  load image-based documents directly, such as comics. (Peeking inside the &#8220;pictures&#8221; folder after it&#8217;s been read by the Kindle reveals a file with the extension <a href="http://en.wikipedia.org/wiki/Manga">manga</a>, suggesting that the picture viewer was intended to be used for this purpose).</p>
<p>It&#8217;s also possible to convert documents in Russian, Chinese or other non-Latin scripts this way.  The Kindle does have support for embedded non-Latin fonts as part of its &#8220;Topaz&#8221; file format, but there are no tools for end-users that output Topaz.</p>
<p>(Screenshots courtesy the undocumented Alt-Shift-G feature, which saves to the root of the SD card.)</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://toc.oreilly.com/2008/08/optimizing-web-content-for-the.html">Optimizing Web Content for the Kindle Browser</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/hacking-the-kindle.html">Hacking the Kindle</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/exploring-diy-ereader-platform.html">Exploring DIY E-Reader Platforms</a></li>
<li> <a href="http://toc.oreilly.com/2008/05/what-would-your-ideal-ereader.html">What Would Your Ideal E-Reader Look Like?</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/08/how-to-read-any-type-of-docume.html/feed</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Optimizing Web Content for the Kindle Browser</title>
		<link>http://radar.oreilly.com/2008/08/optimizing-web-content-for-the.html</link>
		<comments>http://radar.oreilly.com/2008/08/optimizing-web-content-for-the.html#comments</comments>
		<pubDate>Wed, 13 Aug 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[ebook formats]]></category>
		<category><![CDATA[ebook readers]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[EPUB]]></category>
		<category><![CDATA[Kindle]]></category>
		<category><![CDATA[kindle 2.0]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/08/optimizing-web-content-for-the.html</guid>
		<description><![CDATA[Distribute ebooks on your Web site? Make sure Kindle users can get what they&apos;re looking for. ]]></description>
				<content:encoded><![CDATA[<p><a href="http://toc.oreilly.com/2008/08/08/kindle-1.html"><img src="http://toc.oreilly.com/2008/08/08/kindle-1-thumb-300x450.jpg" alt="Amazon Kindle" class="mt-image-none" style="margin: 0 0 10px 15px;float: right" width="200" /></a></p>
<p>Amazon&#8217;s Kindle store is convenient, easy-to-use and stocked with thousands of titles.<br />
But what about publishers and content distributors who want to reach the<br />
<a href="http://www.techcrunch.com/2008/08/01/we-know-how-many-kindles-amazon-has-sold-240000/">estimated 240,000</a> Kindle users without going through Amazon&#8217;s program? And what about content formats that the Kindle does not directly support?
</p>
<p>
One selling point of the device is its free, ubiquitous Internet service<br />
and Web browser.  Amazon has filed the browser under &#8220;Experimental&#8221; but it&#8217;s<br />
quite usable as-is.  With a few simple changes to a Web site&#8217;s HTML code, it&#8217;s<br />
even possible to specially cater to Kindle users.
</p>
<p>
The screenshots used in this article are from the mobile version of<br />
<a href="http://bookworm.threepress.org/">Bookworm</a>, my Web application<br />
for reading ebooks in the <a href="http://openebook.org/">EPUB</a> format.<br />
Although what&#8217;s being displayed is ebook content, it&#8217;s being delivered by the Kindle&#8217;s browser, not the Kindle ebook technology, which does not yet support EPUB.
</p>
<p>
Because the mobile Web version is<br />
already heavily optimized for small devices, the layout is simpler than<br />
a traditional Web site. What works for an iPhone or other wireless device<br />
will also be a good starting point for the Kindle, although we&#8217;ll see there<br />
are some special considerations that don&#8217;t apply to any other device.
</p>
<p><b>Default or Advanced Mode?</b></p>
<p>
When the Kindle ships, its Web browser is in &#8220;default mode.&#8221;  It will not load<br />
images or CSS styles, but it does render basic HTML tags like the italic tag &lt;i&gt;.<br />
Personally, I prefer &#8220;advanced mode,&#8221; which displays Web pages more like a<br />
traditional browser, but some sites can be unreadable in this mode.
</p>
<p>
When optimizing for the Kindle it&#8217;s best to consider that most users will not<br />
change from &#8220;default mode,&#8221; or even realize that the option exists.
</p>
<p>
How different are these modes?  Here is a comparison shot of the same screen<br />
from Bookworm in both modes:
</p>
<div align="center">
<table width="400" border="0" style="margin: 0 0 15px 0">
<tbody>
<tr>
<td style="padding: 0 10px 5px 0" valign="top" width="200"><a href="http://toc.oreilly.com/2008/08/08/kindle-4.html"><img src="http://toc.oreilly.com/2008/08/08/kindle-4-thumb-200x173.jpg" alt="kindle-4.jpg" class="mt-image-none" width="200" height="173" /></a></td>
<td style="padding: 0 0 5px 10px" valign="top" width="200"><a href="http://toc.oreilly.com/2008/08/08/kindle-6.html"><img src="http://toc.oreilly.com/2008/08/08/kindle-6-thumb-200x178.jpg" alt="kindle-6.jpg" class="mt-image-none" width="200" height="178" /></a></td>
</tr>
<tr>
<td style="padding: 0 10px 5px 0" valign="top"><i>My list of books in Advanced mode, showing tabular layout and more advanced font styles</i></td>
<td style="padding: 0 0 5px 10px" valign="top"><i>My list of books in Default mode</i></td>
</tr>
</tbody>
</table>
</div>
<p>
In Default mode, all the information about the books runs together.<br />
It would be better to present this as a simple<br />
vertical list, the way the Amazon Kindle store does, rather than as a table.
</p>
<p><b>Font Size Considerations</b></p>
<p>
You can choose from six font sizes in the Kindle browser. As a content creator, you can provide a wider range of font sizes in your Kindle-formatted Web page, but take care that they aren&#8217;t too small.  The  device doesn&#8217;t clearly display fonts that are smaller than its default six sizes.
</p>
<p>
In this screenshot, the table of contents for a Bookworm book is not readable,<br />
even though this page has already been tailored for the small display of mobile phones:
</p>
<p>
<div align="center"><a href="http://toc.oreilly.com/2008/08/08/kindle-3.html"><img src="http://toc.oreilly.com/2008/08/08/kindle-3-thumb-200x106.jpg" alt="kindle-3.jpg" class="mt-image-none" width="200" height="106" /></a></div>
</p>
<p>
This problem is only likely to occur in Advanced mode where stylesheets are<br />
activated.
</p>
<p><b>Usability</b></p>
<p>
The Kindle&#8217;s method of selecting and traversing hyperlinks is unique.  The user activates links by selecting along the vertical, or Y-axis, using the scroll wheel.  When multiple links fall on the same line, the Kindle will open a dialog box so the user can clarify which link is the target.
</p>
<p>
In Bookworm, users move to the next or previous chapter by selecting navigation<br />
links lined up horizontally (see the top row of the first image).  In the Kindle,<br />
this presentation forces the user to click a second time to select the appropriate one:
</p>
<p>
<div align="center"><a href="http://toc.oreilly.com/2008/08/08/kindle-2.html"><img src="http://toc.oreilly.com/2008/08/08/kindle-2-thumb-200x269.jpg" alt="kindle-2.jpg" class="mt-image-none" width="200" height="269" /></a></div>
</p>
<p>
For commonly-used navigational items like this, line up the links in a vertical row:
</p>
<ol>
<li> Next</li>
<li> Contents</li>
<li> Previous</li>
</ol>
<p>
Now no second click (and accompanying page refresh) is necessary.
</p>
<p>
It&#8217;s also important to remember that the Kindle is a black-and-white<br />
device.  If your site uses text color to convey any useful information<br />
(such as what is or is not a hyperlink), re-work the design to<br />
accommodate a grayscale display.
</p>
<p>Finally, keep pages short.  The Kindle cannot scroll; long Web pages are paginated like books.  Pagination with E Ink devices is slow relative to scrolling on a computer screen. If possible, keep all your content on the first Kindle &#8220;page&#8221; when viewed at the default font size.
</p>
<p><b>Targeting the Kindle</b></p>
<p>
Web browsers are identified using their &#8220;user-agent&#8221; string.  The current<br />
version of the Kindle is broadcasting this user-agent:<br />
<code>Mozilla/4.0 (compatible; Linux 2.6.10) NetFront/3.3 Kindle/1.0<br />
(screen 600x800)</code>.<br />
It&#8217;s beyond the scope of this article to describe how to set up your<br />
Web site to deliver different kinds of content to different browsers,<br />
a process that varies considerably with your site&#8217;s technology.
</p>
<p>
How do you test your layout if you don&#8217;t have a Kindle?  There&#8217;s no<br />
substitute for having the real device (tell your boss it&#8217;s for &#8220;research&#8221;), and currently Amazon does not offer any kind of browser emulator.  Some possibilities:
</p>
<ol>
<li>Disable stylesheets on your browser and look at the output. Does<br />
  it still make sense? (<a href="http://www.howtocreate.co.uk/userStyle.html">Instructions for<br />
  disabling stylesheets</a>; Firefox users should install the <a href="https://addons.mozilla.org/en-US/firefox/addon/60">Web<br />
  Developer</a> add-on)</li>
<li> Use a text-only browser like <a href="http://en.wikipedia.org/wiki/Lynx_%28web_browser%29">Lynx</a></li>
</ol>
<p><b>Some Last Advice</b></p>
<p>
<em>Don&#8217;t spend too much time on this process</em>.  The next version<br />
of the Kindle is expected soon, no doubt with an improved browser.  Indeed, Amazon could offer a new version of the existing browser at any time.  Most of the changes recommended above should take little time and money to implement, and can make a great difference in user experience.</p>
<p>In addition, optimizing your site for<br />
small-screen browsers can have other benefits: they allow an<br />
increasing number of mobile users to get quick access to your content,<br />
and aid accessibility for screen-readers and other non-standard<br />
browser types.
</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://toc.oreilly.com/2008/08/techcrunch-reports-240k-kindle.html">TechCrunch Reports 240K Kindles Sold</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/hacking-the-kindle.html">Hacking the Kindle</a></li>
<li> <a href="http://toc.oreilly.com/2008/04/ebook-format-primer.html">Ebook Format Primer</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/exploring-diy-ereader-platform.html">Exploring DIY E-Reader Platforms</a></li>
<li> <a href="http://toc.oreilly.com/2008/05/what-would-your-ideal-ereader.html">What Would Your Ideal E-Reader Look Like?</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/08/optimizing-web-content-for-the.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Processing the Deep Backlist at the New York Times</title>
		<link>http://radar.oreilly.com/2008/08/processing-the-deep-backlist-a.html</link>
		<comments>http://radar.oreilly.com/2008/08/processing-the-deep-backlist-a.html#comments</comments>
		<pubDate>Fri, 01 Aug 2008 15:30:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[backlist]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[new york times]]></category>
		<category><![CDATA[newspaper]]></category>
		<category><![CDATA[web services]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/08/processing-the-deep-backlist-a.html</guid>
		<description><![CDATA[At OSCON, Derek Gottfrid explained how the New York Times is using Amazon cloud computing services to make the paper&apos;s historical archive viewable on the Web. ]]></description>
				<content:encoded><![CDATA[<p>At the <a href="http://en.oreilly.com/oscon2008/public/content/home">O&#8217;Reilly Open Source Convention</a> (OSCON), Derek Gottfrid of the New York Times led a fascinating <a href="http://en.oreilly.com/oscon2008/public/schedule/detail/2544">session</a> on how the Times was able to utilize Amazon&#8217;s cloud computing services to quickly and cheaply get their huge historical archive online and freely viewable to the public.</p>
<p>
How big is the archive? Eleven million individual articles from 1851 to 1980, or 4 terabytes of data (over 4,000 gigabytes). The Times got it ready for distribution in 24 hours, for a total cost of $240 in computing fees and $650 in storage fees.</p>
<p>As part of their original <a href="http://www.nytimes.com/ref/membercenter/lettertoreaders.html">TimesSelect subscription service</a>, the paper had scanned their entire print archive.  Each full-page scan was cut into individual articles.  Typical of newspaper format, the articles often spanned column or page boundaries, which meant that many articles were composed of several scans.  In the original subscription-based program, whenever a reader requested one of these historical articles, the Times computer would need to stitch together all of the scans for a particular article before presenting it.</p>
<p>
This on-demand process used significant computing resources, but because TimesSelect was subscription-based there was never much traffic.  Once this archive was open to the public it was expected to generate greater usage, and the safest approach in those cases is to serve pre-generated versions of all 11 million articles.  Using traditional software development practices &#8212; with a single computer churning through one article at a time &#8212; the processing could potentially take weeks and tie up Times servers that were needed for other tasks.
</p>
<p>
Gottfrid turned to Amazon Web Services (AWS) and its two main products:
</p>
<p>
<a href="http://www.amazon.com/gp/browse.html?node=201590011">Amazon Elastic Compute Cloud</a> (EC2) is a form of &#8220;virtualization&#8221; where one very large computer is divided up into many virtual computers that can be individually leased out for use.   Traditional hosting costs money whether the server is working or idle; in EC2 you pay only as long as the virtual computer is running.  When it&#8217;s no longer needed, it&#8217;s shut down. This makes the service ideal for one-off processing jobs.
</p>
<p>
In addition, Amazon doesn&#8217;t care whether you use one EC2 &#8220;instance&#8221; 100 times, or 100 instances all at once &#8212; the cost is the same.  The difference is when you can usefully divide a job into 100 concurrent tasks, because then it takes 1/100th the total time.
</p>
<p>
Amazon&#8217;s other major AWS offering is the <a href="http://www.amazon.com/gp/browse.html?node=16427261">Simple Storage Service</a> (S3), for large-scale file hosting.  Like EC2, it is a leased model &#8212; you pay only for the space that you use in a given time period.
</p>
<p>
Gottfrid leveraged these technologies in combination with a relatively new software library called <a href="http://hadoop.apache.org/core/">Hadoop</a>. Hadoop is written in the Java language and is based on work done at Google.  It allows programmers to very easily write programs that can be run simultaneously on multiple computers.
</p>
<p>
Combining Hadoop concurrency with EC2 and S3, the Times was able to run a job that might have taken weeks of processing time and complete it in 24 hours, using 100 EC2 instances.  They were pleased enough with S3 it became their permanent hosting platform for the scans.  Hosting with Amazon or other cloud computing services is usually cheaper and has much better bandwidth than the average provider, although <a href="http://www.readwriteweb.com/archives/more_amazon_s3_downtime.php">downtime</a> can and does occur.
</p>
<p>
At last year&#8217;s OSCON, the Times announced the formation of its developer blog, <a href="http://open.blogs.nytimes.com/">Open</a>.  You can read more about the <a href="http://open.blogs.nytimes.com/2007/11/01/self-service-prorated-super-computing-fun/">original AWS project</a> as well as <a href="http://open.blogs.nytimes.com/2008/05/21/the-new-york-times-archives-amazon-web-services-timesmachine/">TimesMachine</a>, a project that became economically feasible due to the low cost of AWS.
</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://toc.oreilly.com/2008/07how_hackers_show_its_not_all_b.html">How Hackers Show it&#8217;s Not All Bad News at the New York Times</a></li>
<li> <a href="http://toc.oreilly.com/2008/07/cloud-computings-potential-imp.html">Cloud Computing&#8217;s Potential Impact on Publishing</a></li>
<li> <a href="http://toc.oreilly.com/2008/07/the-media-industrys-perspectiv.html">The Media Industry&#8217;s Perspective Problem</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/08/processing-the-deep-backlist-a.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ALA 2008: Librarians and Patrons Want More Openness</title>
		<link>http://radar.oreilly.com/2008/07/ala-2008-librarians-and-patron.html</link>
		<comments>http://radar.oreilly.com/2008/07/ala-2008-librarians-and-patron.html#comments</comments>
		<pubDate>Wed, 09 Jul 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[academic publishing]]></category>
		<category><![CDATA[ebook distribution]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[reference books]]></category>
		<category><![CDATA[worldcat]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/07/ala-2008-librarians-and-patron.html</guid>
		<description><![CDATA[Librarians and publishers debated a host of e-publishing issues at the 2008 American Library Association conference. ]]></description>
				<content:encoded><![CDATA[<p>At this year&#8217;s American Library Association (ALA) conference in Anaheim, Calif., one theme emerged in talk after talk: librarians and the readers they serve demand more flexibility, transparency and openness in publishers&#8217; offerings. This affects not just digital-only reference works, but the book acquisition via library catalogs and standalone ebooks.</p>
<p><b>Reference publishing and resource discovery</b> &#8212; Reference publishers invest time and money in bespoke search interfaces for advanced users, but are users finding them? In the ALA panel &#8220;<a href="http://www.booklistonline.com/default.aspx?page=show_product&amp;pid=2775664">The Future of Electronic Reference Publishing</a>,&#8221; librarians repeatedly commented that multiple reference sources are confusing to users, and that resources must also be discoverable via Google and the library&#8217;s own digital catalog. </p>
<p>If users do go directly to an individual resource or platform, the search interface should behave &#8220;like Google.&#8221; Although the panel of major reference publishers did state that they are converging on Google&#8217;s query language, many legacy systems remain that would be economically infeasible to re-tool.</p>
<p><b>Library catalogs and systems</b> &#8212; The need for more transparent, network-based services applies to the library catalog as well. In the marathon session, &#8220;<a href="http://www.meanboyfriend.com/overdue_ideas/2008/06/ala-2008-theres-no-catalog-like-no-catalog---the-ultimate-debate-onf-the-future-of-the-library-catalog.html">The Ultimate Debate on the Future of the Library Catalog</a>,&#8221; speakers identified a critical need for geo-based services and <a href="http://toc.oreilly.com/resource_api.html">APIs</a> for finding what&#8217;s in my local library &#8212; now. Once a book is located I should be only a few clicks away from reserving it or even ordering it for delivery to my home.</p>
<p>That dream is still far off &#8212; even with a service like <a href="http://www.worldcat.org/">WorldCat</a> it&#8217;s not currently possible for me to find and reserve a book at <a href="http://www.somervillepubliclibrary.org/">my local library</a>. The closest offering presented on WorldCat is Harvard University&#8217;s library, which is not about to lend to the likes of me. The problem is <a href="http://www.librarian.net/stax/2015/how-worldcat-solves-some-problems-and-creates-others/">even worse for rural libraries</a>. As for my local library &#8212; I love books and this post is the first time it even occurred to me to visit their site. I&#8217;m not alone in that.</p>
<p><b>Ebooks</b> &#8212; This is a transitional time in publishing, and while many patrons still prefer print, an increasing number are asking for <a href="http://toc.oreilly.com/resource_ebooks.html">electronic books</a>, especially in university libraries. Students and academics emphatically reject DRM and restrictions on usage, but many ebooks sold to libraries have technical barriers to printing, cut-and-paste and downloading.</p>
<p>Licensing and subscription costs are also a concern for libraries. Ebooks may be re-priced or re-bundled, challenging the basic assumption that once a library buys a title, it owns the book indefinitely. Librarians want assurances that the products they purchase are either available perpetually, or at least have clearly-stated licensing terms that do not change without notice. </p>
<p>The ability to safely and permanently archive electronic books has been a long-time concern of some librarians, but the floods in New Orleans and Iowa have changed some minds. Off-site electronic archiving would save at least some resources, especially for very small or rural libraries can&#8217;t afford state-of-art preservation facilities.</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://toc.oreilly.com/2008/05/what-openid-can-do-for-educati.html">What OpenID Can Do for Academic Publishers</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/aggregated-ebook-service-suits.html">Aggregated Ebook Service Suits Research Publisher</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/survey-results-students-rely-o.html">Survey Results: Students Rely on Digital Tools for Research</a></li>
<li> <a href="http://toc.oreilly.com/2008/03/google-opens-book-search-with.html">Google Opens Book Search with API</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/07/ala-2008-librarians-and-patron.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exploring DIY E-Reader Platforms</title>
		<link>http://radar.oreilly.com/2008/06/exploring-diy-ereader-platform.html</link>
		<comments>http://radar.oreilly.com/2008/06/exploring-diy-ereader-platform.html#comments</comments>
		<pubDate>Mon, 23 Jun 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[buglabs]]></category>
		<category><![CDATA[chumby]]></category>
		<category><![CDATA[DIY]]></category>
		<category><![CDATA[hacking]]></category>
		<category><![CDATA[Kindle]]></category>
		<category><![CDATA[Make]]></category>
		<category><![CDATA[ybox2]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/06/exploring-diy-ereader-platform.html</guid>
		<description><![CDATA[Don&apos;t like any of the e-readers currently available? Here are some ideas on building your own. ]]></description>
				<content:encoded><![CDATA[<p>I&#8217;ve been working with the <a href="http://www.idpf.org/">EPUB</a> open ebook format a lot lately, but when I want to read a book in it, I have to use my computer. There just aren&#8217;t any devices which support it yet. Naturally this leads me to wonder whether I could build my own e-reader.</p>
<p>I&#8217;m not a hardware person, but the last few years have seen an emergence of open hardware platforms designed to allow even ordinary programmers like me to modify and customize small devices. As far as software goes, an e-reader is pretty straightforward: it&#8217;s just some text on a screen. That shouldn&#8217;t be too hard, right?</p>
<p>Surveying the landscape of hardware options, I&#8217;ve ranked below a variety of devices from &#8220;friendliest&#8221; to &#8220;most-intensive DIY.&#8221; I&#8217;m not addressing PDA or phone devices here, largely because I consider their screen size and text rendering insufficient (but plenty of people disagree).</p>
<p><a href="http://www.chumby.com/"><b>The Chumby</b></a> &#8212; With a 3.5&#8243; touch screen and reasonable $175 price tag, this little wireless computer in a bean bag is an obvious candidate. There&#8217;s a full-fledged <a href="http://wiki.chumby.com/mediawiki/index.php/Main_Page">development environment</a> and large community of users. Most Chumby applications are written in a lightweight version of Flash, which is easy enough to develop in. </p>
<p>It has a few downsides, though. The Chumby doesn&#8217;t have much storage space at all, so any ebooks would have to be saved online and streamed to it, a page or a chapter at a time. Since it&#8217;s meant to be an always-on wireless device, that seems doable. The screen might be too small to comfortably read lots of text, as it&#8217;s meant for short bursts like RSS feeds or Twitter updates. </p>
<p>Unfortunately, it&#8217;s powered by a wall outlet, with only a small 9-volt battery for emergency backup. People on the <a href="http://forum.chumby.com/viewforum.php?id=8">hardware forums</a> have managed to hack in rechargeable batteries, and I wouldn&#8217;t be surprised if a totally-wireless Chumby is forthcoming. [<i>Disclosure: <a href="http://www.oatv.com/investments/">O'Reilly AlphaTech Ventures</a> is an investor in Chumby Industries</i>.]</p>
<p><a href="http://www.buglabs.net/"><b>BugLabs</b></a> &#8212; The most open of the commercial hardware platforms, BugLabs sells individual pluggable modules that support various features, from touchscreens to cameras to GPS. It looks like a great platform, but it&#8217;s very expensive ($349 for the base module plus $119 for the 2.5&#8243; <a href="http://www.buglabs.net/modules/bugview">touch-sensitive screen</a>). The screen is probably too small for comfortable reading, but the company Web site promises a larger display soon.</p>
<p>Both the Chumby and BugLabs have touchscreens, which is key for  <a href="http://www.engadget.com/2008/02/27/ebook-concept-combines-leather-and-multi-touch/">making small screens more usable</a>. </p>
<p><a href="www.amazon.com/kindle"><b>The Kindle</b></a> &#8212; All the heavy lifting has been done already to <a href="http://igorsk.blogspot.com/search/label/kindle">get into the Kindle filesystem</a><br />
and peek inside. It&#8217;s probably too difficult to extend the existing Kindle environment without true source code, but it might be possible to do some simple things, like add new fonts. Few people have really explored hacking on e-ink devices, largely due to high cost and low availability. I suspect when the first color e-ink devices come out, used black and white ones will become popular playthings for enthusiasts.</p>
<p><a href="http://ladyada.net/make/ybox2/"><b>YBox2</b></a> &#8212; For<br />
the ultimate DIY experience, the YBox2 platform is a pile of electronic parts you solder together and assemble in an Altoids tin. It doesn&#8217;t come with a touch-screen, or any screen at all: you connect it to a television or monitor. It uses the tiny <a href="http://www.parallax.com/tabid/407/Default.aspx">Propeller chip</a>, which powers many hobbyist devices and small robots. Like the Chumby, YBox2 comes with networking capability but little storage, and would need to stream book content from the Internet. The networking isn&#8217;t<br />
wireless and of course there&#8217;s no handy rechargable battery, but if you<br />
are the kind of person who can build a YBox2 you probably know how to<br />
make those too. I am not that kind of person.</p>
<p>While I&#8217;d be happy to crawl around a hacked Kindle, I know I&#8217;m not ready to program my own microcontroller. BugLabs seems great from a developer standpoint, especially when they release a larger screen, but I&#8217;m unwilling to shell out almost $500 just to experiment. The <a href="http://www.sonystyle.com/webapp/wcs/stores/servlet/CategoryDisplay?catalogId=10551&amp;storeId=10151&amp;langId=-1&amp;categoryId=8198552921644523779">Sony Reader</a> doesn&#8217;t have networking, so that&#8217;s much less interesting. Perhaps a Chumby is in my future. Any other options?</p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://toc.oreilly.com/2008/06/select-oreilly-books-soon-on-kindle-and-as-drm-free-digital-bundle.html">Select O&#8217;Reilly Books Soon on Kindle, and as DRM-free Digital Bundles (Including EPUB)</a></li>
<li> <a href="http://toc.oreilly.com/2008/05/what-would-your-ideal-ereader.html">What Would Your Ideal E-Reader Look Like?</a></li>
<li> <a href="http://toc.oreilly.com/2008/04/ergonomics-and-ebook-success.html">Ergonomics and Ebook Success</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/hacking-the-kindle.html">Hacking the Kindle</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/06/exploring-diy-ereader-platform.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Release Early, Release Often: Agile Software Development in Publishing</title>
		<link>http://radar.oreilly.com/2008/06/release-early-release-often-ag.html</link>
		<comments>http://radar.oreilly.com/2008/06/release-early-release-often-ag.html#comments</comments>
		<pubDate>Thu, 12 Jun 2008 15:00:00 +0000</pubDate>
		<dc:creator>Liza Daly</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[agile]]></category>
		<category><![CDATA[digital]]></category>
		<category><![CDATA[experimentation]]></category>
		<category><![CDATA[Innovation]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2008/06/release-early-release-often-ag.html</guid>
		<description><![CDATA[Agile software development has transformed online product development. Can it do the same in publishing? ]]></description>
				<content:encoded><![CDATA[<p>&#8220;How do Web startups release three or four new versions of a product in the time it takes publishers to launch just one new feature on their online platforms?&#8221;</p>
<p>This question framed &#8220;The Agile IT Organization,&#8221; a lively and well-informed discussion at the recent <a href="http://sspnet.org/">Society for Scholarly Publishing</a> annual conference in Boston.  As a software engineer, I&#8217;ve used both agile and traditional product development methodologies and I was interested to hear the perspectives of other programmers as well as publishers who&#8217;ve gone through the process.</p>
<p>Geoffrey Bilder of <a href="http://www.crossref.org/">CrossRef</a> provided an introduction to agile development practices, which are concisely summarized in plain English by a core <a href="http://agilemanifesto.org/principles.html">set of principles</a>.
</p>
<p>
Summarizing even further, agile development means: </p>
<ol>
<li> Minimal up-front specification.  A project has high-level goals (e.g. &#8220;make our back catalog searchable and available for print-on-demand purchase&#8221;), but is not fully described before<br />
development begins.</li>
<li>
Frequent, short-cycle releases. A project is broken up into mini-projects, each with a small set of features that take only a few weeks to implement. Every release (&#8220;iteration&#8221;) has a specification,<br />
development and testing phase.  This means that every couple of weeks the  software is fully usable, although it may have very few features at the start.</li>
<li>Change to the product design is accommodated and even expected. Market conditions, corporate re-organization or user demands may mean that new features are added or old ones are re-worked.  Changes are treated as just another iteration.</li>
</ol>
<p>The panel at SSP focused on two approaches: internal, IT-driven products, and those developed by a third-party vendor.  Larry Belmont, manager of online development at the American Institute of Physics, gave an <a href="http://www.slideshare.net/larrymbelmont57/ssp2008-belmont-the-agile-it-org-final-one-used">excellent  presentation</a> on the in-house approach.  His organization ran its first agile project with a timeline measured in days rather than weeks or months. </p>
<p>Leigh Dodds, CTO of <a href="http://www.ingenta.com/">Ingenta</a>, provided the vendor perspective, and described the principles of a formal type of agile development known as <a href="http://en.wikipedia.org/wiki/Scrum_%28development%29">Scrum</a>.</p>
<p>
The panel was, to their credit, enthusiastic about the approach, but agile development requires commitment and is not right for every<br />
organization or project. Some caveats that need to be emphasized:
</p>
<ul>
<li>Short development cycles come with a price: you will be asked to review and comment on small pieces of the larger project, and be involved on an almost daily basis.  Many publishers need vendors they can treat like plumbers: &#8220;I want a new sink put <i>here</i>, it should look like <i>this</i>, call me when it&#8217;s done.&#8221;  If someone in your organization isn&#8217;t prepared to think very hard every day about copper pipe fittings, agile isn&#8217;t right for you.</li>
<li> Project managers must be empowered to make decisions.  Whether the project is in-house or vendor-driven, every day the PM will be asked to make calls without appealing to higher powers.  When editorial buy-in is required, or when the product needs a larger  review, consider a hybrid approach: appoint a single decision-maker with deep editorial knowledge to work on evaluating, testing and approving each iteration, but use a more traditional alpha/beta/gold release process for the wider group.  </li>
<li> Product features may change, but time and budget should be invariant. Hard deadlines might seem to be antithetical to the free-wheeling, change-friendly agile approach, but in my experience they&#8217;re critical. They focus the entire team: key decision-makers cannot spend weeks in committee, IT personnel don&#8217;t fear the &#8220;death march&#8221; project with no end in sight, and it&#8217;s more difficult to introduce budget overruns that cause friction with management and vendors.  If an agile project<br />
does run out of time, you will still have a launchable product that&#8217;s been thoroughly tested and reviewed all the way down the line, not something just out of beta with weeks of QA ahead.  Many agile methodologies use the hard deadline, or <a href="http://en.wikipedia.org/wiki/Timebox">timebox</a>, as the primary method of structuring the project.</li>
</ul>
<p>
&#8220;Release early, release often&#8221; can sound a lot like &#8220;throw whatever we&#8217;ve got out the door.&#8221;  This is one reason why the iterative approach has been so embraced by Web startups: each small release has been thoroughly tested and evaluated, and there&#8217;s never a moment where the software doesn&#8217;t work. It&#8217;s possible to to go live with a project that might not be &#8220;finished&#8221; according to the original master plan, but might otherwise be caught up in insurmountable technical hurdles or tied up in editorial review.  </p>
<p>
If publishers are going to be ready for an &#8220;<a href="http://aptstudio.com/timesemit/2008/06/06/the-ipod-moment-for-books-how-serious-is-the-uk-publishing-industry/">iPod moment</a>,&#8221; this kind of flexibility and responsiveness is critical. </p>
<p class="related">Related Stories:</p>
<ul class="related">
<li> <a href="http://toc.oreilly.com/2008/06/targeting-small-companies-with.html">Targeting Small Companies with Small Products</a></li>
<li> <a href="http://toc.oreilly.com/2008/06/publishing-industry-not-prepar.html">Publishing Industry Not Prepared for &#8220;iPod Moment&#8221;</a></li>
<li> <a href="http://toc.oreilly.com/2008/05/ab-meta-in-your-blog-stepbyste.html">Tutorial: Add AB Meta Tagging to Your Blog</a></li>
<li> <a href="http://toc.oreilly.com/2008/04/ebook-format-primer.html">Ebook Format Primer</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://radar.oreilly.com/2008/06/release-early-release-often-ag.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
