<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"><channel><title>Disruptive Library Technology Jester &#187; OCLC</title> <atom:link href="http://dltj.org/tag/oclc/feed/" rel="self" type="application/rss+xml" /><link>http://dltj.org</link> <description>We&#039;re Disrupted, We&#039;re Librarians, and We&#039;re Not Going to Take It Anymore</description> <lastBuildDate>Mon, 06 Feb 2012 20:04:22 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <cloud domain='dltj.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' /> <creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/us/</creativeCommons:license> <item><title>Thursday Threads: SOPA Suspended, Lending Chromebooks, OCLC Introduces WorldShare</title><link>http://dltj.org/article/thursday-threads-2011w51/</link> <comments>http://dltj.org/article/thursday-threads-2011w51/#comments</comments> <pubDate>Thu, 22 Dec 2011 21:07:52 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Thursday Threads]]></category> <category><![CDATA[chromebook]]></category> <category><![CDATA[Google]]></category> <category><![CDATA[H.R.3261 (112th Congress)]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[Stop Online Piracy Act]]></category> <category><![CDATA[wms]]></category> <category><![CDATA[WorldShare]]></category><guid isPermaLink="false">http://dltj.org/?p=3543</guid> <description><![CDATA[Receive DLTJ Thursday Threads:by&#160;E-mailby&#160;RSSDelivered by FeedBurner This is the just-in-time-for-the-holidays edition of DLTJ Thursday Threads. The U.S. House Judiciary Committee suspended work on SOPA, and there was much relief from the technology community. The Palo Alto Public Library announced plans &#8230; <a href="http://dltj.org/article/thursday-threads-2011w51/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=3543"></abbr><div id="feedburner-thursday-threads-email-2011w51" class="wp-caption alignright noprint noFrontPage" style="width: 230px;;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><form style="border: 1px solid rgb(204, 204, 204); padding: 3px; margin: 0pt; text-align: center;" action="http://feedburner.google.com/fb/a/mailverify" method="post" target="popupwindow" onsubmit="window.open('http://feedburner.google.com/fb/a/mailverify?uri=thursday-threads', 'popupwindow', 'scrollbars=yes,width=550,height=520');return true"><p>Receive <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> Thursday Threads:</p><p>by&nbsp;<a href="http://feedburner.google.com/fb/a/mailverify?uri=thursday-threads&amp;loc=en_US" title="D.L.T.J. Thursday Threads Email Subscription">E-mail</a><br /><input style="width: 140px;" name="email" value="Your e-mail address" onfocus="if (this.defaultValue==this.value) this.value = ''" type="text"/><input value="thursday-threads" name="uri" type="hidden"/><input name="loc" value="en_US" type="hidden"/><input value="Subscribe" type="submit"/></p><p>by&nbsp;<a href="http://feeds.dltj.org/thursday-threads/" title="D.L.T.J. Thursday Threads RSS Feed">RSS</a></p><p style="font-size: 80%;">Delivered by <a href="http://feedburner.google.com" target="_blank" title="Google Feedburner Service">FeedBurner</a></p></form></div><p> This is the just-in-time-for-the-holidays edition of <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym> Thursday Threads</i>.  The U.S. House Judiciary Committee <a href="#p3543-sopa">suspended work on <acronym title="Stop Online Piracy Act">SOPA</acronym></a>, and there was much relief from the technology community.  The Palo Alto Public Library announced plans to <a href="#p3543-chromebooks">lend Chromebooks</a> (laptops with Google&#8217;s cloud-based operating system) to patrons.  And OCLC announced a rebranding and expansion of its webscale activities with the <a href="#p3543-worldshare">WorldShare Platform</a>.</p><p>Inclusive of all <a href="http://en.wikipedia.org/wiki/List_of_winter_festivals" title="List of winter festivals - Wikipedia, the free encyclopedia">holidays of the season</a> I wish you a safe, restful and happy celebration.</p><p>Feel free to send this to others you think might be interested in the topics.  If you find these threads interesting and useful, you might want to add the <a href="http://feeds.dltj.org/thursday-threads/" title="RSS Feed for DLTJ Thursday Threads">Thursday Threads RSS Feed</a> to your feed reader or subscribe to e-mail delivery using the form to the right.  If you would like a more raw and immediate version of these types of stories, watch <a href="http://friendfeed.com/dltj" title="Peter Murray - FriendFeed">my FriendFeed stream</a> (or subscribe to <a href="http://friendfeed.com/dltj?format=atom" title="Atom feed for Peter Murray's FriendFeed account">its feed</a> in your feed reader).  Comments and tips, as always, are <a href="http://dltj.org/contact">welcome</a>.</p><p><h2 id="p3543-sopa">A Status Update on SOPA from Washington</h2></p><blockquote><p>Prospects: mixed. On the one hand, it&#8217;s looking likely that it will pass out of committee. Proposed amendments voted down 2-1 in HJC when the manager&#8217;s amendment was marked up. Unless something changes, I expect SOPA to emerge largely unamended, particularly with respect to that relates search engines and use of DNS for enforcement, the most controversial aspects of the bill for the tech community.</p><p>On the other hand, there have been significant cybersecurity concerns raised about the bills because of what it would do to DNSSEC, including by DHS officials. The committee might take a classified briefing so that the government&#8217;s own geeks from Sandia Labs and DHS and other &#8220;Three Letter Agencies&#8221; could explain to the legislators) who somehow neglected to bring in any technical experts before the committee to testify) why SOPA won&#8217;t work and why it&#8217;s a terrible idea to try to DNS for enforcement. If that happens before markup, it could change the bill that heads to the House floor &#8212; and House leadership might want to address security concerns before bringing it to a full vote.<div style="text-align: right; width: 100%;"><cite>- <a href="https://plus.google.com/107980702132412632948/posts/N1igQsDZs4D" title="A Status Update on SOPA from Washington | Google+">A Status Update on SOPA from Washington</a>, by Alexander Howard on Google+</cite></div></blockquote><p>Remember the Stop Online Piracy Act (SOPA)?  It is the proposed bill that would force internet service providers to block <acronym title="Domain Name Service">DNS</acronym> name-to-address translation and force revenue-generating systems (advertisement networks and payment intermediaries) to cut off service to a &#8220;foreign infringing site&#8221;.  The bill was on the fast track to go through the <a href="http://judiciary.house.gov/hearings/mark_12152011.html" title="Full Committee Markup of: H.R. 3261, the “Stop Online Piracy Act” | House of Representatives Committee on the Judiciary">final markup process through the Judiciary Committee</a> last week when debate on dozens of proposed amendments ran out the clock on this year&#8217;s congressional session.  That is where O&#8217;Reilly Media technology writer <a href="https://plus.google.com/107980702132412632948" title="Alexander Howard | Google+ Profile">Alexander Howard</a> picks up the story with his summary excerpted above.  Alexander&#8217;s post is a great synopsis of the history, status, players-to-watch, and people to watch for updates.  For a view on why the technology community was alternating between sobbing and anger, see the aptly titled &#8220;<a href="http://motherboard.vice.com/2011/12/16/dear-congress-it-s-no-longer-ok-to-not-know-how-the-internet-works" title="Dear Congress, It's No Longer OK To Not Know How The Internet Works | Motherboard">Dear Congress, It&#8217;s No Longer OK To Not Know How The Internet Works</a>.&#8221;</p><p><h2 id="p3543-chromebooks">Silicon Valley Library Lends Google Chromebooks</h2></p><blockquote><p>In a first-of-its-kind pilot project, the <a href="http://www.cityofpaloalto.org/depts/lib/default.asp" title="Library | City of Palo Alto Website">Palo Alto, California Library</a> will soon be loaning Google Chromebook computers to library patrons for as long as one week at a time.</p><p>The program highlights the Chromebook’s ability to operate as a kind of “disposable computer,” as Google puts it. With the Chromebook, most all data and applications reside on the Web — not the local machine — so it can easily be passed from person-to-person. It’s a very Googly setup, and the search giant hopes it will reinvent the way businesses use computers.</p><div style="text-align: right; width: 100%;"><cite>- <a href="http://www.wired.com/wiredenterprise/2011/12/chromebook-library/" title="Silicon Valley Library Lends Google Chromebooks | Wired Enterprise | Wired.com">Silicon Valley Library Lends Google Chromebooks</a>, Wired.com Enterprise blog</cite></div></blockquote><p>Cloud computing meets equipment circulation.  I remember a time when libraries used to offer VCR and DVD players to patrons for check-out.  Now that service is coming to computers.  Since everything on the computer is replicated to Google&#8217;s servers, it is easy to wipe the individual patron&#8217;s files on the machine when the next person logs in.  One just needs a Google account to make it work, and that is &#8212; of course &#8212; one of the distinguishing factors between lending Chromebooks and lending VCR and DVD players.  Will patrons mind the Google account requirement?  Should libraries educate patrons on the privacy and information-harvesting/using practices of Google before lending a device?</p><p><h2 id="p3543-worldshare">OCLC Introduces OCLC WorldShare</h2></p><blockquote><p>The OCLC WorldShare Platform facilitates collaboration and app-sharing across the library community, so that libraries can combine library-built applications, partner-built applications and OCLC-built applications. This enables the benefits of each single solution to be shared broadly throughout the library community.<div style="text-align: right; width: 100%;"><cite>- <a href="http://www.oclc.org/us/en/news/releases/2011/201170.htm" title="OCLC introduces OCLC WorldShare | OCLC">OCLC introduces OCLC WorldShare</a></cite></div></blockquote><p>Reaching back a little bit, earlier this month OCLC announced the <a href="http://oclc.org/developer/platform" title="WorldShare Platform | OCLC Developer Network">WorldShare Platform</a> &#8212; a roll-up of the existing Webscale Management tools with the ability to insert third-party applications into a single bibliographic view.  This is potentially a game-changer in how libraries work with bibliographic data.  Similar in concept &#8212; although quite different in technical implementation &#8212; to next generation library automation systems like <a href="http://kuali.org/ole" title="Kuali OLE | www.kuali.org">Kuali OLE</a> and <a href="http://www.exlibrisgroup.com/category/AlmaOverview" title="Ex Libris Alma">Ex Libris Alma</a> and <a href="http://open-ils.org/" title="Evergreen open source library system">Evergreen</a>, WorldShare views back-room bibliographic description, acquisition, and materials-handling workflows as a series of choreographed processes that can be mixed and matched to meet a library&#8217;s particular needs.  It turns the traditional approach of information processing inside out &#8212; the data is in a superior position to the computer program.  The WorldShare Platform is sort of like Facebook.  Just as Facebook introduced ways for outside developers to &#8220;<a href="http://developers.facebook.com/docs/guides/canvas/" title="Apps on Facebook.com | Facebook Developers">integrate into the core Facebook experience</a>&#8220;, WorldShare platform enables external providers to supply applications that can use the data residing in the platform.</p><p>Unless I&#8217;m not reading the right places, the WorldShare introduction has landed with somewhat of a thud among the library technologist community.  Aside from <a href="http://newsbreaks.infotoday.com/NewsBreaks/OCLC-WorldShare-Platform-OCLC-Brands-and-Strengthens-Its-Webscale-Strategy-79208.asp" title="OCLC WorldShare Platform: OCLC Brands and Strengthens Its Webscale Strategy | InfoToday">Marshall Breeding&#8217;s post</a> on InfoToday, I haven&#8217;t seen any discussion of it.  And that seems odd.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/thursday-threads-2011w51/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> <item><title>Open Repositories 2011 Report: Day 2 with DSpace plus Fedora and Lots of Lightning Talks</title><link>http://dltj.org/article/or11-report-3/</link> <comments>http://dltj.org/article/or11-report-3/#comments</comments> <pubDate>Fri, 10 Jun 2011 03:58:20 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[DSpace]]></category> <category><![CDATA[fascinator]]></category> <category><![CDATA[Fedora]]></category> <category><![CDATA[microservice]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[Open Repositories 2011]]></category> <category><![CDATA[premis]]></category> <category><![CDATA[simile]]></category><guid isPermaLink="false">http://dltj.org/?p=3004</guid> <description><![CDATA[Today was the second day of the Open Repositories conference, and the big highlight of the day for me was the panel discussion on using Fedora as a storage and service layer for DSpace. This seems like such a natural &#8230; <a href="http://dltj.org/article/or11-report-3/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=3004"></abbr><p>Today was the second day of the <a href="https://conferences.tdl.org/or/index.php/OR2011/OR2011main">Open Repositories conference</a>, and the big highlight of the day for me was the panel discussion on using Fedora as a storage and service layer for DSpace.  This seems like such a natural fit, but with two pieces of complex software the devil is in the details.  Below that summary is some brief paragraphs about some of the 24&#215;7 lightning talks.<br /><span id="more-3004"></span><br /><h2>Fedora inside DSpace</h2><br /><a href="https://profiles.google.com/mdiggory/about">Mark Diggory of @MIRE</a> moderated a panel of <a href="http://loomware.typepad.com/about.html" title="Mark Leggott">Mark Leggott</a> (Islandora, DiscoveryGarden and UPEI), <a href="http://bradmc.users.sourceforge.net/" title="SourceForge.net: Bradley McLean - Developer Web Hosting - Open Source Software">Bradley McLean</a> (CTO for DuraSpace), <a href="http://www.linkedin.com/pub/richard-rodgers/4/147/b2a" title="Richard Rodgers  | LinkedIn">Richard Rogers</a> (Head of Software at MIT Libraries), <a href="http://ryan.scherle.org/" title="Ryan's Home">Ryan Scherle</a> (Technical Lead, Dryad Digital Repository), and <a href="http://www.blogger.com/profile/13668812856976810177" title="Matt Zumwalt | Blogger">Matt Zumwalt</a> (MediaShelf; Technical Lead, Hydra) on the topic of &#8220;DSpace with Fedora Inside.&#8221;  At last year&#8217;s Open Repositories conference there was a call for the DSpace and Fedora communities to explore this idea.  All of the content and metadata would be stored in Fedora with DSpace continuing to provide the user interface for workflow, discovery and administration.  Or, viewed another way, retain the out-of-the-box experience of DSpace while exposing the versioning, object relationship, and flexible architecture features provided by Fedora.  Work on this has been going on for a number of years, starting in 2007 with Scott Yeadon demonstrating object portability between Fez/Fedora and DSpace.  In 2008, 2009 and 2010 there were three Google Summer of Code projects by Andrius Blažinskas that laid the groundwork for some of this integration by abstracting the DSpace storage layer options.</p><p>Here are some of the questions and responses from the panel.  I hope I&#8217;m representing everyone&#8217;s views as intended; comments and clarifications are welcome.</p><p>Will adding DSpace on Fedora make DSpace even more complex? <i>Matt:</i> It is an opportunity to revisit the design assumptions &#8220;and clean up your work.&#8221; This is an example of where the transition will create the opportunity to tidy up the complexity and make DSpace simpler while gaining the flexibility of Fedora. <i>Mark:</i> The complexity of DSpace isn&#8217;t in its content model, it is in trying to use the existing content model to do things DSpace wasn&#8217;t designed to do. For instance, Islandora has more complex atomistic content models, particularly with science data, than DSpace&#8217;s content model. <i>Bradley:</i> Acknowledged that there is a risk here, but &#8220;if it becomes more complex we are doing it wrong.&#8221;</p><p>There is concern from the DSpace because it may have to change to accommodate Fedora, but Fedora will not need to be changed. <i>Bradley:</i> New ideas are sources of concern.  It is difficult to categorize DSpace developers as a whole.  Any time you move a major application on top of another component you may find the underlying APIs need to change.</p><p>At what level do we align DSpace and Fedora? Do we really need an intermediary format (AIP)? <i>Bradley:</i> If all we do is find a way to graft the existing DSpace workflows onto a Fedora that is very specific to DSpace and can&#8217;t be used with other Fedora tools, then we haven&#8217;t moved very far.  The end goal is to find those places where DSpace is not formally specified enough and get it formally specified. <i>Mark:</i> Islandora hooks Drupal into Fedora. 80%-90% of the time we work with the Drupal-Fedora API &#8212; a simple PHP wrapper around Fedora API.  It transforms calls into appropriate actions into Drupal. Another option comes from an Italy project that created a synchronization of Fedora objects and Drupal Nodes; it copies information from Fedora into the Drupal RDBMS.  Other applications like Omeka have a Fedora plugin.  DiscoveryGarden has also looked at things like WordPress with Fedora underneath.  As repository services become more intelligent about microservices it would take even less time to make these integrations.</p><p>What benefit does Fedora receive? <i>Richard:</i> For the Fedora community, DSpace alignment would provide a rich IR content model for Fedora. <i>Ryan:</i> DSpace was designed as an IR and nothing else; it has that at its core.  The problems that people have with DSpace are when people try to make it do something outside that vision.  Having a Fedora repository and have a DSpace interface for those IR use cases and something like Hydra or Islandora for people using those use cases. <i>Matt:</i> There has never been a large cohesion of IR workflow in Fedora; having this workflow satisfy the IR use case. <i>Bradley:</i> DSpace with Fedora inside is a repackaging with a slightly different set of existing components.  DSpace across its lifetime has tried to become more modular; integration with Fedora will make this clearer. <i>Mark:</i> Would agree that one of the main things DSpace brings to Fedora is the workflow tool and also the back-end data transformation workflows.  But he has also never been a fan of the DSpace workflow because the staunch requirement to fill out a lot of metadata is a mistake.  Working with science data, researchers want to ingest 100K microscope images without metadata then go back and add metadata with time. <i>Ryan:</i> (Agreeing) Some of the requirements of the native installation of DSpace is difficult to work with in other use cases.  Started with configuring the workflow as much as could be done with config file, but then created a new workflow process that still used many of the underlying tools.</p><p>The way I think of the motivation is that DSpace on Fedora will have the same easy setup with access to the underlying APIs for customization. <i>Bradley:</i> Yes &#8212; that is an aspirational goal. The practical realities mean that we will have to take steps there one at a time.  And given the time scale the question comes whether we will get there before we decide to do something else.  One of them &#8212; sort of unsaid &#8212; is to take a look at EPeople and see how that would migrate. <i>Ryan:</i> Unlocking the data is one thing, but unlocking the underlying datastreams as a API.  DSpace storage API is opaque.  You can rebuild everything from the underlying storage. [Ryan also calls out an old DLTJ post: A key advantage of DSpace with Fedora inside. "<a href="http://dltj.org/p38">Why Fedora? Because You Don’t Need Fedora</a>".]</p><p><h2>Lightning Talks</h2><br />The other sessions that I went to today were of the quick 24&#215;7 type.  Here are some highlights.</p><p>Mark Phillips talked about a <strong>PREMIS Event Service</strong>.  He needed a way to log events that occur during the life of digital objects (virus checks, ingest, fixity check, replication) &#8212; when they occur and their outcomes. So he built a microservice based on AtomPub.  Each repository component sends outcomes of an event to this service, a central event collector.  Uses the PREMIS Event and Agent Modules.  Events metadata include: event_identifier, event_type, event_timestamp, event_outcome, outcome_detail, agent_identifier, object_identifier, and event_detail.  Agent metadata includes details about the software, human or organization triggering the event.  An AtomPub feed for each object returns all events for that object.  The system includes a basic search interface to see all events of a particular type and enables a feed to be set up on those searches.  There is the ability to harvest all events via OAI-PMH and Atom.  It is built with Django and Python and will be release on the MetaArchive Google Code page.</p><p>Peter Sefton talked about <strong>The Fascinator &amp; Fedora Commons: A Toolkit Tour</strong>. <a href="http://sites.google.com/site/fascinatorhome/" title="The Fascinator">Fascinator</a> is a java-based platform targeting repository solutions. It is open source (GPL), a plugin-based platform, and highly customizable.  It was first used as an aggregator of data from various sources into a discovery service.  They tried doing the same thing on the desktop computer (something a researchers could put on a personal computer, index data, group/describe it).  He thinks of the process as a conveyor belt: harvest digital objects, transform them, store them, index them and find them.  Harvest: draw into your digital ecosystem files, databases, online resources.  Transform: be ready to present and share.  Real web stuff &#8212; not PDFs.  Video and image previews. Multiple renditions for a multi-platform world.  Storage: Store originals and their friends; basic filesystem storage or use Fedora Commons.  Index: Apache SOLR index.  Find: Faceted search interface.  Web previews (turn Word into HTML and PDF for preview, same for video transcoding). Easily customizable UI (Jython and Velocity).  It is used by REDDBOX, Mint (described at a session yesterday), and a library of university policies (policies sent from Microsoft Sharepoint and transformed into HTML and PDF).</p><p>Rich Rogers talked about <strong>Publishing Large, Data-Rich Collections on the Web with Exhibit3</strong>. We collect and we like to share what we collect. Nowadays we live on the web, so how can we share out collections there? <a href="http://www.simile-widgets.org/exhibit/" title="Exhibit | SIMILE Widgets">Exhibit</a> is all I need to publish my collection to the web: no backend database, no server application; it will even convert a spreadsheet into usable data on the web.  Originally created by the SIMILE Project at MIT, Exhibit is an entire data publication platform with &#8216;list&#8217; and &#8216;tabular. views.  There is also a rich library of additional views. If temporal data, scrollable interactive timeline.  If geospatial data, interactive mapping displays.  Numerical data, scatter or time plots.  It is installable by HTML configuration and uses a browser-resident RDF database in JSON.</p><p>Geri Ingram and Carol Godby talked about <strong>Sustaining Collaboration Among Open Access Repositories</strong> focused on the <a href="http://www.oclc.org/gateway/" title="WorldCat Digital Collection Gateway  | OCLC">WorldCat Digital Collection Gateway</a>.  WorldCat broadens exposure to digital object collections; end users click through to the hosting repository server from WorldCat.org.  Metadata from OAI-PMH-compliant repositories are regularly harvested to WorldCat through the Gateway; digital objects remain on local repository server.  The gateway includes a translation service that allows repository managers to create mappings from Dublin Core (or selected other metadata schemas) to MARC.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/or11-report-3/feed/</wfw:commentRss> <slash:comments>7</slash:comments> </item> <item><title>Thursday Threads: Websites for Small Libraries, Open Source in Govt, Measuring Reliability</title><link>http://dltj.org/article/thursday-threads-2011w2/</link> <comments>http://dltj.org/article/thursday-threads-2011w2/#comments</comments> <pubDate>Thu, 13 Jan 2011 11:51:27 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Thursday Threads]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[open source]]></category> <category><![CDATA[reliability]]></category><guid isPermaLink="false">http://dltj.org/?p=2215</guid> <description><![CDATA[Receive DLTJ Thursday Threads:by&#160;E-mailby&#160;RSSDelivered by FeedBurner It has been the longest of weeks and the shortest of weeks. Longest because of a working weekend with the ALA Midwinter conference in San Diego. Shortest because the activities leading up to, during, &#8230; <a href="http://dltj.org/article/thursday-threads-2011w2/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2215"></abbr><div id="feedburner-thursday-threads-email-2011w02" class="wp-caption alignright noprint noFrontPage" style="width: 230px;;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><form style="border: 1px solid rgb(204, 204, 204); padding: 3px; margin: 0pt; text-align: center;" action="http://feedburner.google.com/fb/a/mailverify" method="post" target="popupwindow" onsubmit="window.open('http://feedburner.google.com/fb/a/mailverify?uri=thursday-threads', 'popupwindow', 'scrollbars=yes,width=550,height=520');return true"><p>Receive <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> Thursday Threads:</p><p>by&nbsp;<a href="http://feedburner.google.com/fb/a/mailverify?uri=thursday-threads&amp;loc=en_US" title="D.L.T.J. Thursday Threads Email Subscription">E-mail</a><br /><input style="width: 140px;" name="email" value="Your e-mail address" onfocus="if (this.defaultValue==this.value) this.value = ''" type="text"/><input value="thursday-threads" name="uri" type="hidden"/><input name="loc" value="en_US" type="hidden"/><input value="Subscribe" type="submit"/></p><p>by&nbsp;<a href="http://feeds.dltj.org/thursday-threads/" title="D.L.T.J. Thursday Threads RSS Feed">RSS</a></p><p style="font-size: 80%;">Delivered by <a href="http://feedburner.google.com" target="_blank" title="Google Feedburner Service">FeedBurner</a></p></form></div><p> It has been the longest of weeks and the shortest of weeks.  Longest because of a working weekend with the <abbr title="American Library Association">ALA</abbr> Midwinter conference in San Diego.  Shortest because the activities leading up to, during, and after the conference didn&#8217;t leave much time for reading items to prepare a <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym> Thursday Threads</i> article.  This edition has three short pointers:  a report from Midwinter on a project from OCLC to offer a basic website for every library, why governments are coming around to liking open source software, and a discussion of measurements of reliability.<br /><span id="more-2215"></span></p><p>If you find these threads interesting and useful, you might want to add the <a href="http://feeds.dltj.org/thursday-threads/">Thursday Threads RSS Feed</a> to your feed reader or subscribe to e-mail delivery using the form to the right.  If you would like a more raw and immediate version of these types of stories, watch <a href="http://friendfeed.com/dltj" title="Peter Murray - FriendFeed">my FriendFeed stream</a> (or subscribe to <a href="http://friendfeed.com/dltj?format=atom" title="Atom feed for Peter Murray's FriendFeed account">its feed</a> in your feed reader).  Comments and tips, as always, are welcome.</p><p><h2><a name="web_presence">A Web presence for every library</a></h2></p><blockquote><p>We are ready to share our work to date on a very early, experimental service aimed at providing a low cost and easy-to-use Web site service for small and rural public libraries and are looking for feedback and direction from the library community.</p></blockquote><p>Late last year &#8212; just about a week before ALA Midwinter &#8212; came an <a href="http://community.oclc.org/cooperative/2010/12/a-web-presence-for-every-library.html" title="A Web presence for every library - The OCLC Cooperative Blog">announcement</a> of a <a href="http://experimental.worldcat.org/lib/" title="" class="broken_link" rel="nofollow">project</a> by <a href="http://experimental.worldcat.org/" title="OCLC Innovation Lab">OCLC&#8217;s Innovation Lab</a> to offer an inexpensive website to every small library.  At a price of about $5 per month, a library could have a basic desktop and mobile website.  At about $40 per month, the library could have a simple inventory and circulation module.  You can see what is possible at the <a href="http://experimental.worldcat.org/lib/n/us.tn.loremville-public-library" rel="noindex, nofollow" title="Loremville Public Library" class="broken_link">Loremville, TN public library sample site</a> and read more information about the project in <a href="http://dltj.org/article/a-web-presence-for-small-libraries/">my write-up of the public demonstration</a>.</p><p><h2><a name="open_source">A Walled Wide Web for Nervous Autocrats</a></h2></p><blockquote><p>At the end of 2010, the &#8220;open-source&#8221; software movement, whose activists tend to be fringe academics and ponytailed computer geeks, found an unusual ally: the Russian government. Vladimir Putin signed a 20-page executive order requiring all public institutions in Russia to replace proprietary software, developed by companies like Microsoft and Adobe, with free open-source alternatives by 2015.</p><p>The move will save billions of dollars in licensing fees, but Mr. Putin&#8217;s motives are not strictly economic. In all likelihood, his real fear is that Russia&#8217;s growing dependence on proprietary software, especially programs sold by foreign vendors, has immense implications for the country&#8217;s national security. Free open-source software, by its nature, is unlikely to feature secret back doors that lead directly to Langley, Va.</p></blockquote><p>Despite the backhanded slight to open source software advocates (&#8220;fringe academics and ponytailed computer geeks&#8221; &#8212; really?), this <a href="http://online.wsj.com/article/SB10001424052748704415104576065641376054226.html" title="A Walled Wide Web for Nervous Autocrats - WSJ.com">Wall Street Journal article</a> talks about how being able to inspect the source of software used by governments can reduce the likelihood that other governments can eavesdrop on computer activity.</p><p><h2><a name="reliability">Why Internet Services Aren’t Achieving Ma Bell’s Reliability</a></h2></p><blockquote><p>AT&amp;T’s dial tone set the all-time standard for reliability. It was engineered so that 99.999 percent of the time, you could successfully make a phone call. Five 9s. That works out to being available all but 5.26 minutes a year.</p><p>Can we realistically expect that such availability will ever come to Internet services?&#8230;As more and more Web services companies acquire years of experience, we’ll see more consistent reliability — it’s just a matter of time and learning. Attaining Four-9s availability will become routine. That means available all but 52.56 minutes a year.</p><p>As for moving to 99.999, well, that may never come. “We don’t believe Five 9s is attainable in a commercial service, if measured correctly,” says Urs Hölzle, senior vice president for operations at Google. The company’s goal for its major services is Four 9s.</p></blockquote><p>Ever heard the phrase &#8220;five nines&#8221;?  It is a mystical measure of reliability where, as this <a href="http://www.nytimes.com/2011/01/09/business/09digi.html" title="http://www.nytimes.com/2011/01/09/business/09digi.html">New York Times article points out</a>, one would only see roughly five minutes of downtime a year.  You&#8217;ll hear computer people talk about five-nines &#8212; 99.999% uptime &#8212; but now you can come armed with information about what today&#8217;s most popular internet service companies are doing to try to achieve it (and ask if the company promising you five-nines is doing the same thing).</p><p>On a side note, the article also has a paragraph about Amazon&#8217;s Simple Storage Service, or S3, cloud-based disk service: &#8220;&#8216;We talk of durability of data — it’s designed for Eleven-9s durability,&#8217; says James Hamilton, a vice president for Amazon Web Services. That works out to a 0.000000001 percent chance of data being lost, at least theoretically.&#8221;  This topic came up at presentation I gave during ALA Midwinter on <a href="http://dltj.org/article/preservation-storage-options/">Options in Storage for Digital Preservation</a>.  With digital preservation, we don&#8217;t have a good way of thinking about probability.  Is eleven-9s durability good enough for the long-term preservation of our digital archive masters?</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/thursday-threads-2011w2/feed/</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>OCLC Introduces &#8220;A Web Presence for Small Libraries&#8221;</title><link>http://dltj.org/article/a-web-presence-for-small-libraries/</link> <comments>http://dltj.org/article/a-web-presence-for-small-libraries/#comments</comments> <pubDate>Wed, 12 Jan 2011 01:35:41 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2011]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[public library]]></category><guid isPermaLink="false">http://dltj.org/?p=2114</guid> <description><![CDATA[On Sunday evening, the OCLC Innovation Lab held a public demonstration of a project with the working title, &#8220;A Web Presence for Small Libraries.&#8221; It is a templated website that could serve as a library&#8217;s barest bones presence on the &#8230; <a href="http://dltj.org/article/a-web-presence-for-small-libraries/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2114"></abbr><p><div id="attachment_2118" class="wp-caption alignright" style="width: 327px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://experimental.worldcat.org/lib/n/us.tn.loremville-public-library" title="Loremville Public Library" class="broken_link" rel="nofollow"><img src="http://cdn.dltj.org/wp-content/uploads/2011/01/Loremville-Public-Library_1294793741440.png" alt="Screenshot of Sample Library Website" title="Screenshot of Sample Library Website" width="317" height="297" class="size-full wp-image-2118" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Screenshot of Sample 'Web Presence for Small Libraries'</p></div> On Sunday evening, the <a href="http://experimental.worldcat.org/" title="OCLC Innovation Lab homepage">OCLC Innovation Lab</a> held a <a href="http://community.oclc.org/cooperative/2010/12/a-web-presence-for-every-library.html" title="A Web presence for every library - The OCLC Cooperative Blog">public demonstration</a> of a project with the working title, &#8220;<a href="http://experimental.worldcat.org/lib/" title="" class="broken_link" rel="nofollow">A Web Presence for Small Libraries</a>.&#8221;  It is a templated website that could serve as a library&#8217;s barest bones presence on the web.  The target audience is small and/or rural libraries that may not have the technological infrastructure &#8212; human knowledge, equipment, and/or money &#8212; to host their own web presence.  If it comes to fruition, the basic service would give a library four pages on the web that can be customized by the library staff plus dynamic areas of content that would be generated by OCLC algorithms and optionally placed on each library&#8217;s site.  A more advanced version of the service could include a light-weight book inventory and circulation option.</p><p>They created a sample library called <a href="http://experimental.worldcat.org/lib/n/us.tn.loremville-public-library" rel="noindex, nofollow" title="Loremville Public Library" class="broken_link">Loremville, TN public library</a> to demonstrate key aspects of the service.  I did not ask them how long that particular example will be around, so you may follow that link at a later date and not find it.</p><p>This &#8220;Library Website in a Box&#8221; is a concept that has been around for many years, and the latest trigger to try something was a resolution from the last OCLC Members Council to make scaled-down versions of OCLC services for &#8220;small and rural libraries.&#8221;  The Innovation Lab group conducted some research about the existing state of public library web presences by sorting the <a href="http://www.imls.gov/news/2010/063010.shtm" title="http://www.imls.gov/news/2010/063010.shtm" class="broken_link" rel="nofollow">IMLS-reported data from 2008</a> by number of volumes held and number of library staff.  They looked at the lowest quartile (roughly 20,000 volumes or less and staff size in the low single digits) and found that there was generally no web presence for these libraries.  In the second quartile there were instances of library websites, but they did the library no credit (outdated, poorly constructed, incomplete information &#8212; as was said at the meeting, these libraries had a presence on the web but probably shouldn&#8217;t).  Some had automation systems supplied by large groups, but others didn&#8217;t have evidence of an automation system.  So the project charter was to find an easy and inexpensive way for a library in these quartiles to create a desktop and mobile device web presence.</p><p>One of the unique aspects of the project development was to first set the approximate lower ($5/month) and upper ($40/month) price boundaries and find a way to provide the highest level of service possible at those price points.  The Innovation Lab team tried techniques such as a farm of WordPress sites but found they couldn&#8217;t make the revenue-versus-cost equation to work.  In the end they constructed a custom database-driven content system in PHP.  Institutional data is initially pulled from sources such as the <a href="http://www.oclc.org/registry/default.htm" title="WorldCat Registry [OCLC - Web Services]">WorldCat Registry</a>, and there will be some process for a library to &#8220;claim&#8221; its site.  There might also be a way for a library to create a site for itself if no registry data yet exists.</p><p>There are four levels of site authorization:  public (unauthenticated) viewing, a registered patron, a staff member, and an administrator.  Content from the pages are edited by the administrator at the subscribing library with a WYSIWYG<sup><a href="http://dltj.org/article/a-web-presence-for-small-libraries/#footnote_0_2114" id="identifier_0_2114" class="footnote-link footnote-identifier-link" title="What-You-See-Is-What-You-Get, meaning as you edit the document with styles like bolding, underlining, and bulleted lists, what you see in the editing interface is exactly how it will appear in the viewing interface.">1</a></sup> editor.  There are content boxes for the library&#8217;s location, staff/volunteers listing, events calendar and news, hours and phone number, policies, and service information.  The library&#8217;s address is fed into a Google Maps service to display a map of the area surrounding the library.</p><p>The dynamic parts of each library&#8217;s website could have a list of books from various sources like the New York Times and Oprah&#8217;s book group.  The service also envisions offering a &#8220;default digital collection&#8221; using public domain works in text, PDF, and mobile reading device formats from sources such as Project Gutenberg and the Internet Archive.</p><p>The inventory and circulation module is simple and straight-forward.  Each item has only eight full-text fields with the intention that the description will likely be done by a volunteer without professional library training.  Cataloging can be done by typing in the information or scanning the ISBN with an app on a mobile device; item information is pulled from WorldCat if found.  The cataloging application does not attach holdings to WorldCat, but the OCLC number is kept and might be used to facilitate offloading MARC records in cases where a library outgrows this simple circulation module to a more functional integrated library system.  The circulation functions are check-in, check-out, renew, place hold and cancel hold.  There are no financial functions in the system.</p><p>At these price levels, the system needs to be highly automated and self-supporting.  The cost to OCLC of one call to a customer support phone number could easily run through all the revenue OCLC would receive from the subscribing library in a year.  A widely adopted implementation at the targeted price points means that OCLC could dedicate one or two technical staff to support and upgrade the system in addition to the hardware and support service amortization.</p><p>One unresolved issue is domain names &#8212; what is the URL that will be used for each library&#8217;s site.  OCLC is investigating options such as partnering with a domain registrar company (someone like GoDaddy), becoming a domain registrar themselves, or putting all the sites under one domain.  The economics of each of these options will be a factor. The sometimes cumbersome aspects of migrating domain names from one service to another may make that activity cost-prohibitive as well.</p><p>Mike Teets noted that this was at a &#8220;project&#8221; stage, not a &#8220;product&#8221; stage.  My paraphrasing of what this distinction means is that the technology to create a product is largely done, but the decisions on the final formative pieces of the technology and the surrounding support infrastructure &#8212; is not yet done (and might never be done).  There isn&#8217;t even a formal name for it yet; it is being called &#8220;A Web Presence for Small Libraries.&#8221;  One desired additional feature is to add e-mail boxes for library staff/functions to the site.  Those in the room, including me but more importantly others who are more closely aligned with the target &#8220;small and rural&#8221; public library population, were pretty excited about it and wanting to talk further about if what we saw could be made a reality.</p><p><div id="attachment_2118_video" class="wp-caption alignright" style="width: 327px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><iframe title="YouTube video player" class="youtube-player" type="text/html" width="316" height="208" src="http://www.youtube.com/embed/-qxiW7gkQTU" frameborder="0"></iframe><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">One minute video of Mike Teets and Willie Neumann introducing the concepts behind the project</p></div><h2>Personal Impressions</h2></p><p>Like others in the room, I came away impressed by the project demonstration.  It is definitely fits the bill as a basic library website and even a starter inventory and circulation management system.  I see that libraries could start with something like this for the cost of a couple of books added to the collection (roughly estimated at $60/year).  At this price level, OCLC thinks they could sustain the costs of operations plus have some left over for investment in incremental improvements.  I also think that because a library would pay a minimal fee for it, they would feel a tangible sense of ownership over the site and would keep them up-to-date.  From this a library could &#8220;graduate&#8221; to another service &#8212; their own Drupal or WordPress site, to a shared ILS or Webscale Management Services.</p><p>The content areas seemed the most appropriate for the target audience.  The web page design is modern, and I could see options for future enhancements as time and revenue permit such as providing limited options to personalize the template (change colors, adding picture &#8212; or adding links to pictures that might be stored on services such as Flickr).</p><p>One attendee at the session suggested that rather than OCLC prepopulating a digital content library of <em>all</em> public domain content that the set be <span class="removed_link" title="http://www.clicweb.org/e_discover/e_discovermarc.html">limited to those that are the most downloaded</span> so as not to overwhelm the user with a lot of unused (and/or unusable) digital content.  That sounds like a good suggestion to me.</p><p>OCLC Staff are looking for feedback on this project.  They say that the system is &#8220;production ready&#8221; with all the software controls and data recovery features of OCLC behind it.  What they think is missing is community support to have local engagement with the targeted libraries to show them what is possible.  That is an area where OCLC needs help.  (I can only imagine the shocked silence of a volunteer at a small library to get a call from OCLC &#8212; if they even knew what OCLC was &#8212; with an offer to create a website for the library.  &#8220;Only $5/month&#8230;sign up now and we&#8217;ll throw in a second one for free!&#8221;) There is an e-mail address &#8212; <a href="mailto:innovation@oclc.org">innovation@oclc.org</a> &#8212; that goes to all the OCLC Innovation Lab members, and a <a href="http://www.webjunction.org/923" title="WebJunction - Library Websites Group  ">WebJunction group</a> for public discussion.</p><p><h2>About the Innovation Lab</h2><br />The <a href="http://experimental.worldcat.org/" title="OCLC Innovation Lab homepage">OCLC Innovation Lab</a> is a new unit headed up by <a href="http://www.linkedin.com/in/michaelteets" title="Michael Teets - LinkedIn">Mike Teets</a>.  The group has four full-time people (Mike plus <a href="http://www.oclc.org/speakers/bios/house_tip.htm" title="Tip House [OCLC]">Tip House</a>, <a href="http://nl.linkedin.com/pub/rob-koopman/12/b9/303" title="rob koopman - LinkedIn">Rob Koopman</a> and <a href="http://www.linkedin.com/in/willieneumann" title="Willie Neumann - LinkedIn">Willie Neumann</a>) and leverage staff from other parts of OCLC to work on projects.  (Willie performed the demonstration at Midwinter.)  Their charge is to be a quick and nimble team to go after problems of a business unit of the cooperative or something from OCLC as a whole that wants done but hasn&#8217;t made progress.  They like to use this quickness as a positive attribute to intentionally limit the scope of projects.  Since its formation in April 2010 it has come up with roughly one new thing a quarter, including the WorldCat Mobile interface (build in 22 days, Mike noted) and the <a href="http://community.oclc.org/cooperative/2010/06/sometimes-the-internet-is-just-not-big-enough-for-me.html" title="Sometimes the Internet is just not big enough for me - The OCLC Cooperative Blog">Ask4Stuff Twitter service</a>.</p><p>For this project, the Innovation Lab sought out cooperation from OCLC staff to build prototypes and components of the service in their spare time.  Each Monday the self-selected group would get together to show what had been built and discuss ways to move the project forward in the following week.  In this way they rapidly iterated over ideas to come up with what was ultimately proposed.<p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://expreimental.worldcat.org/ to http://experimental.worldcat.org/ on January 12th, 2011.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://www.webjunction.org/923 to http://www.webjunction.org/923 on January 12th, 2011.</p><p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://www.clicweb.org/e_discover/e_discovermarc.html on June 9th, 2011.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_2114" class="footnote">What-You-See-Is-What-You-Get, meaning as you edit the document with styles like bolding, underlining, and bulleted lists, what you see in the editing interface is exactly how it will appear in the viewing interface.</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/a-web-presence-for-small-libraries/feed/</wfw:commentRss> <slash:comments>27</slash:comments> </item> <item><title>Options in Storage for Digital Preservation</title><link>http://dltj.org/article/preservation-storage-options/</link> <comments>http://dltj.org/article/preservation-storage-options/#comments</comments> <pubDate>Sun, 09 Jan 2011 22:45:10 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[Amazon S3]]></category> <category><![CDATA[Association for Library Collections and Technical Services]]></category> <category><![CDATA[Chronopolis]]></category> <category><![CDATA[DAITS]]></category> <category><![CDATA[Duracloud]]></category> <category><![CDATA[Iron Mountain]]></category> <category><![CDATA[LOCKSS]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[preservation]]></category> <category><![CDATA[storage]]></category> <category><![CDATA[trac]]></category><guid isPermaLink="false">http://dltj.org/?p=2101</guid> <description><![CDATA[A last-minute change to my plans for ALA Midwinter came on Tuesday when I was sought out to fill in for a speaker than canceled at the ALCTS Digital Preservation Interest Group meeting. Options for outsourcing storage and services for &#8230; <a href="http://dltj.org/article/preservation-storage-options/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2101"></abbr><p>A last-minute change to <a href="http://dltj.org/article/alamw11-schedule/">my plans for ALA Midwinter</a> came on Tuesday when I was sought out to fill in for a speaker than canceled at the <a href="http://connect.ala.org/node/119686" title="Digital Preservation Interest Group (ALCTS) | ALA Connect">ALCTS Digital Preservation Interest Group meeting</a>.  Options for outsourcing storage and services for preserving digital content has been a recent interest, so I volunteered to combine two earlier <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> blog posts with some new information and present it to the group for feedback.  The reaction was great, and here is the promised slide deck, links to further information, and some thoughts from the audience response.</p><p><h2>Slide Deck and References</h2><br /><div id="slideshare-options-in-storage" class="wp-caption alignright" style="width: 435px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><div style="width:425px" id="__ss_6499127"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/DataGazetteer/options-in-storage-for-digital-preservation" title="Options in Storage for Digital Preservation">Options in Storage for Digital Preservation</a></strong><object id="__sse6499127" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=201101digitalpreservationinterestgroupalcts-110109162838-phpapp02&#038;stripped_title=options-in-storage-for-digital-preservation&#038;userName=DataGazetteer" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse6499127" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=201101digitalpreservationinterestgroupalcts-110109162838-phpapp02&#038;stripped_title=options-in-storage-for-digital-preservation&#038;userName=DataGazetteer" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div><p><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Slides for &#039;Options in Storage for Digital Preservation&#039;</p></div><br />In the presentation there is a <a href="http://www.slideshare.net/DataGazetteer/options-in-storage-for-digital-preservation/14" title="Options in Storage for Digital Preservation">Table About Costs</a> that uses a scenario from an <a href="http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/">earlier <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> blog post</a>.  The text of the scenario is:<br /><blockquote>To examine the similarities and differences in costs, let’s use the OhioLINK Satellite Image collection as a prototypical example. It consists of about 2 terabytes (2TB) of high-quality images in TIFF format, with about 7.5GB of data going into the repository each month. In the interest of exploring everything that S3 can do, there is an assumption that approximately 4GB of data will be transferred out of the archive each month; OCLC’s Digital Archive does not have a end-user dissemination component.</p></blockquote><p>The point of showing this scenario is to show the widest range of costs &#8212; from a storage-only solution like Amazon S3 to a soup-to-nuts service like OCLC Digital Archive.  A word about the redacted costs.  Some of the numbers for OCLC&#8217;s Digital Archive response (from 2008) came from a confidential quote, so the numbers were removed from the public table.  For the numbers that are publicly listed, the values come from <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018" title="OCLC Introduces High-Priced Digital Archive Service">Barbara Quint&#8217;s article</a>.</p><p>The articles and blog posts I referenced in the course of the presentation were:</p><p>Iglesias, Edward and Wittawat Meesangnil (2010). Using Amazon S3 in Digital Preservation in a mid sized academic library: A case study of CCSU ERIS digital archive system. <i>The Code4Lib Journal</i>, issue 12, retrieved 5-Jan-2011 from <a href="http://journal.code4lib.org/articles/4468" title="Using Amazon S3 in Digital Preservation in a mid sized academic library: A case study of CCSU ERIS digital archive system | The Code4Lib Journal">http://journal.code4lib.org/articles/4468</a></p><p>Murray, Peter (2008). Long-term Preservation Storage: OCLC Digital Archive versus Amazon S3. <i>Disruptive Library Technology Jester.</i> Retrieved 5-Jan-2011 from <a href="http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/">http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/</a></p><p>Murray, Peter (2009). Can We Outsource the Preservation of Digital Bits?. <i>Disruptive Library Technology Jester.</i> Retrieved 5-Jan-2011 from <a href="http://dltj.org/article/outsource-digital-bits/">http://dltj.org/article/outsource-digital-bits/</a></p><p>Quint, Barbara (2008). OCLC Introduces High-Priced Digital Archive Service. <i>Information Today.</i> Retrieved 5-Jan-2011 from <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018" title="OCLC Introduces High-Priced Digital Archive Service | Information Today">http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018</a></p><ul><li>Amazon S3. Retrieved 5-Jan-2011 from <a href="http://aws.amazon.com/s3/" title="Amazon Simple Storage Service (Amazon S3)">http://aws.amazon.com/s3/</a></li><li>Chronopolis. Retrieved 5-Jan-2011 from <a href="http://chronopolis.sdsc.edu/" title="Chronopolis -- Digital Preservation Program -- Long-Term Mass-Scale Federated Digital Preservation"> http://chronopolis.sdsc.edu/</a></li><li>DAITSS. Retrieved 5-Jan-2011 from <a href="http://daitss.fcla.edu/" title="DAITSS - Trac">http://daitss.fcla.edu/</a></li><li>DuraCloud. Retrieved 5-Jan-2011 from <a href="https://wiki.duraspace.org/display/duracloud/DuraCloud">https://wiki.duraspace.org/display/duracloud/DuraCloud</a></li><li>Iron Mountain. Retreived 5-Jan-2011 from <a href="http://www.ironmountain.com/news/2009/impr02232009.asp" title="Iron Mountain Digital Introduces the Industry" class="broken_link" rel="nofollow">http://www.ironmountain.com/news/2009/impr02232009.asp</a></li><li>OCLC Digital Archive. Retrieve 5-Jan-2011 from <a href="http://www.oclc.org/us/en/digitalarchive/" title="Digital Archive [OCLC - Digital Collection Services]">http://www.oclc.org/us/en/digitalarchive/</a></li><li>Private LOCKSS Networks. Retrieved 5-Jan-2011 from <a href="http://lockss.stanford.edu/lockss/Private_LOCKSS_Networks" title="Private LOCKSS Networks - LOCKSS">http://lockss.stanford.edu/lockss/Private_LOCKSS_Networks</a></li></ul><p><h2>Some Thoughts</h2><br />There was a great deal of discussion after the presentation about how good of a guarantee is good enough.  Amazon S3, offers two levels of availability:  &#8220;Designed to provide 99.999999999% durability and 99.99% availability of objects over a given year.&#8221;  The question was whether that slight risk of loss is &#8220;good enough&#8221; for our purposes.  Coming to grips with the digital storage, can we (as the librarian profession) get someone from Amazon to talk about what they do to assure that data is available?  Can the terms that they use be translated into terms that we use and understand?  Can we get a level of familiarity and comfort with their storage about what they do to trust them as a long-term data warehouse?  Can we pull out the appropriate questions of the <a href="http://www.dcc.ac.uk/resources/tools-and-applications/trustworthy-repositories" title="Trustworthy Repositories | Digital Curation Centre">Trusted Repositories Audit &amp; Certification: Criteria and Checklist</a> to see how Amazon S3 measures up?</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/preservation-storage-options/feed/</wfw:commentRss> <slash:comments>10</slash:comments> </item> <item><title>A History of the OCLC Tax-Exemption Status</title><link>http://dltj.org/article/oclc-tax-exemption-status/</link> <comments>http://dltj.org/article/oclc-tax-exemption-status/#comments</comments> <pubDate>Wed, 06 Oct 2010 03:47:38 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[legal]]></category> <category><![CDATA[non-profit]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[Ohio Board of Tax Appeals]]></category> <category><![CDATA[SkyRiver/Innovative versus OCLC lawsuit]]></category><guid isPermaLink="false">http://dltj.org/?p=1701</guid> <description><![CDATA[One of the baffling elements I&#8217;ve found in discussions of the history of OCLC is that of its tax exempt status under Ohio law. The latest example of this comes from documents filed in the SkyRiver/Innovative-vs.-OCLC case that make disparaging &#8230; <a href="http://dltj.org/article/oclc-tax-exemption-status/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1701"></abbr><p>One of the baffling elements I&#8217;ve found in discussions of the history of OCLC is that of its tax exempt status under Ohio law.  The latest example of this comes from documents filed in the SkyRiver/Innovative-vs.-OCLC case that make disparaging remarks about how OCLC got its state tax-advantaged status.  (The text of the remarks in those documents are included below.)  I was curious about this a while back and so did some research on the topic.  I had set it aside and forgotten about it until this latest lawsuit brought it up again.  So, to set the record straight, here is at least one version &#8212; hopefully written from a neutral perspective &#8212; of what happened nearly three decades ago.<br /><span id="more-1701"></span><br />A couple of notes before we begin.  First, the title of this post purposefully starts with an indefinite article.  This is not <em>the</em> story; this is <em>a</em> story that I have managed to piece together.  I don&#8217;t do legal research and legislative history for a living, but I did have a class in those topics during library school.  (Hi, Professor Wise!)  This should not be considered a definitive version&#8230;it is as best as I can piece together.</p><p>Second note.  Many of the documents come from LexisNexis Academic Universe.  Even though the documents themselves are in the public domain (being of government origin), I&#8217;m not sure the LexisNexis versions can be openly published because they contain information from the LexisNexis editors. <sup><a href="http://dltj.org/article/oclc-tax-exemption-status/#footnote_0_1701" id="identifier_0_1701" class="footnote-link footnote-identifier-link" title="See this discussion of FriendFeed to get a sense of the frustration in trying to figure out the LexisNexis Academic Universe Terms and Conditions.">1</a></sup> I could publish the raw documents if I were to go to the various governement offices that have this information and make copies, but that &#8212; quite frankly &#8212; is more effort that I&#8217;m willing to put into this project.  So you&#8217;ll either have to take my word for it or look up the citations (provided in the text below) yourself if you have access to LexisNexis Academic Universe.</p><p>Third note.  This post only deals with the tax status of OCLC as it relates to Ohio law.  In other words, the taxes that OCLC would owe to the State of Ohio.  OCLC&#8217;s exemption from federal income taxes under section 501(c)(3) of the Internal Revenue Service Code is another matter entirely.  I&#8217;m sure there is a paper trail to that as well (part of which includes the <a href="http://www.irs.gov/charities/article/0,,id=181089,00.html" title="Form 990 Instructions and Background Documents">IRS Form 990</a> declarations from which details of the SkyRiver/Innovative complaint are drawn), but I don&#8217;t know how to get to it.</p><p><h2>The Origin</h2><br />We start with a request OCLC made to the Ohio Tax Commissioner in 1980.  This summary comes from the decision of the Ohio Board of Tax Appeals in 1983:<sup><a href="http://dltj.org/article/oclc-tax-exemption-status/#footnote_1_1701" id="identifier_1_1701" class="footnote-link footnote-identifier-link" title="OCLC Online Computer Library Center, Inc., Appellant, vs. Robert R. Kinney, Commissioner of Tax Equalization, Appellee. CASE NO. 81-D-602 (REAL PROPERTY TAX) STATE OF OHIO &amp;#8212; BOARD OF TAX APPEALS. 1983 Ohio Tax LEXIS 162. October 11, 1983.">2</a></sup><br /><blockquote>On January 23, 1980, appellant, OCLC Online Computer Library Center, Inc. (hereinafter referred to as &#8220;OCLC&#8221;), filed an application for real property exemption pertaining to certain land located in Dublin, Ohio, together with the five-story building and other improvements thereon, relating to the tax year 1979 and thereafter. The Commissioner denied OCLC&#8217;s application for exemption. OCLC has appealed from such denial to this Board.</p></blockquote><p> On September 3, 1981, the Ohio Tax Commissioner denied the appeal.  I don&#8217;t have a copy of that decision, but the decision at the appellate level includes the text of the appeal from OCLC.  Based on that text, it seems like the denial was based on the &#8220;fact that Appellant&#8217;s real property is not used exclusively for charitable purposes&#8221; as defined by the Ohio Revised Code (sections <a href="http://codes.ohio.gov/orc/5709.12" title="Lawriter - ORC - 5709.12 Exemption of property used for public or charitable purposes.">5709.12</a> and <a href="http://codes.ohio.gov/orc/5709.121" title="Lawriter - ORC - 5709.121 Exclusive charitable or public purposes defined.">5709.121</a>, if you must).</p><p>There are numerous references to case law in Ohio that OCLC used in its appeal that I won&#8217;t list here, but there are two references to what I would offer are core values of OCLC that were refuted by the Ohio Tax Commissioner:<br /><blockquote>The Commissioner erroneously held as a matter of law and fact that Appellant OCLC failed to establish its uniquely charitable purpose of furthering the growth of human knowlege. The Commissioner&#8217;s narrow focusing upon and findings regarding Appellant OCLC&#8217;s provision of services at a cost savings improperly ignored other substantial elements of Appellant OCLC&#8217;s charitable purpose and interactivity with its library members.</p><p>The Commissioner erroneously held as a matter of law and fact that there is nothing unique in the nature of Appellant OCLC&#8217;s services that would make it an unlikely service to be engaged in by private enterprise. The essence of Appellant OCLC is the cooperative sharing of the bibliographic records of all its member libraries and the completeness of the resulting data base in which all its library members have a beneficial right of ownership and use and without which Appellant OCLC as a national library resource would not exist. It is an accident of fate that the resource is physically located in the subject real estate.</p></blockquote><p><h2>The Appellate Court Decision</h2><br />The decision from the appellate court goes on at some length<sup><a href="http://dltj.org/article/oclc-tax-exemption-status/#footnote_2_1701" id="identifier_2_1701" class="footnote-link footnote-identifier-link" title="For instance, the decision from the appellate court says:On February 3, 4, 5, and 8, 1982, a hearing was held for the purpose of permitting the parties hereto to provide additional evidence, as authorized by R.C. 5717.02. Both parties were represented by counsel. A transcript of such proceedings (pp. 1 &amp;#8211; 730, comprising 4 volumes) was subsequently filed.&amp;#8230;to which I say, &amp;#8220;Wow! 730 pages of transcripts!&amp;#8221;">3</a></sup>, including an interesting snapshot of OCLC technology to serve its members as it existed in the early 1980s.  It then gets to the heart of the matter (legal citations removed for sake of clarity):<br /><blockquote>An institution which has as its essential purpose the accumulation of bibliographic catalog data supplied by members organizations which is then made available to and for the use of its member organizations is not a &#8220;public college,&#8221; &#8220;academy,&#8221; or &#8220;public institution of learning&#8221; as such terms are used in [Ohio Revised Code Section] 5709.07.  While it may by true that OCLC may actually teach individuals concerning the nature of the data stored in OCLC&#8217;s computer data base and how to use OCLC&#8217;s equipment in obtaining such data information through the use of OCLC&#8217;s equipment, such teaching process primarily subserves OCLC&#8217;s fundamental purpose of providing information as to the existence, owner and holder of particular books and other educational materials and a means to make arrangements to obtain such educational materials, if desired.</p><p>In the last analysis, OCLC is not an institution having a teacher-student character. OCLC is in the business of selling, leasing or renting equipment and facilities capable of providing information and access to written information, primarily in form the of books, held in public or private libraries of OCLC&#8217;s member organizations. The providing of such information by OCLC in the form of printed bibliographic catalog cards or via telephonic terminal connections with OCLC&#8217;s computer facilities, is primarily, if not solely, for members contracting with OCLC and for an established per transaction price paid by the member organization obtaining such information.</p><p>[...]</p><p>Although a laudable purpose, the business of providing a computerized library network, the storing of library related bibliographical catalog data, and the selling or licensing the use of equipment necessary to gain access to such stored data or to communicate with member organizations having a contractual relationship with OCLC, is not a charitable purpose. There are many data processing corporations providing data storage capabilities for the benefit of its users. The providing of date storage or data processing services and equipment for an established price, whether for profit or otherwise, is not a charitable purpose or activity.</p><p>The fact that the particular users of data storage or data processing services may be charitable institutions that may benefit, in terms of cost, in obtaining information from a data storage or data processing organization below what would otherwise be involved, in terms of time and cost in obtaining like benefits, does not thereby make the supplier of such services and facilities a vicarious charitable institution merely because the facilities supplied are so immediate, intertwined and necessary to the efficient conduct of the users&#8217; charitable activities.</p><p>&#8230;The subject real property is used to house, maintain and sell its computerized services and equipment to its member library organizations. Such services and facilities are not sold directly to the general public nor made available by OCLC directly to the public. Such sales are made only to member organizations having a contractual arrangement made with OCLC under which the member organizations agree and provide bibliographic data pertaining to their own library holdings of books and related materials and agree to pay a per transaction cost for use of OCLC&#8217;s data and system. OCLC does not offer or provide its services and facilities directly to the general public or offer or provide the same except upon a contractual arrangement and an obligation to pay therefor&#8230;.</p><p>The evidence does indicate that OCLC has in fact produced a &#8220;profit&#8221; in the sense that it has accumulated operating revenues in excess of operating expenses although, by virtue of being a non-profit corporation, such profits are not directly distributable as such to its member organizations.</p><p>Although OCLC contends that the Commissioner&#8217;s findings of fact and conclusions of law are erroneous, upon review of the Commissioner&#8217;s findings and conclusions of law we find that they are reasonable and proper in all respects.</p><p>The Board of Tax Appeals finds and determines upon the record as a matter of law that all specifications of error are without merit.</p><p>IT IS ORDERED that the Commissioner of Tax Equalization&#8217;s final determination, rendered by Journal Entry, dated September 3, 1981, here involved, should be, and hereby is, affirmed.</p></blockquote><p><h2>The Ohio Supreme Court Decision</h2><br />OCLC took the case to the Ohio Supreme Court.  In a decision on June 20, 1984, the Ohio Supreme Court upheld the the decision of the appellate court and the tax commissioner.  This decision summed up the previous events this way:<sup><a href="http://dltj.org/article/oclc-tax-exemption-status/#footnote_3_1701" id="identifier_3_1701" class="footnote-link footnote-identifier-link" title="OCLC ONLINE COMPUTER LIBRARY CENTER, INC., APPELLANT, v. KINNEY, COMMR., APPELLEE. No. 83-1713 Supreme Court of Ohio. 11 Ohio St. 3d 198; 464 N.E.2d 572; 1984 Ohio LEXIS 1136; 11 Ohio B. Rep. 509">4</a></sup><br /><blockquote>OCLC&#8217;s membership consists of one hundred eighty-four academic and public libraries in Ohio, and over two thousand six hundred libraries throughout North America. Its services are not directly available to the public at large. Members are charged a fee, and in return they receive terminal equipment for online access to OCLC&#8217;s computer facilities via dedicated telephone lines. OCLC further provides personnel and equipment in order to service and maintain its customers&#8217; computer terminals and other associated computer hardware. The bibliographic holdings of member libraries are entered into the data base for access by all other member libraries, thus facilitating the location of books and materials held by member organizations, and simplifying procedures for inter-library book transfers.</p><p>OCLC also provides cataloging information to member libraries. For example, when a library purchases a book already owned by another member library, a search of OCLC&#8217;s data is made and, because the bibliographic record already exists in the data base, a catalogue record can be generated at a greatly reduced cost.</p><p>In addition to the aforementioned services, OCLC has engaged in research for companies, including Bank One of Columbus and Warner Amex QUBE, a cable television company. On one occasion, OCLC performed research for a telecomputing firm and, in return, was offered fifty percent of the firm&#8217;s stock for an additional price of one dollar.</p><p>The Commissioner of Tax Equalization (hereinafter &#8220;commissioner&#8221;) denied the exemption on the basis that OCLC is neither a &#8220;public college or academy&#8221; or &#8220;public institution of learning,&#8221; as those terms are employed under R.C. 5709.07, nor was the property found to be used &#8220;exclusively for charitable purposes&#8221; so as to entitle OCLC to an exemption under R.C. 5709.12. On appeal, the Board of Tax Appeals affirmed. In its appeal to this court, OCLC has abandoned its argument for an exemption as a public institution of learning in conjunction with R.C. 5709.07, and, as such, contests only the board&#8217;s determination that it failed to meet the qualifications for a charitable exemption under R.C. 5709.12.</p></blockquote><p>Some excerpts from the legal reasoning in deciding the case:<br /><blockquote>OCLC submits that because it serves libraries, which in turn benefit the general public through the dissemination of knowledge for the edification and improvement of mankind, it qualifies as an institution furthering human knowledge and, therefore, is a charitable institution. This argument, however, simply constitutes an attempt by OCLC to obtain a vicarious charitable exemption by virtue of the activities of its customers.</p><p>In Joint Hospital Services v. Lindley (1977), we rejected a similar vicarious charitable exemption theory. In that case, a group of hospitals pooled their resources and established a laundry and linen service for their members, as well as for several other non-profit organizations.</p><p>As in the present case, the taxpayer relied upon the charitable status of its customers as the basis for seeking a charitable exemption. The court observed, however, that notwithstanding the charitable nature of the institutional customers who obtained the laundry and linen services, it is the charitable activities of the taxpayer seeking the exemption which must be considered when reviewing an application for a charitable exemption. Although a laudable purpose, the taxpayer&#8217;s laundry and linen service neither improved health nor alleviated illness and, accordingly, the exemption was denied.</p><p>So, too, in the instant cause, the record demonstrates that OCLC&#8217;s activities more closely resemble those of a publisher of library materials or a data base firm specializing in information retrieval, such as Lexis or the New York Times Information Bank, rather than that of a library. Although OCLC&#8217;s service may greatly enhance the ability of libraries to better serve the public, OCLC essentially offers a product to charitable institutions, for a fee exceeding its cost, and, as the board concluded, is not itself a charitable organization.</p><p>In addition, OCLC fails to address the effect that its commercial fee paying research endeavors have upon its status as an organization seeking a charitable exemption. &#8230;  Such commercial research endeavors have previously been determined by this court to preclude the issuance of a charitable tax exemption. Moreover, although OCLC may have originated as a charitable organization upon its creation in 1967, the organization, which now operates throughout North America for any library willing to pay its fee, and which engages in fee paying research projects for the private gain of commercial industries, has since transcended the realm of a charitable institution.</p><p>For the foregoing reasons, the decision of the Board of Tax Appeals, being neither unreasonable nor unlawful, is affirmed.</p></blockquote><p><h2>The New Law</h2><br />Now we get to the part where the state legislature essentially negated the Ohio Tax Commissioner&#8217;s argument by carving out an exemption for OCLC.  One might think that &#8220;carving out&#8221; is a pejorative phrase, but I know of only one organization that meets the definition decribed below.  A new section of the Ohio Revised Code giving OCLC its special status was added in a law that went into effect on September 11, 1985 as part of a large tax restructuring.  Why this tax restructuring happened is an interesting story, but tangential to the topic at hand.  (See the full story at the end of this post under the heading &#8220;A Brief History of the State Support of Libraries in Ohio.&#8221;) <a href="http://codes.ohio.gov/orc/5709.72" title="Lawriter - ORC - 5709.72 Exemption for library technology development.">Ohio Revised Code contains section 5709.72</a> says:<br /><blockquote>Section 5709.72. Library technology development exemption</p><p>All tangible and intangible personal property shall be exempt from taxation if the following conditions exist in the year for which exemption is sought:</p><ol type="A"><li>The owner is a nonprofit corporation that is exempt from federal income taxes under the provisions of section 501(c)(3) of the Internal Revenue Code of 1954, as amended, and the owner&#8217;s primary purposes are conducting research and development in library technology and providing computerized or automated services to public, charitable, or educational libraries;</li><li>The property is used in any of the following:<ol type="1"><li>Furnishing services to libraries and to similar information resource agencies or institutions whose activities directly benefit libraries, provided at least eighty per cent of the owner&#8217;s revenues from furnishing those services are paid by libraries, agencies, and institutions that are public, charitable, or educational;</li><li>Conducting research and development in technology specifically for use in libraries, the majority of which are public, charitable, or educational;</li><li>Providing products, internal support, or auxiliary services related to activities described in divisions (B)(1) and (2) of this section.</li></ol></li></ol></blockquote><p>Section 5709 of the Ohio Revised Code contains the various types of organizations exempt from reporting personal property for the purposes of taxation.  The legislative history of how this section came to be added to the Ohio Revised Code is probably lost in the mists of time; we don&#8217;t know who proposed it or why.</p><p><h2>The Link to the &#8220;SkyOCLC&#8221; Case</h2><br />What brought this all to mind recently was when I was reading through the motions filed by SkyRiver/Innovative and OCLC in their California anti-trust case.  The <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/20/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 20 - :: Justia Docs:">memorandum in opposition</a> to OCLC&#8217;s motion to transfer the case to Ohio has this bit (emphasis added):<sup><a href="http://dltj.org/article/oclc-tax-exemption-status/#footnote_4_1701" id="identifier_4_1701" class="footnote-link footnote-identifier-link" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc., Document #20, page 13 line 21 through page 14 line 5">5</a></sup></p><blockquote><p>OCLC argues that because the Complaint alleges that OCLC has abused its tax-exempt status, the interests of the Ohio court are greater than the interests of this California Court. The purported reason given is that the court’s decision may have an impact on other Ohio nonprofit corporations. This Court, however, is not going to determine OCLC’s state or federal tax-exempt status nor would an Ohio court. Moreover, OCLC has not explained how any impact on other nonprofits involved in other unrelated activities would result whether this case is tried in California or Ohio. Nor has OCLC explained why a federal court decision will affect only nonprofits in Ohio. In fact, the Supreme Court of Ohio in 1984 determined that	OCLC was engaged in commercial activities and upheld an Ohio Board of Tax Appeals’ decision denying OCLC an exemption from property taxes. See OCLC Online Computer Library Center, Inc. v. Kinney, 11 Ohio St. 3d 198, 464 N.E. 2d 572 (1984), 1984 Ohio LEXIS 1136&#8230;. <em>Apparently, OCLC had the political clout in Ohio to have the Supreme Court’s decision nullified legislatively.</em></p></blockquote><p>OCLC, <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/25/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 25 - :: Justia Docs:">in its response</a>, gets in its two cents as well (emphasis added):<sup><a href="http://dltj.org/article/oclc-tax-exemption-status/#footnote_5_1701" id="identifier_5_1701" class="footnote-link footnote-identifier-link" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc., Document #25, page 11 lines 6 through 24">6</a></sup></p><blockquote><p>OCLC established in its Motion that Ohio courts have a stronger interest in the issues raised by this lawsuit because, among other concerns, Plaintiffs have raised allegations that OCLC is improperly exercising its non-profit status. In response, Plaintiffs cite a 1984 Ohio Supreme Court case which found that OCLC was not exempt from real estate and property tax under then-enacted Ohio law.  In reality, as a result of that lawsuit, the Ohio Legislature demonstrated that it was concerned with OCLC’s non-profit mission by creating for OCLC a statutory exemption from the property tax requirements. Indeed, the language of the statute reinforces OCLC’s argument that the State of Ohio has a particularly strong interest in OCLC’s tax-exempt mission&#8230;. Plaintiffs’ unsupported claim that the Ohio Legislature acted because OCLC improperly exercised “political clout in Ohio” is an unwarranted attack on the Ohio Legislature and its members. <em>Surely, Plaintiffs are not suggesting that the judges in the Southern District of Ohio would be unable to adjudicate fairly a lawsuit brought by an out-of-state litigant, or that they would improperly favor OCLC.</em></p></blockquote><p><h2>A Brief History of the State Support of Libraries in Ohio</h2><br />This history comes from <a href="http://www.olc.org/pdf/FactSheetLGF.pdf" title="Ohio's Future: Funding History of Ohio's Public Libraries">Ohio&#8217;s Future: Funding History of Ohio&#8217;s Public Libraries</a>, a publication of the <a href="http://www.olc.org/" title="Ohio Library Council homepage">Ohio Library Council</a>.  It provides the backstory to the 1983 funding model change that added a section to the Ohio Revised Code that effectively exempted OCLC from state taxes.<br /><blockquote><h3>State Support</h3><br />Beginning in 1933, public libraries in Ohio were supported almost entirely from revenues from the intangible personal property tax. This was a tax levied on individuals&rsquo; holdings of intangible assets &ndash; mainly stocks and bonds. Though the tax was state-imposed and applied uniformly throughout Ohio, it was collected locally. The revenue remained in the county of origin, where it was distributed to library systems in that county in accordance with &ldquo;need.&rdquo;</p><p>This system of library finance, unique in the United States, had several results. Some counties, because of large holdings of taxable intangible property or vigorous local tax enforcement efforts, realized great revenue. Others received very little. By the late 1960s some of Ohio&rsquo;s largest library systems were justly renowned for the strength of their collections, the breadth and variety of services offered, and their qualified professional staffs. However, there were many areas of Ohio where public library service was virtually nonexistent. In Adams County, with a 1970 population of about 19,000, the public library consisted of two small local libraries that received a total of only $19,100.</p><p><h3>Library and Local Government Support Fund</h3><br />In 1983, the Ohio General Assembly repealed the intangible tax based on recommendations from a bi-partisan study of the state&rsquo;s entire tax system.<br />Governor Richard Celeste then created the Public Library Financing and Support Committee, consisting of members of the House and Senate, public library directors, the dean of the Kent State University School of Library Science, teachers, and financial experts, to determine how the state should replace the intangible tax funding for public libraries.</p><p>This committee determined that the loss libraries experienced through the repeal of the intangible tax was equal to 6.3% of Ohio&rsquo;s personal income tax revenue. Therefore, 6.3% of Ohio&rsquo;s personal income tax receipts were earmarked for the [Library and Local Government Support Fund].</p><p>The Public Library Finance and Support Committee set forth two goals for the state&rsquo;s library fund distribution plan: 1. To preserve excellence in existing service, and 2. To improve library service in under funded and underserved areas. To accomplish these goals, the distribution formula divides the LLGSF among all of the state&rsquo;s 88 counties in two ways. First, the formula guarantees each county the amount of revenue received from the fund in the preceding year plus an adjustment for inflation. This part of the distribution is called the &ldquo;guarantee share.&rdquo; Second, if any money remains in the fund after paying each county&rsquo;s guarantee share, then that remainder is distributed according to an &ldquo;equalization ration.&rdquo; The equalization aspect of the formula distributes the excess over the guarantee in inverse proportion to per capita funding levels among the counties &ndash; those counties which received less per capita in the guarantee share, receive more in the equalization share. Thus, over time, the distribution to counties begins to balance.</p></blockquote><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_1701" class="footnote">See this <a href="http://friendfeed.com/lsw/356cd04f/help-needed-i-m-looking-to-get-terms-conditions" title="HELP NEEDED: I'm looking to get the... - LSW - FriendFeed">discussion of FriendFeed</a> to get a sense of the frustration in trying to figure out the LexisNexis Academic Universe Terms and Conditions.</li><li id="footnote_1_1701" class="footnote">OCLC Online Computer Library Center, Inc., Appellant, vs. Robert R. Kinney, Commissioner of Tax Equalization, Appellee. CASE NO. 81-D-602 (REAL PROPERTY TAX) STATE OF OHIO &#8212; BOARD OF TAX APPEALS. 1983 Ohio Tax LEXIS 162. October 11, 1983.</li><li id="footnote_2_1701" class="footnote">For instance, the decision from the appellate court says:<br /><blockquote>On February 3, 4, 5, and 8, 1982, a hearing was held for the purpose of permitting the parties hereto to provide additional evidence, as authorized by R.C. 5717.02. Both parties were represented by counsel. A transcript of such proceedings (pp. 1 &#8211; 730, comprising 4 volumes) was subsequently filed.</p></blockquote><p>&#8230;to which I say, &#8220;Wow! 730 pages of transcripts!&#8221;</li><li id="footnote_3_1701" class="footnote">OCLC ONLINE COMPUTER LIBRARY CENTER, INC., APPELLANT, v. KINNEY, COMMR., APPELLEE. No. 83-1713 Supreme Court of Ohio. 11 Ohio St. 3d 198; 464 N.E.2d 572; 1984 Ohio LEXIS 1136; 11 Ohio B. Rep. 509</li><li id="footnote_4_1701" class="footnote">SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc., Document #<a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/20/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 20 - :: Justia Docs:">20</a>, page 13 line 21 through page 14 line 5</li><li id="footnote_5_1701" class="footnote">SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc., Document #<a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/25/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 25 - :: Justia Docs:">25</a>, page 11 lines 6 through 24</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/oclc-tax-exemption-status/feed/</wfw:commentRss> <slash:comments>23</slash:comments> </item> <item><title>Views on Sharing (or, What Do We Want From OCLC?)</title><link>http://dltj.org/article/views-on-sharing/</link> <comments>http://dltj.org/article/views-on-sharing/#comments</comments> <pubDate>Wed, 29 Sep 2010 01:51:16 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[Carl Grant]]></category> <category><![CDATA[cooperatives]]></category> <category><![CDATA[discovery]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[privacy]]></category> <category><![CDATA[SkyRiver/Innovative versus OCLC lawsuit]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1681</guid> <description><![CDATA[Within the span of a recent week we&#8217;ve had two views of the OCLC cooperative. In one we have a proposition that OCLC has gone astray from its core roots and in the other a celebration of what OCLC can &#8230; <a href="http://dltj.org/article/views-on-sharing/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1681"></abbr><p>Within the span of a recent week we&#8217;ve had two views of the OCLC cooperative.  In one we have a proposition that OCLC has gone astray from its core roots and in the other a celebration of what OCLC can do.  One proposes a new mode of cooperation while the other extols the virtues of the existing cooperative.  Both writers claim &#8212; independently &#8212; to &#8220;talk to librarians&#8221; and represent the prevailing mood of the profession.  Can these two viewpoints be reconciled?</p><p><h2>&#8220;Too Many Cooks?&#8221;</h2><br />The pro-establishment view first.  In a <a href="http://community.oclc.org/cooperative/2010/09/too-many-cooks.html" title="Too many cooks? - The OCLC Cooperative Blog">post</a> by <a href="http://www.oclc.org/speakers/bios/nilges_chip.htm" title="William &#038;039;Chip&#038;039; Nilges [OCLC]">Chip Nilges</a> on the <a href="http://community.oclc.org/cooperative/" title="The OCLC Cooperative Blog: Insights and information from OCLC staff on topics that are fundamental to your cooperative.">OCLC Cooperative Blog</a>, we get the view that the backing of the wider librarian community is key to OCLC being able to <a href="http://www.oclc.org/news/releases/2010/201049.htm" title="H.W. Wilson databases indexed in WorldCat Local [OCLC]">negotiate with content vendors like H.W. Wilson</a>.  Chip&#8217;s &#8220;talk to librarians&#8221; quote is:<br /><blockquote>I spend quite a bit of time talking both to librarians and industry partners&#8211;publishers, booksellers, Web-technology providers, search engine companies&#8211;all kinds of people doing interesting things in our space. And in those talks, there is often a discussion of one of the following: content, technology or community. What I&#8217;ve come to realize, though, is that the best results come from places where all three come together.</p></blockquote><p> Chip&#8217;s post is short but clear in its view that the community of OCLC members is something special and that it adds value to member libraries.</p><p><h2>&#8220;The Cooperative We Need&#8221;</h2><br />The other perspective comes from <a href="http://www.exlibrisgroup.com/?catid=%7B795BD8B6-47DE-4722-8D5D-B664EEEFB34C%7D" title="Bio: Carl Grant">Carl Grant</a> in a <a href="http://commentary.exlibrisgroup.com/2010/09/cooperative-we-need-open-collaborative.html" title="The cooperative we need: Open &amp; Collaborative Library Content" class="broken_link" rel="nofollow">post</a> on his <a href="http://commentary.exlibrisgroup.com/" title="Commentary from Carl Grant" class="broken_link" rel="nofollow">Ex Libris blog</a>.  His thesis is that OCLC has an important role to play in adding value to bibliographic data, but that its motives are too intertwined with for-profit interests to carry out this role effectively.  Carl&#8217;s &#8220;talk to librarians&#8221; quote is:<br /><blockquote>It appears to me that the interests of the OCLC we know today do not appear to be in total alignment with the needs and interests of its overall actual membership. Perhaps they are in alignment with the interests of the Board, Council, and other governing and administrative arms, but the feeling I get in talks with librarians is that it is not in alignment with what they want. As I talk to librarians, across the country today, I hear that what they want is an organization, a cooperative that is focused on developing and providing open and collaborative library content and services that are widely accessible by all in order that they (the librarians) can focus on re-establishing and/or maintaining the value of libraries in our society.</p></blockquote><p> Carl goes on to propose the creation of a utility that aggregates the ratings and rankings of individual users into a database that can enhance the relevance ranking of the emerging generation of discovery layer products.</p><p><h2>My Thoughts</h2><br />This &#8220;talk to librarians&#8221; thread through the two posts makes me reflect on a question I asked earlier on <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i>: <a href="http://dltj.org/article/oclc-social-contract/" title="What Does It Mean to Be a Member of OCLC? | Disruptive Library Technology Jester">&#8220;What Does it Mean to be a Member of OCLC?&#8221;</a> Although I probably haven&#8217;t talked to nearly the number of librarians as Chip and Carl, in my discussions within the profession I still haven&#8217;t come to a resolution to this basic question.  That question itself is tied to another question coming through in the contrast between these two posts:  What Do We Want From OCLC?</p><p>Carl describes the problem in his post.  When a not-for-profit vendor acquires a significant number of for-profit companies (and spins them back out again), how can we (members, vendors, and the library community in general) understand how the mix of commercial and non-commercial interests are playing out at the management level?  Can the OCLC that is the bibliographic utility, the metadata <a href="http://orweblog.oclc.org/archives/001611.html" title="Platforming a library network: destination and switch - Lorcan Dempsey's Weblog">switch</a> between bibliographic-based services, and the <acronym title="Research and Development">R&#038;D</acronym> braintrust co-exist with the for-profit businesses, motivations, and operations?  Or, to put it more sharply, does the negotiation of H.W. Wilson content for use on the subscription-based WorldCat database hinder the evolution of discovery layers that being developed by companies that don&#8217;t have the tax-advantaged not-for-profit status?  (And don&#8217;t forget about the allegations of anti-competitive behavior in the <a href="http://www.librarytechnology.org/web/breeding/skyriver-vs-oclc/" title="http://www.librarytechnology.org/web/breeding/skyriver-vs-oclc/">SkyRiver/Innovative-versus-OCLC lawsuit</a>.)</p><p>In closing this section, I want to pull out and emphasize another quotation from Carl&#8217;s post:<br /><blockquote>In the end, all of these business initiatives, and now resulting lawsuit, strongly work against OCLC being able to do what it does best—building collaboration, content, and related services as a non-profit entity to serve the larger profession.</p></blockquote><p> Agreed.</p><p><h2>Carl&#8217;s Grand Idea</h2><br />What might get lost if you only closely read the first half of Carl&#8217;s post &#8212; as it initially did for me &#8212; is the second half where he describes the concept for enhancing WorldCat in a manner that benefits all&#8230;both library members and commercial entities.  He does this by noting that the &#8220;valuable points of open source software&#8221; can be applied &#8212; in a social media fashion &#8212; to a service that aggregates usage, ratings, and comments in a way that advances relevance ranking of discovery tools.  Now initially the mind swirls with concerns of privacy and informed user consent in gathering this data in one central pool.  I don&#8217;t think we know enough yet in the library community about building privacy-robust systems that meet an American librarian&#8217;s information privacy ethos.  But done right it also has the ability to build a reputation-based social feedback loop that adds important new information to the bibliographic utility.  And because of its better-when-bigger characteristic, only a neutral party like the not-for-profit OCLC cooperative could serve as an aggregator and distributor of this data.</p><p>I highly recommend reading <a href="http://commentary.exlibrisgroup.com/2010/09/cooperative-we-need-open-collaborative.html" title="The cooperative we need: Open &amp; Collaborative Library Content" class="broken_link" rel="nofollow">Carl&#8217;s post</a> and thinking about ways of answering the question &#8220;What Do We Want From OCLC?&#8221;  I commend Carl for his courage and vision in articulating his points and proposing something new for the profession to drive towards.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/views-on-sharing/feed/</wfw:commentRss> <slash:comments>11</slash:comments> </item> <item><title>Early September Summary of the SkyRiver/Innovative vs. OCLC Case</title><link>http://dltj.org/article/skyoclc-september-2010/</link> <comments>http://dltj.org/article/skyoclc-september-2010/#comments</comments> <pubDate>Sat, 18 Sep 2010 22:12:16 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[Innovative Interfaces Inc.]]></category> <category><![CDATA[legal]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[SkyRiver]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1674</guid> <description><![CDATA[On September 9th, OCLC filed its first substantial response with the court to the antitrust lawsuit file by SkyRiver and Innovative Interfaces. And in a motion where OCLC requests a change of venue from the Northern District of California to &#8230; <a href="http://dltj.org/article/skyoclc-september-2010/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1674"></abbr><p>On September 9th, <a href="http://www.oclc.org/" title="OCLC homepage" rel="homepage">OCLC</a> filed its first substantial response with the court to the <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/1/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 1 - :: Justia Docs:">antitrust lawsuit</a> file by <a href="http://theskyriver.com/" title="SkyRiver Technology Solutions" rel="homepage">SkyRiver</a> and <a href="http://iii.com/" title="Innovative Interfaces Inc." rel="homepage">Innovative Interfaces</a>.  And in a motion where OCLC requests a change of venue from the <a href="http://www.cand.uscourts.gov/" title="United States District Court - Northern District of California" rel="homepage">Northern District of California</a> to the <a href="http://www.ohsd.uscourts.gov/" title="United States District Court - Southern District of Ohio" rel="homepage">Southern District of Ohio</a> &mdash; something seemingly mundane &mdash; they certainly pulled no punches:<br /><blockquote>Through a lengthy recitation of inaccurate facts, Plaintiffs allege six claims against OCLC. In short, Plaintiffs allege that OCLC, a forty-year old non-profit entity, is making it difficult for Innovative and its one-year old sister-company, SkyRiver, to compete and gain market share in the ILL, ILS, and the online cataloging library world. Through a variety of uncited references in their Complaint to &ldquo;prominent library-related internet blogs,&rdquo; unnamed commentators, and unattributed articles and reports, as well as through creating <a href="http://www.choiceforlibraries.com/" title="Choice for Libraries">an anti-OCLC website</a>, Plaintiffs have levied a propaganda war on OCLC simply because Plaintiffs have been unable to compete successfully with OCLC&rsquo;s membership base and bibliographic data which OCLC earned through forty years of dedicated service to its member libraries.<cite>SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Filing: 16. Page 4. <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/16/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 16 - :: Justia Docs:">Retrieved from Justia Docs</a> on 18-Sep-2010. (link added)</cite></p></blockquote><p> The question at hand seems to be a bit more than a propogada war between SkyRiver and OCLC.  But the court is not yet at the meat of the matter.</p><p>The text accompanying the motion for change of venue, though, does not deal with the issues raised in the lawsuit.  Instead, it requests the California court &#8220;transfer this action from this District to the Southern District of Ohio, Eastern Division, located in Columbus, Ohio.&#8221;<br /><blockquote>&ldquo;Litigation should proceed where the case finds its center of gravity.&rdquo;  The &ldquo;center of gravity&rdquo; is determined by the location of key witnesses and documents. Here, the &ldquo;center of gravity&rdquo; is plainly the Southern District of Ohio, for these reasons:</p><ul><li>OCLC&rsquo;s headquarters and virtually all of the key witnesses and documentary evidence are located in or near Central Ohio.</li><li>OCLC has a relatively small presence in California, as compared to its much larger and longer-established presence in Ohio.</li><li>OCLC made all decisions and actions operative to the allegations of Plaintiffs SkyRiver Technology Solutions, LLC (&ldquo;SkyRiver&rdquo;) and Innovative Interfaces, Inc. (&ldquo;Innovative&rdquo;) (collectively &ldquo;Plaintiffs&rdquo;) in Ohio.</li><li>The State of California does not have an interest in this lawsuit beyond the fact that Plaintiffs are residents of California, whereas the State of Ohio has a great interest in this lawsuit because Plaintiffs have alleged that one of Ohio&rsquo;s non-profit entities is abusing its non-profit status, an allegation that can impact other Ohio non-profit entities.</li></ul><p>For these and other reasons discussed in more detail below, all parties and the Court will be better served by transferring this case to the Southern District of Ohio, Eastern Division. In making this Motion, OCLC reserves any defenses that it may have against Plaintiffs&rsquo; claims.<cite>SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Filing: 16. Pages 1-2. <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/16/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 16 - :: Justia Docs:">Retrieved from Justia Docs</a> on 18-Sep-2010. (Legal citations removed from text.)</cite></p></blockquote><p>The last point is probably the most interesting to the layperson watching this epic battle unfold.  Pages 12 and 13 contain these statements:<br /><blockquote>The state of Ohio&rsquo;s interest in adjudicating this matter within its borders also militates towards transferring this case. OCLC&rsquo;s relevant policies and practices were developed and implemented in Ohio and the most important witnesses and evidence are located there.</p><p>Further, Plaintiffs have stated serious, albeit unfounded, allegations regarding OCLC&rsquo;s non-profit status.  Plaintiffs have stated that OCLC is &ldquo;abusing its status as a tax exempt, non-profit entity and unfairly competes with for-profit companies, such as Innovative and SkyRiver, by using its non-profit status as leverage to	monopolize the library services industry&#8230;.&rdquo;  These baseless allegations could create serious implications for other Ohio non-profit entities, and Ohio courts have a greater interest in litigating these issues. While Ohio courts also have more experience applying Ohio&rsquo;s laws, more importantly, they also have more experience with the routine customs and practices of non-profit entities in Ohio.	In addition, the Ohio Attorney General has oversight over Ohio non-profit entities and would likewise have an interest in this lawsuit.</p><p>In contrast, California does not have an interest in litigating this action because none of the operative actions occurred in California. Though Plaintiffs asserted California state law claims, as explained below, Ohio courts will be equally skilled at applying and interpreting those laws.<cite>SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Filing: 16. Pages 12-13. <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/16/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 16 - :: Justia Docs:">Retrieved from Justia Docs</a> on 18-Sep-2010. (Legal citations removed from text.)</cite></p></blockquote><p> The question of OCLC&#8217;s tax exempt status is one that bubbles up on occasion.  It would seem like OCLC&#8217;s legal team is willing to take this head on.</p><p>Accompaying the motion to transfer is a <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/17/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 17 - :: Justia Docs:">declaration by Bruce Crocco</a>, Vice President, Library Services for the Americas for OCLC, and this is a more interesting document.  It goes into the history of OCLC &mdash; its founding in 1967 on the Ohio State University campus, how OCLC revolutionized the production of paper cards for card catalogs and the movement into online catalogs, and the evolution of the WorldCat brand name.</p><p>Earlier this week, the judge in the case set this schedule for hearing from the parties on the motion:<br /><blockquote>This matter is set for a hearing on October 29, 2010 on Defendant OCLC Online Computer Library Center’s motion to transfer venue. The Court HEREBY ORDERS that an opposition to the motion shall be filed by no later than September 27, 2010 and a reply brief shall be filed by no later than October 4, 2010.</p><p>If the Court determines that the matter is suitable for resolution without oral argument, it will so advise the parties in advance of the hearing date. If the parties wish to modify this schedule, they may submit for the Court’s consideration a stipulation and proposed order demonstrating good cause for any modification requested.</p></blockquote><p>To refresh your memory, the lawsuit was filed on July 28, 2010, and was <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/8/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 8 -  :: Justia Docs">assigned</a> to <a href="http://en.wikipedia.org/wiki/Jeffrey_White" title="Jeffrey White - Wikipedia">Judge Jeffrey S. White</a> on August 6, 2010.  On August 12th, Judge White <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/9/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 9 -  :: Justia Docs">set a Case Management Conference</a> for January 14, 2011, and on August 13th lawyers for OCLC filed a <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/10/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 10 - :: Justia Docs:">notice</a> with their intent to request a change of venue.  Stay tuned for an update as the case moves on&#8230;</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/skyoclc-september-2010/feed/</wfw:commentRss> <slash:comments>10</slash:comments> </item> <item><title>Mashups of Bibliographic Data: A Report of the ALCTS Midwinter Forum</title><link>http://dltj.org/article/mashups-of-bib-data/</link> <comments>http://dltj.org/article/mashups-of-bib-data/#comments</comments> <pubDate>Wed, 27 Jan 2010 21:14:52 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2010]]></category> <category><![CDATA[Association for Library Collections and Technical Services]]></category> <category><![CDATA[Dewey Decimal Classification]]></category> <category><![CDATA[Google Book Search]]></category> <category><![CDATA[Internet Archive]]></category> <category><![CDATA[MARC]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[onix]]></category> <category><![CDATA[Open Library]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1478</guid> <description><![CDATA[This year the ALCTS Forum at ALA Midwinter brought together three perspectives on massaging bibliographic data of various sorts in ways that use MARC, but where MARC is not the end goal. What do you get when you swirl MARC, &#8230; <a href="http://dltj.org/article/mashups-of-bib-data/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1478"></abbr><p>This year the <a href="http://connect.ala.org/node/91406" title="ALCTS Forum: Mix and Match: Mashups of Bibliographic Data | ALA Connect"><acronym title="Association for Library Collections and Technical Services">ALCTS</acronym> Forum at <acronym title="American Library Association">ALA</acronym> Midwinter</a> brought together three perspectives on massaging bibliographic data of various sorts in ways that <em>use</em> <acronym title="Machine Readable Cataloging">MARC</acronym>, but where MARC is not the end goal.  What do you get when you swirl MARC, <acronym title="ONline Information eXchange">ONIX</acronym>, and various other formats of metadata in a big pot?  Three projects:  ONIX Enrichment at OCLC, the Open Library Project, and Google Book Search metadata.<br /><span id="more-1478"></span><br />Below is a summary of how these three projects are messin&#8217; with metadata, as told by the Forum panelists.  I also recommend reading Eric Hellman&#8217;s <a href="http://go-to-hellman.blogspot.com/2010/01/google-exposes-book-metadata-privates.html" title="Google Exposes Book Metadata Privates at ALA Forum | Go-to-Hellman">Google Exposes Book Metadata Privates at ALA Forum</a> for his recollection and views of the same meeting.</p><p><h2 id="post-1478-h2-OCLC-ONIX">ONIX Enrichment at OCLC</h2></p><p><span class="removed_link" title="http://www.oclc.org/speakers/bios/register_renee.htm">Renee Register</span>, Global Product Manager for OCLC Cataloging and Metadata Services, was the first to present on the panel.  Her talk looked at a new and evolving product at OCLC on the enhancement of ONIX records with WorldCat records, and vice versa. <sup><a href="http://dltj.org/article/mashups-of-bib-data/#footnote_0_1478" id="identifier_0_1478" class="footnote-link footnote-identifier-link" title="For those not familiar with ONIX, it is a suite of standards promulgated by EDItEUR for the interchange of information on books and serial publications.  It is primarily used as the communication channel between the publishing industry through distribution chains to retail establishments.">1</a></sup></p><p>As libraries, Renee said &#8220;our instincts are collaborative&#8221; but &#8220;our data and workflow silos encourage redundancy and inhibit interoperability.&#8221;  Beyond the obvious differences in metadata formats, the workflows of libraries differ dramatically from other metadata providers and consumers. In libraries (with the exception of <acronym title="Cataloging in Print">CIP</acronym> and brief on-order records) the major work of bibliographic production is performed at the end of the publication cycle and ends with the receipt of the published item.  In the publisher supply chain, bibliographic data evolves over time, usually beginning months before publication and continuing to grow for months and years (sales information, etc.) after publication.  Renee had a graphic showing the current flow of metadata around the broader bibliographic universe that highlighted the isolation of library activity relative to publisher, wholesaler, and retailer activity.</p><p><div id="attachment_1484" class="wp-caption alignright" style="width: 310px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://www5.oclc.org/downloads/presentations/MDS4Pubs_August_Webinar_200908.ppt" title="Slides from Publisher Supply Chain Webinar, August 2009"><img src="http://cdn.dltj.org/wp-content/uploads/2010/01/ONIX-enhancement-300x225.jpg" alt="" title="Diagram of the Process of Enhancing ONIX Records" width="300" height="225" class="size-medium wp-image-1484" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Diagram of the Process of Enhancing ONIX Records, from OCLC Services for the Publisher Supply Chain Webinar, August 2009</p></div>Renee when on to describe a &#8220;next generation cataloging data flow&#8221; where OCLC facilitates the inclusion of publisher data into <a href="http://www.worldcat.org/" title="WorldCat homepage" rel="homepage">WorldCat</a> and enhances publisher data with information extracted from WorldCat.  To the right is a version of the graphic she used at Midwinter taken from an earlier presentation on the same topic.  It show ONIX-formatted metadata coming into WorldCat, being cross-walked and matched with existing MARC data in WorldCat, and finally extracted and cross-walked back to ONIX resulting in <a href="http://publishers.oclc.org/en/metadata/default.htm" title="OCLC Metadata Services for Publishers"> enhanced ONIX metadata</a> for publishers to use in their supply chain.  If there is an exact match for the incoming ONIX record in WorldCat, the WorldCat record is enhanced with certain fields from the ONIX record (descriptions, author biographies, web links) &#8212; being careful not to override authority work being done by libraries, but adding enhancements that libraries may not otherwise input.  In turn, enhancements from exact match record and FRBR work set records (hardcover versus softcover versus audiobook, etc.) are added to the ONIX record (non-English subject headings, adding a Dewey Decimal Classification (DDC) field from another similar record if one doesn&#8217;t already exist, change the author field to an authority-controlled version).  If there is not an exact match for the ONIX record in WorldCat, a new WorldCat record is built from the ONIX record and it is subsequently enhanced by metadata found in the FRBR work set records.  In doing so, we are &#8220;increasing the goodness of metadata in the marketplace,&#8221; as Renee put it in her presentation.  OCLC is also creating a mapping between <a href="http://www.bisg.org/what-we-do-20-73-bisac-subject-headings-2009-edition.php" title="Standards &amp; Best Practices | Classification Schemes | BISAC Subject Headings 2009 Edition | Book Industry Study Group">BISAC Subject Headings</a><sup><a href="http://dltj.org/article/mashups-of-bib-data/#footnote_1_1478" id="identifier_1_1478" class="footnote-link footnote-identifier-link" title="By the way, it seems like BISAC is an acronym for &amp;#8220;Book Industry Systems Advisory Committee&amp;#8221;, the former name of the Book Industry Study Group.">2</a></sup> and the DDC system.  This allows the enhancement of ONIX with suggestions of BISAC Subject Terms and the enhancement of WorldCat records with generic DDC fields given an incoming BISAC Subject Term value from the ONIX record.</p><p>In her experience, Renee said that libraries need ways to enable our metadata to evolve over time and allow for publisher-created metadata to merge effectively with library-created metadata.  The bibliographic record needs to be a &#8220;living, growing&#8221; thing throughout the lifecycle of a title and beyond.  In concluding her remarks, she offered several resources to explore for further information:  the OCLC/NISO study on <a href="http://www.niso.org/publications/white_papers/StreamlineBookMetadataWorkflowWhitePaper.pdf" title="Streamlining Book Metadata Workflow">Streamlining Book Metadata Workflow</a>, the U.K. Research Information Network report on <a href="http://rin.ac.uk/creating-catalogues" title="Creating Catalogues: Bibliographic Records in a Networked World">Creating Catalogues: Bibliographic Records in a Networked World</a>, the Library of Congress <a href="http://www.loc.gov/bibliographic-future/news/" title="News, Press Releases and Reports - Working Group on the Future of Bibliographic Control (Library of Congress)">Study of the North American MARC Records Marketplace</a>, the Library of Congress <a href="http://cip.loc.gov/onixpro.html" title="LC ONIX Pilot Project" class="broken_link" rel="nofollow">CIP/ONIX Pilot Project</a>, and the <a href="http://publishers.oclc.org/en/default.htm" title="OCLC Publisher Supply Chain Website">OCLC Publisher Supply Chain Website</a>.</p><p><h2 id="post-1478-h2-Open-Library">From MARC to Wiki with Open Library</h2><br />The second presenter on the panel was <a href="http://kcoyle.net/" rel="homepage" title="Karen Coyle's home page">Karen Coyle</a>, talking about the mashup of metadata at the <a href="http://openlibrary.org/" title="Open Library project homepage" rel="homepage">Open Library</a> project at the <a href="http://archive.org/" title="Internet Archive homepage" rel="homepage">Internet Archive</a>.  The slides from her presentation are <a href="http://kcoyle.net/presentations/ol_boston.pdf" title="Open Library - Mix and Match Metadata presentation slides [PDF]">available from her website</a>.</p><p>Karen said right at the start that the Open Library project is different from most of what happens in libraries &#8212; it is &#8220;someone outside the library world making use of library data&#8221; &#8212; although the goal is arguably the same as others &#8212; &#8220;<a href="http://openlibrary.org/about" title="About Us (Open Library)">One web page for every book ever published</a>.&#8221;  As such, the Open Library isn&#8217;t a library catalog as librarians think of it in that it is not a representation of a libraries inventory. It has metadata for every book it can know about and a pointer to places where the book can be found, including all of the electronic books in Internet Archive (<a href="http://www.opencontentalliance.org/" rel="homepage" title="Open Content Alliance (OCA)">Open Content Alliance</a>, Google Public Domain, etc.) as well as pointers back to OCLC WorldCat.  Karen&#8217;s role for the project is that of &#8220;Library Data Informant.&#8221; The Internet Archive decided that they needed someone who understood library data in order to try to use it.  From Karen&#8217;s perspective, she is trying to be a resource for project but not give them any guidance on how to implement the service.  She is curious to see what the project would do when bibliographic data is viewed from a non-librarian perspective.  If they have questions, or if they have assumptions about data that are wrong, then she intervenes.</p><p>Karen went on to briefly describe the Open Library system.  Open Library doesn&#8217;t have records; rather, it has field types and data properties.  In this way, it uses semantic web concepts.  &#8220;Author&#8221; is a type, &#8220;Author birthdate&#8221; is another type, and so forth.  There are no set field types, so if the project gets data from source for which a type doesn&#8217;t yet exist, it can create a new one.  Each type can have data properties such as string, boolean, text, link, etc.  Nothing is required and everything is repeatable.  Everything &#8212; types, properties, and values &#8212; gets a <acronym title="Uniform Resource Identifier">URI</acronym> (a URI is an identifier like a URL, but conceptually a superset of the universe of URLs).  Titles, authors, subjects, author birthdates, and so on have URIs.  Lastly, the underlying data structures are based on wiki principles: all edits are saved and viewable, anyone can edit any value, anyone can add new types or properties, anyone can develop their own displays, etc.</p><p>The data that is now in Open Library came from a variety of sources.  They started with a copy of books from the Library of Congress, and continue to receive the weekly updates. They performed a crawl of Amazon&#8217;s book data.  They have gotten some from publishers, libraries, and individual users.  The last is perhaps the most interesting because it is mainly people outside the western world who are otherwise having trouble getting their works recognized.</p><p><h3 id="post-1478-h3-Problems-Issues">Problems, Issues, Challenges, and Opportunities with the Data</h3><br />People who use library data without the biases or assumptions of librarians come up with interesting ways to view the data.  Karen described a few of them.</p><dl class="inlineClass"><dt>Names -</dt><dd>&#8220;These library forms of names? Honestly no one but us can stand them.&#8221;  Even something as simple as the form of last-name-comma-first-name is troublesome.  No one else uses this form of the name: Amazon, Wikipedia, etc.  In processing these, any information between parenthesis has been deleted, birth and death dates move into separate field types.</dd><dt>Titles -</dt><dd>In working with the Open Library developers, this is one place that Karen tried insisting on applying a library practice:  knowing the initial article.  For us, this is important for sorting books in alphabetical order.  The developer response &#8212; why do we have to sort in alphabetical order?  &#8220;Where else but library catalogs to we see things sorted in alphabetical order?  Not in Google, not in Amazon, not anywhere.  Alphabetical order is not in the mindset anymore.&#8221;  They also found that the title might include extraneous data.  Amazon, for instance, appends the series title in parenthesis to the main title.  This is a demonstration of how other communities are not as concerned about strongly typing and separating information into fields. Amazon, of course, has reasons for series information into the main title: it helps sell books.</dd><dt>Product dimensions -</dt><dd>Publishers and distributors need to know characteristics of an item such as height, width, depth, and weight; they, of course, need to put it in a box and ship it.  Libraries, concerned about placing the item on the shelf, record just height.  Recording pagination is different, too: libraries use odd notations &#8220;ill. (some col)&#8221; and &#8220;xv, 200p.&#8221; versus simply &#8220;200 pages.&#8221;</dd><dt>Birthdates -</dt><dd>Librarians use birthdates to distinguish names; if there is no need to distinguish a name, birth and death dates are not added.  Someone looking at this from the outside would ask &#8216;Why don&#8217;t all authors have birth and death dates?&#8217;  This can be useful information for viewing the context of an item, not just to distinguish author names.  Open Library ran author names against Wikipedia to pick up not only birth and death years, but also the actual dates.</dd><dt>Subject headings -</dt><dd>Open Library using Library of Congress Subject Headings was out of the question. In processing the data, the Open Library developers just broke them apart into segments and used them. But because they were able to do data mining on the subject field types, they did find statistical relationships between the disassembled precoordinated headings and were able to present those to the user.</dd><dt>The View of the Data -</dt><dd>Rather than a traditional library view of long lists of author-title, the Open Library (in its next version coming in February) will have several different views into the mass of data: Authors; Books (what we would call <acronym title="Functional Requirements for Bibliographic Records">FRBR</acronym> &#8216;manifestations&#8217;); Works; Subjects; and eventually places, publishers, etc.  For example, when searching for an author one would get the author page.  On it would be all of the works from the author as well as other biographical information.  It looks similar to a WorldCat identities page, except it is the actual user interface built into the system.  Similarly, every work will have a page, and at the bottom of it one will see all of the editions of the work.  Also, each subject will have a page, and one will see a list of works with that subject as well as authors who write on that subject.  As Karen said, &#8220;The subject itself becomes an object of interest in the database, not just something that is just tacked on to the bottom of the library record.&#8221;</dd><dt>Data mining -</dt><dd>With the data in this format, it is possible to perform data mining actions against it. For instance, simple data mining such as country of publication, popular places that appear, etc.  When they had the problem of author names &#8212; knowing when to reverse surname and forname &#8212; they ran the names against Amazon and Wikipedia and retained the ones where they found the order of the entry was the same. The Open Library developers are also experimenting with data mining to find publisher names.  Publisher names, of course, vary dramatically, but by using ISBN prefixes they can pull together related items into a &#8220;publisher&#8221; view.</dd></dl><p>Karen suggested watching the <a href="http://edwardbetts.com/ol/" title="Index of /ol">Edward Betts&#8217;s site</a>, one of the developers of the Open Library project with an eye on the data mining aspects.  She said it is fun to look at our data when it can be viewed from this different point-of-view.  She also said to watch out for a new version of the <a href="http://openlibrary.org/" title="Open Library (Open Library)">Open Library website</a> coming in February.</p><p><h2 id="post-1478-h2-Google-Book-Search-Metadata">Google Book Search Metadata</h2><br />The final presenter was <a href="http://www.google.com/profiles/kurt.groetsch" title="Kurt Groetsch's Google Profile">Kurt Groetsch</a>, Technical Collections Specialist at Google where he works to provide understanding and insight into library partner collections and the digitized books from Google.  Kurt said that &#8220;Google has been fairly circumspect over the years about what we do on the Book Search project.&#8221;  He said it was a bit of a cultural legacy from the rest of the company and also possibly an artifact of the copyright litigation, but he is hoping to change that.  His presentation looked at how Google works with book metadata from three vantage points &#8212; the inputs into Google&#8217;s system, parsing by Google&#8217;s algorithms, and analysis and output into the public interfaces.</p><p>On the input side, Google is getting bibliographic metadata from over 100 sources in a variety of formats. MARC records are coming from libraries, union catalogs, commercial providers (OCLC), publishers/retails (one publisher supplies records in MARC format).  Google also gets ONIX records from commercial providers (such as Ingram and Bowker), publishers, and retailers.  Google is especially interested in data from non-U.S. retailers because it is a source of information about books published outside the United States; it helps facilitate discovery of items that they may not otherwise encounter in the <a href="https://books.google.com/partner/">publisher</a> and <a href="http://www.google.com/googlebooks/library.html" title="Google Books Library Project">library</a> programs.  Google also receives records in a variety of &#8220;idiosyncratic formats&#8221; &#8212; for example, publisher-contributed metadata (via the Publisher Partner Program); information associating books with jacket images; name authority records (from LC); reviews; popularity signals (sales data as well as <a name="anonymized_circulation_data">anonymized circulation data</a> from some library partners, useful for feeding into the relevancy ranking algorithm); and internally-generated metadata (for instance, whether a book is commercially available or not).  Google processes all of this information to come up with a single record that describes a book.  At this point they have over 800 million bibliographic records and one trillion bits of information in those records.</p><p>All of these records from all of these sources are processed and remixed with Google&#8217;s parsing algorithms about twice a week.  The first step is to transform the incoming records into a &#8220;less verbose format&#8221; for storage and processing.  It is a SQL-like structure that allows elements of the metadata to be queried.  Records are then parsed to extract specific bits of information, transform the bits as necessary, and write the information to an internal &#8220;resolved records&#8221; data structure (a subset of the data coming from the input formats).  In the presentation, Kurt had examples of how making inferences from data coming from both MARC and ONIX can be troublesome.  Parsing also involves extracting &#8220;bibkeys&#8221; from the records to aid in matching across sources of data.  Four types of identifiers are extracted from bibliographic records: OCLC numbers, <acronym title="Library of Congress Control Numbers">LCCN</acronym>s, ISBNs, and ISSNs.  They provide usually useful signals when matching bibliographic and help with assertions that two records describe the same manifestation.  Google also tries to parse item data when present in records representing multi-volume works, enumeration and chronology.  They will also treat barcode as a form of a &#8220;bibkey&#8221; if they get it from a library.  The parsing algorithm will also split records containing multiple ISBNs representing different product forms (e.g. hardback, paperback, etc.).</p><p>With all of this data parsed into records, Google starts its clustering process where records are examined and attached to each other.  Bibkeys provide significant evidence for relating records to each other, but bibkeys are not always present in a record (non-U.S. records and older records frequently contain no bibkeys).  The algorithms then fall back on text similarity matching using title, subtitle, contributor and other fields such as publisher and publication year.  The results are clusters of records representing the same manifestation. An algorithm then attempts to derive the &#8220;best-of&#8221; record for a single cluster from all of the parsed input records.  This is done in a field-by-field voting process based on the trustworthiness of individual fields from record sources.</p><p>Kurt went into some of the challenges facing the team building the clustering and best-of record creation algorithms.  For instance, in dealing with multivolume works they know of 5 numbering schemas with 3 number types in 15 different languages.  Enumeration is now showing in the public display, but the development team is still working with unparsable item data due to inconsistent cataloging practices between institutions&#8230;and sometimes inconsistencies within an institution.  Another problem is non-unique identifiers. In the current data set ISBN 7899964709 is shared by 75 books and ISBN 7533305353 is associated with 1413 books. There are also poor quality or &#8220;junk records&#8221;.  Kurt said his favorite was &#8220;The Mosaic Navigator&#8221; by Sigmund Freud published in 1939.  These are hard to identify with an algorithm, and they rely on reports of problems that enable the developers to go in and &#8220;kill&#8221; the troublesome record.  Another example is a book by Virginia Woolf where the incoming record had conflicting information; it had two 260 fields that contained different dates (1961, correct, and 1900) with fixed field information that strongly suggested that 1900 was the single date of publication.  When the data problem is systematic, they can identify it and compensate for it.  Kurt&#8217;s example for this case was &#8220;The United States Since 1945&#8243; published in 1899.  This one was highlighted in <a href="http://chronicle.com/article/Googles-Book-Search-A/48245/" title="Google's Book Search: A Disaster for Scholars - The Chronicle Review - The Chronicle of Higher Education">Geoffrey Nunberg&#8217;s criticism of Google Books metadata</a>.  In this case, there was a source of metadata from Brazil that when they didn&#8217;t know the date of publication would use 1899.  When Google went back and looked at the date distribution of books there was a huge spike in 1899.  Once Google knew about it they were able to go in and kill that information from that source of records. <sup><a href="http://dltj.org/article/mashups-of-bib-data/#footnote_2_1478" id="identifier_2_1478" class="footnote-link footnote-identifier-link" title="A side note: Google isn&amp;#8217;t the only one tripped up by this.  If one searches for the ISBN of the item, 0195038487, you get to more than one site that has the same incorrect publication date.  At least Google is attempting to clean up the data!">3</a></sup></p><p>In closing, Kurt said that Google is committed to engaging with the library community on improving metadata and metadata processing.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://www.niso.org/publications/white_papers/Stream lineBookMetadataWorkflowWhitePaper.pdf to http://www.niso.org/publications/white_papers/StreamlineBookMetadataWorkflowWhitePaper.pdf on January 19th, 2011.</p><p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://www.oclc.org/speakers/bios/register_renee.htm on February 11th, 2011.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_1478" class="footnote">For those not familiar with <a href="http://www.editeur.org/8/ONIX/" title="ONIX Overview">ONIX</a>, it is a suite of standards promulgated by <a href="http://www.editeur.org/" title="EDItEUR homepage" rel="homepage">EDItEUR</a> for the interchange of information on books and serial publications.  It is primarily used as the communication channel between the publishing industry through distribution chains to retail establishments.</li><li id="footnote_1_1478" class="footnote">By the way, it seems like BISAC is an acronym for &#8220;Book Industry Systems Advisory Committee&#8221;, the former name of the <a href="http://www.bisg.org/" title="Book Industry Study Group homepage" rel="homepage">Book Industry Study Group</a>.</li><li id="footnote_2_1478" class="footnote">A side note: Google isn&#8217;t the only one tripped up by this.  If one searches for the ISBN of the item, 0195038487, you get to <a href="http://www.biggerbooks.com/book/9780195038484" title="The United States Since 1945 at BiggerBooks.com -  Leuchtenburg, 9780195038484, History">more</a> <a href="http://www.chegg.com/details/the-united-states-since-1945/0195038487/" title="Chegg.com: The United States Since 1945 by Leuchtenburg">than</a> <a href="http://www.amazon.co.uk/The-United-States-Since-1945/dp/0195038487" title="The United States Since 1945: Amazon.co.uk: Books">one</a> site that has the same incorrect publication date.  At least Google is attempting to clean up the data!</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/mashups-of-bib-data/feed/</wfw:commentRss> <slash:comments>23</slash:comments> </item> <item><title>More on What Does It Mean to Be a Member of OCLC</title><link>http://dltj.org/article/oclc-social-contract-2/</link> <comments>http://dltj.org/article/oclc-social-contract-2/#comments</comments> <pubDate>Mon, 18 Jan 2010 15:35:45 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[ALA Midwinter Conference 2010]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1467</guid> <description><![CDATA[Jay Jordan&#8217;s remarks during the OCLC Update Breakfast and the discussion at the Developers Network table at that breakfast generated further fuel for my previous philosophical thoughts on &#8220;Who is a member of the OCLC Cooperative?&#8221; In the context of &#8230; <a href="http://dltj.org/article/oclc-social-contract-2/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1467"></abbr><p>Jay Jordan&#8217;s remarks during the <a href="http://dltj.org/article/alamw10-oclc-update/">OCLC Update Breakfast</a> and the discussion at the <a href="http://www.worldcat.org/devnet/wiki/Main_Page" title="Main Page - WorldCat Developers' Network">Developers Network</a> table at that breakfast generated further fuel for my previous philosophical thoughts on &#8220;<a href="http://dltj.org/article/oclc-social-contract/">Who is a member of the OCLC Cooperative?</a>&#8221;  In the context of things like Developer Network API keys<sup><a href="http://dltj.org/article/oclc-social-contract-2/#footnote_0_1467" id="identifier_0_1467" class="footnote-link footnote-identifier-link" title="&amp;#8220;API&amp;#8221; is an acronym for Application Programming Interface.  In summary, an API is the set of rules by which one program can task another program for data or to perform a service.  An &amp;#8220;API key&amp;#8221; is the mechanism through which a requesting application establishes the right to be able to use data and services of another application. In this context, the holder of an OCLC Developer Network API key can access the wealth of data and services being offered by OCLC.">1</a></sup> this question of who is a member of OCLC the cooperative and who is not meets the on-or-off, ones-and-zeros nature of computers.  One can&#8217;t &#8220;kinda&#8221; have an API Key unless that capability is programmed into the software (or a human chooses to override the established rules for who has a key).</p><p>The discussion around the table after Jay&#8217;s remarks tested some of the &#8220;edge cases&#8221; to the established hard-and fast rules.  OCLC Governance, at least as I understand it, surrounds institutions who are members of the OCLC Cooperative.  By implication, the benefits (and also the responsibilities) of membership in the OCLC Cooperative transfer to staff employed at those member institutions.  People who work for governing (institutional) members of OCLC can get Developer Network API keys to create services for their library.  And except for the fuzzy notion explored previously whether individuals are members of the OCLC cooperative, this all seems pretty clear.</p><p>But what about individuals in the employment of a library that is not a member of OCLC that want to create applications that benefit library users.  Further, following Jay Jordan&#8217;s definition of partners OCLC wants to work with (as expressed during the <a href="http://dltj.org/article/alamw10-record-use-policy/">OCLC Record Use Policy Council Update</a>), that non-institutional-member individual can return value back to the cooperative (by contributing the ideas to the Developer&#8217;s Network Showcase, by contributing working code to a software repository, etc.).  Isn&#8217;t that returning value to the cooperative? Should this person be granted a Developer Network API key even though their institution is not a member?  Can we conceive of a process and guidelines by which this could happen?  (Has it perhaps already happened in the human-created fuzz of OCLC-the-stewards overriding the established rules?)</p><p>I&#8217;ll state here that I would support the creating of a process by which anyone &#8212; from a member library or a consultant doing work on behalf of a member library or a 12-year-old kid) can request a Developer Network API key. What is needed is a clear understanding that those who get such keys gain some kind of adjunct membership in the OCLC-the-cooperative, and as so are expected to return value to the cooperative through their work with the API key.  (There would need to be transparency from OCLC-the-steward on who in this category is getting keys, who is not, and for what reasons &#8212; at least in summary.)  The area of consultants or other agents doing work on behalf of a library is an area that would need further exploration.  I would offer that it is okay for one such agent to use an API key to help a particular client, but I also agree with Jay that it is not okay for that same agent to use that key to derive revenue from helping non-OCLC members with the same key.  The latter fails the renumeration-back-to-the-cooperative  test (either financial or intangible benefit) that Jay remarked on during <a href="http://dltj.org/article/alamw10-record-use-policy/">Saturday&#8217;s Record Use Council session</a> and one that the Record Use Council seems to be struggling with.</p><p>These are tough but interesting issues.  We&#8217;ve left the days where the benefits of membership were tied to the delivery of shelf-ready cards and dedicated terminals for online original and copy cataloging.  OCLC seems to be transforming itself from a library-used service to a world-used service.  I give OCLC-the-stewards credit for listening to OCLC-the-membership about appropriate ways to make use of the shared resource that is WorldCat, and credit to OCLC-the-membership for pushing OCLC-the-stewards into creating ways to expand access to that shared resource.<p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://dltj.org/article/alamw10-oclc-record-use to http://dltj.org/article/alamw10-record-use-policy/ on December 31st, 2010.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://dltj.org/article/alamw10-oclc-record-use to http://dltj.org/article/alamw10-record-use-policy/ on December 31st, 2010.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_1467" class="footnote">&#8220;API&#8221; is an acronym for <a href="http://en.wikipedia.org/wiki/Application_programming_interface" title="Application programming interface - Wikipedia, the free encyclopedia">Application Programming Interface</a>.  In summary, an API is the set of rules by which one program can task another program for data or to perform a service.  An &#8220;API key&#8221; is the mechanism through which a requesting application establishes the right to be able to use data and services of another application. In this context, the holder of an <a>OCLC Developer Network API key</a> can access the wealth of data and services being offered by OCLC.</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/oclc-social-contract-2/feed/</wfw:commentRss> <slash:comments>3</slash:comments> </item> </channel> </rss>
<!-- Served from: dltj.org @ 2012-02-11 08:55:06 by W3 Total Cache -->
