<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"><channel><title>Disruptive Library Technology Jester &#187; WorldCat</title> <atom:link href="http://dltj.org/tag/worldcat/feed/" rel="self" type="application/rss+xml" /><link>http://dltj.org</link> <description>We&#039;re Disrupted, We&#039;re Librarians, and We&#039;re Not Going to Take It Anymore</description> <lastBuildDate>Fri, 18 May 2012 15:43:10 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <cloud domain='dltj.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' /> <creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/us/</creativeCommons:license> <item><title>WorldCat May Become Available as Library Linked Data under ODC-BY</title><link>http://dltj.org/article/worldcat-lld-may-become-available-under-odc-by/</link> <comments>http://dltj.org/article/worldcat-lld-may-become-available-under-odc-by/#comments</comments> <pubDate>Wed, 18 Apr 2012 02:46:39 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[Digital Public Library of America]]></category> <category><![CDATA[Europeana]]></category> <category><![CDATA[Library Linked Data]]></category> <category><![CDATA[licensing]]></category> <category><![CDATA[linked data]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[Open Data Commons]]></category> <category><![CDATA[WorldCat]]></category> <category><![CDATA[WorldShare]]></category><guid isPermaLink="false">http://dltj.org/?p=3665</guid> <description><![CDATA[On the second day of the OCLC Global Council meeting [agenda PDF] there was a presentation by Robin Murray (VP, OCLC Global Product Management) and Jim Michalko (VP, OCLC Research Library Partnership) called &#8220;Linked Open Data&#8221;. The title of the &#8230; <a href="http://dltj.org/article/worldcat-lld-may-become-available-under-odc-by/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=3665"></abbr><p>On the second day of the <a href="http://www.oclc.org/content/dam/oclc/forms/en/april-gc-agenda.pdf" title="OCLC Global Council Agenda, 16-18 Apr 2012 [PDF]">OCLC Global Council meeting</a> [agenda PDF] there was a presentation by Robin Murray (VP, OCLC Global Product Management) and Jim Michalko (VP, OCLC Research Library Partnership) called &#8220;Linked Open Data&#8221;.  The title of the presentation was an understatement because the real heart of the matter was <em>WorldCat</em> data as linked open data.  The presentation was about an hour long, and despite the technical difficulties was fascinating to listen to through the <a href="http://www.oclc.org/livefeed/" title="OCLC Global Council Annual Meeting 2012 live feed">streaming video feed</a>.  OCLC says the archive of the meeting will be available at some point, and I urge you to check it out when it becomes available.</p><p>Robin Murray&#8217;s first half of the presentation walked through OCLC&#8217;s thoughts about how their implementation of linked data is evolving along side their programatic API offerings.  (Almost as an aside, Robin said that use of the WorldCat API has been growing steadily and is now up to 15 million hits per month &#8212; or 347 hits/minute.  Wow!)  He had a really slick diagram that showed the exposure of data through linked data services and API services across a half-dozen OCLC products.  He called library linked data the &#8220;Data Exposure Service&#8221; of the WorldShare platform.  There was one slide that I did copy down: with a title of &#8220;Opportunity or Threat&#8221;, Robin proposed:</p><blockquote><p>IF, collectively we can re-envision cataloging as &#8216;registering nodes in a global web of data&#8217;;</p><p>AND we can position WorldCat as the trusted, global web of library data;</p><p>THEN this dramatically increases the global value and utility of metadata management with WorldCat.</p></blockquote><p>Robin went on to say that he sees publishing the OCLC cooperative&#8217;s data assets as linked data is more opportunity than threat.  That doing so grows the possible base of funding support for WorldCat; or, as he put it a &#8220;significant opportunity to &#8216;Grow the Denominator&#8217;&#8221; (where the fixed costs of adding value to member contributed records is the numerator).</p><p>Jim Michalko&#8217;s second half of the presentation walked through the history of the cooperative&#8217;s <a href="http://www.oclc.org/worldcat/recorduse/policy/default.htm" title="WorldCat Rights and Responsibilities for the OCLC Cooperative | OCLC">Rights and Responsibilities document</a>, past discussions about publishing WorldCat data, and lead up to a recommendation that OCLC leadership had for the Global Council and the Board of Trustees:  publish WorldCat data as linked data with an <a href="http://opendatacommons.org/licenses/by/summary/" title="ODC-BY attribution license summary | Open Data Commons">Open Data Commons &#8220;BY&#8221; attribution license</a> (a.k.a. ODC-BY).  In fact, Jim explicitly said &#8220;no restrictions on commercial reuse&#8221; of individual records.</p><p>After the presentation and a short question/answer period, there was to be discussions at the tables where one of the questions was whether the Global Council would recommend to the Board of Trustees the adoption of ODC-BY to WorldCat data.  Each table was to report the summary of their deliberations, but unfortunately that part of the meeting wasn&#8217;t webcast.  We may have to wait for formal minutes of the meeting to be published to see what the conclusion of the discussion was.</p><p>This discussion at Global Council about WorldCat data follows a similar announcement from Thom Hickey about the <a href="http://outgoing.typepad.com/outgoing/2012/04/viaf-developments.html" title="VIAF Developments | Outgoing">Virtual International Authority File being published with an ODC-BY license</a> with attribution to <a href="http://viaf.org/" title="VIAF homepage">VIAF</a>.  Jonathan Rochkind posted his appreciation and approval of the VIAF move but <a href="https://bibwild.wordpress.com/2012/04/09/re-usable-linked-big-data-for-real/" title="Re-usable linked big data for real | Bibliographic Wilderness">predicted a couple of pain points</a> with the ODC-BY licensing.  What he says there is also true of an ODC-BY licensed WorldCat.</p><p>Both Robin and Jim made a point of modeling their practices on what is happening in the general linked data community, but I&#8217;m concerned that a Google search for &#8220;<a href="http://www.google.com/search?q=ODC-BY+linked+data" title="Google search of &#038;039;ODC-BY linked data&#038;039;">ODC-BY linked data</a>&#8221; doesn&#8217;t show many examples beyond OCLC&#8217;s efforts.  It is quite possible that my Google results are artificially skewed towards library results. <del datetime="2012-04-18T13:54:09+00:00">On the <a href="http://code4lib.org/irc/" title="IRC | code4lib">Code4Lib IRC channel</a> it was noted, for instance, that the <a href="http://linkedgeodata.org/" title="Linked Geo Data homepage">Linked Geo Data</a> (derived from <a href="http://www.openstreetmap.org" title="OpenStreetMap homepage">Open Street Map</a> project) is or soon will be ODC-BY.</del> <ins datetime="2012-04-18T13:54:09+00:00">[See clarification in the comments.]</ins> If there are other examples, I&#8217;d appreciate hearing about them in the comments.</p><p>There was also a discussion about whether WorldCat data could be given by members to <a href="http://www.europeana.eu/portal/" title="Europeana homepage">Europeana</a>, which <a href="https://creativecommons.org/weblog/entry/29133">announced</a> it is using the <a href="http://creativecommons.org/about/cc0" title="About CC0 &mdash; &#8220;No Rights Reserved&#8221; | Creative Commons">CC0 public domain dedication</a>.  There is an important mismatch between CC0 and ODC-BY &#8212; notably the attribution requirement in the latter that isn&#8217;t in the former.  I can&#8217;t faithfully recall direct quotes from the discussion &#8212; you&#8217;ll have to watch the video archive &#8212; but the summary I remember was that doing so would be okay as long as the member was being reasonable about what was being shared with Europeana.  And that the same would hold true for the Digital Public Library of America.</p><p>On the whole, the presentation and discussion were fascinating to follow, and from my point of view represents a welcome and appropriate liberalization of how WorldCat data can be reused and the intention of OCLC to public bibliographic (and other data) to become a stable, generally usable anchor of library linked data.  This is an important milestone to putting libraries and library data at the core of the linked open data movement, and it will spur innovation and uses that we can only dream about now.</p><p><h2>Update &#8212; 18-Apr-2012</h2><br />News from Twitter:<br /><blockquote class="twitter-tweet"><p>Global Council passes resolution advising move to Open Data Commons Attribution License ODC-BY <a href="https://twitter.com/search/%2523oclcgc">#oclcgc</a></p><p>&mdash; OCLCEMEARC (@OCLCEMEARC) <a href="https://twitter.com/OCLCEMEARC/status/192636783534682115" data-datetime="2012-04-18T15:32:42+00:00">April 18, 2012</a></p></blockquote><p><script src="//platform.twitter.com/widgets.js" charset="utf-8"></script><br />As I recall the process, it now moves to approval by the OCLC Board of Trustees.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/worldcat-lld-may-become-available-under-odc-by/feed/</wfw:commentRss> <slash:comments>68</slash:comments> </item> <item><title>Real Life Example of Creative Commons License Applied to MARC Records</title><link>http://dltj.org/article/cc0-marc-records/</link> <comments>http://dltj.org/article/cc0-marc-records/#comments</comments> <pubDate>Fri, 18 Mar 2011 16:45:06 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[policy]]></category> <category><![CDATA[cc0]]></category> <category><![CDATA[Creative Commons]]></category> <category><![CDATA[MARC]]></category> <category><![CDATA[University of Florida]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=2727</guid> <description><![CDATA[Eric Morgan posted a message to the Next Generation Catalog for Libraries mailing list this morning that points to a announcement by the University of Florida library that they are now applying a Creative Commons Public Domain Dedication statement to &#8230; <a href="http://dltj.org/article/cc0-marc-records/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2727"></abbr><p><a href="http://infomotions.com/" title="Infomotions, LLC">Eric Morgan</a> posted a <a href="http://article.gmane.org/gmane.culture.libraries.ngc4lib/9018" title="NGC4LIB mailing list message with the subject 'university of florida' by Eric Morgan on March 18,2011 | Gmane">message</a> to the Next Generation Catalog for Libraries mailing list this morning that points to a <a href="http://www.uflib.ufl.edu/catmet/creativecommons.html" title="Creative Commons License | University of Florida George A. Smathers Libraries">announcement</a> by the <a href="http://www.uflib.ufl.edu/" title="University of FLorida George A. Smathers Libraries homepage">University of Florida library</a> that they are now applying a <a href="http://creativecommons.org/publicdomain/zero/1.0/" title="CC0 1.0 Universal | Creative Commons">Creative Commons Public Domain Dedication</a> statement to <abbr title="MAchine Readable Cataloging">MARC</abbr> records they create.  Their announcement says:</p><blockquote><p>Beginning March 2011, the University of Florida Smathers Libraries implemented a policy to include a Creative Commons license in all of its original cataloging records. The records are considered public domain with unrestricted downstream use for any purpose.</p><p>The following MARC 588 field (Source of Description Note) is added to new records contributed to WorldCat. It has not been added retrospectively to University of Florida original records in WorldCat.</p><p style="padding-left:2em;font-family:monospace;">588::|a This bibliographic record is available under a Creative Commons CC0 license. The University of Florida Libraries, as creator of this bibliographic record, has waived all rights to it worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law.</p></blockquote><p>Their announcement page also provides links to some examples from the OPAC.  Scroll down to the bottom of the page to see the CC0 graphic and declaration.</p><ul><li><a href="http://uf.catalog.fcla.edu/permalink.jsp?20UF005023800" title="http://uf.catalog.fcla.edu/permalink.jsp?20UF005023800">Latinoamericanismo : historia intelectual de una geografía inestable</a></li><li> <a href="http://uf.catalog.fcla.edu/permalink.jsp?20UF005056555" title="http://uf.catalog.fcla.edu/permalink.jsp?20UF005056555">Mapa Everest de carreteras, España y Portugal</a></li><li> <a href="http://uf.catalog.fcla.edu/permalink.jsp?20UF005023882" title="http://uf.catalog.fcla.edu/permalink.jsp?20UF005023882">The Song of Ceylon</a></li></ul><p>The University of Florida joins <a href="http://www.lib.umich.edu/open-access-bibliographic-records" title="Open Access Bibliographic Records Available for Download and Use | Library Information Technology | MLibrary">University of Michigan</a> in making original cataloging records available under CC0.  To refresh your memory CC0 &#8220;Public Domain Dedication&#8221; statement (<a href="http://www.plagiarismtoday.com/2009/02/25/cc0-waiving-copyrights/" title="CC0: Waiving Copyrights">it isn&#8217;t a license!</a>) says:<br /><blockquote><p>The person who associated a work with this deed has <b>dedicated</b> the work to the public domain by waiving all of his or her rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law.</p><p>You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.</p></blockquote><p>Now this is an interesting development because it is a kind of viral declaration of the same sort that <a href="http://wiki.code4lib.org/index.php/OCLC_Policy_Change" title="OCLC Policy Change - Code4Lib">OCLC proposed</a> with the <a href="http://dltj.org/article/oclc-review-board-initial-recommendations/">withdrawn</a> draft of the record use policy.  There OCLC was going to add a <a href="http://www.oclc.org/us/en/bibformats/en/9xx/996.shtm" title="996 WorldCat Record Use Policy Link [OCLC]">996 field</a> to all records exported from WorldCat that would say:</p><table border="0" cellpadding="1" cellspacing="0" style="padding-left:2em;font-family:monospace;"><tbody><tr valign="top"><td width="30" align="left">996</td><td width="10" align="right"></td><td width="17" align="left"></td><td width="90%" align="left">OCLCWCRUP &Dagger;i Use and transfer of this record is governed by the OCLC&reg; Policy for Use and Transfer of WorldCat&reg; Records &Dagger;u http://purl.org/oclc/wcrup</td></tr></tbody></table><p>This would do something similar except that it would make viral the public domain declaration on records added to OCLC WorldCat.  (It is probably also an oversight that the <a href="http://www.oclc.org/us/en/bibformats/en/9xx/996.shtm" title="996 WorldCat Record Use Policy Link [OCLC]">996 field</a> documentation is still on OCLC&#8217;s site.)  Does this begin to segment WorldCat into records that can and cannot be used?  Or is it redundant since some think that MARC records, as a recitation of facts, cannot be copyrighted anyway?</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/cc0-marc-records/feed/</wfw:commentRss> <slash:comments>15</slash:comments> </item> <item><title>Thursday Threads: RDA Revolt, Google Book Search Algorithm, Google Helps Improve Web Servers, Google&#8217;s Internet Traffic Hugeness</title><link>http://dltj.org/article/thursday-threads-2010w44/</link> <comments>http://dltj.org/article/thursday-threads-2010w44/#comments</comments> <pubDate>Thu, 04 Nov 2010 16:00:57 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Thursday Threads]]></category> <category><![CDATA[Google]]></category> <category><![CDATA[Google Book Search]]></category> <category><![CDATA[internet]]></category> <category><![CDATA[mod_pagespeed]]></category> <category><![CDATA[PageRank]]></category> <category><![CDATA[Resource Description and Access]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1829</guid> <description><![CDATA[Receive DLTJ Thursday Threads by E-mail!Enter your email address:Delivered by FeedBurner This week is a mostly Google edition of DLTJ Thursday Threads. Below is a high-level overview of Google&#8217;s Book Search algorithm, how Google is helping web servers improve the &#8230; <a href="http://dltj.org/article/thursday-threads-2010w44/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1829"></abbr><div id="feedburner-thursday-threads-email" class="wp-caption alignright" style="width: 310px;;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><form style="padding:3px;text-align:center;" action="http://feedburner.google.com/fb/a/mailverify" method="post" target="popupwindow" onsubmit="window.open('http://feedburner.google.com/fb/a/mailverify?uri=thursday-threads', 'popupwindow', 'scrollbars=yes,width=550,height=520');return true"><p><a href="http://feedburner.google.com/fb/a/mailverify?uri=thursday-threads&#038;loc=en_US" title="FeedBurner Email Subscription for DLTJ Thursday Threads">Receive <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> Thursday Threads by E-mail!</a><br /><label for="email">Enter your email address:</label></p><input type="text" style="width:140px" name="email"/><input type="hidden" value="thursday-threads" name="uri"/><input type="hidden" name="loc" value="en_US"/><input type="submit" value="Subscribe" /><p style="font-size: 80%">Delivered by <a href="http://feedburner.google.com" target="_blank" title="Google Feedburner Service">FeedBurner</a></p></form></div><p> This week is a mostly Google edition of <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> <a href="http://dltj.org/category/thursday-threads/">Thursday Threads</a>.  Below is a high-level overview of Google&#8217;s Book Search algorithm, how Google is helping web servers improve the speed at which content loads, and how Google&#8217;s internet traffic is growing as a percentage of all internet traffic.  But first, there is an uprising on the RDA test records in the WorldCat database.</p><p>If you find these interesting and useful, you might want to add the <a href="http://feeds.dltj.org/thursday-threads/">Thursday Threads RSS Feed</a> to your feed reader or subscribe to e-mail delivery using the form to the right.  If you would like a more raw and immediate version of these types of stories, watch <a href="http://friendfeed.com/dltj" title="Peter Murray - FriendFeed">my FriendFeed stream</a> (or subscribe to <a href="http://friendfeed.com/dltj?format=atom" title="Atom feed for Peter Murray's FriendFeed account">its feed</a> in your feed reader).  Comments and tips, as always, are welcome.<br /><br /><h2>Memorandum Against RDA Test</h2></p><blockquote><p>We have found ourselves in an unenviable position of opposing the work that supposedly has been authorized by agencies representing our interests. I might compare it to a military coup d’état. I mean here the RDA “test” and its implications on the cataloging world at large. After extensive discussions on the PCC, OCLC cataloging e-mail lists with opinions from the British Library, Australia and North America, we can safely conclude that there is a broad consensus against principles of RDA and the way RDA “test” has been imposed on the cataloging world.</p></blockquote><p>The <a href="http://cdn.dltj.org/wp-content/uploads/2010/11/wa.exe?A2=ind1011a&amp;L=oclc-cat&amp;D=0&amp;F=P&amp;T=0&amp;X=5D7D8800A8770C99F0&amp;P=2298" title="OCLC-CAT post &#039;November 2010 Memorandum Against RDA Test&#039; by Wojciech Siemaszkiewicz" class="broken_link" rel="nofollow">original post on the OCLC-CAT list</a> by Wojciech Siemaszkiewicz of the New York Public Library is behind a must-subscribe-and-authenticate form, but it has been copied out <a href="http://cdn.dltj.org/wp-content/uploads/2010/11/oclc-cat-rda.txt.gzip" title="Public copy of OCLC-CAT post 'November 2010 Memorandum Against RDA Test' by Wojciech Siemaszkiewicz">copied to an open website</a> by Becky Yoose (thanks, Becky!).  The subsequent discussion resulted in a <a href="http://www.ipetitions.com/petition/norda/" title="Memorandum Against RDA Test">Petition against the RDA Test</a> by Jacqueline Byrd at Indiana University.  The link to the position has been <a href="http://article.gmane.org/gmane.education.libraries.autocat/35094" title="Gmane -- Mail To News And Back Again">posted to the open AUTOCAT list</a>, and there has been <a href="http://article.gmane.org/gmane.education.libraries.autocat/35108" title="Gmane -- Mail To News And Back Again">subsequent</a> <a href="http://article.gmane.org/gmane.education.libraries.autocat/35109" title="Gmane -- Mail To News And Back Again">discussion</a> <a href="http://article.gmane.org/gmane.education.libraries.autocat/35114" title="Gmane -- Mail To News And Back Again">there</a>. (<a href="http://friendfeed.com/lsw/4e3f279b/for-anyone-else-who-has-recently-heard" title="For anyone else who has recently heard reference... - LSW - FriendFeed">Hat tip to Kirsten Davis</a>.)</p><p><h2>Inside the Google Books Algorithm</h2></p><blockquote><p>Rich Results is the latest in a series of smaller front-end tweaks that have been matched by backend improvements. Now, the book search algorithm takes into account more than 100 &#8220;signals,&#8221; individual data categories that Google statistically integrates to rank your results. When you search for a book, Google Books doesn&#8217;t just look at word frequency or how closely your query matches the title of a book. They now take into account web search frequency, recent book sales, the number of libraries that hold the title, and how often an older book has been reprinted.</p></blockquote><p>Alexis Madrigal <a href="http://www.theatlantic.com/technology/archive/2010/11/inside-the-google-books-algorithm/65422/" title="Inside the Google Books Algorithm - Alexis Madrigal - Technology - The Atlantic">article</a> in TheAtlantic.com draws a comparison between the techniques and algorithms used for web search with those used for book materials.  The need for relevant search results is the same, but books don&#8217;t have the same inter-page linking hints that drive the <a href="http://en.wikipedia.org/wiki/PageRank" title="PageRank | Wikipedia">PageRank</a> algorithm for web search.  The use of <a href="http://dltj.org/article/mashups-of-bib-data/#anonymized_circulation_data">anonymized circulation data</a> in creating clustered bibliographic descriptions was mentioned at the ALA Midwinter ALCTS Forum on Mashups of Bibliographic Data, and apparently it is also used in the relevance ranking of Google Books search results.  (Hat tip to Ron Murray.)</p><p><h2>Google Releases mod_pagespeed</h2></p><blockquote><p>mod_pagespeed is an open-source Apache module that automatically optimizes web pages and resources on them. It does this by rewriting the resources using filters that implement web performance best practices. Webmasters and web developers can use mod_pagespeed to improve the performance of their web pages when serving content with the Apache HTTP Server. mod_pagespeed includes several filter that optimize JavaScript, HTML and CSS stylesheets. It also includes filters for optimizing JPEG and PNG images. The filters are based on a set of best practices known to enhance web page performance. Webmasters who set up mod_pagespeed in addition to configuring proper caching and compression on their Apache distribution should expect to see an improvement in the loading time of the pages on their websites.</p></blockquote><p>Google has promoted <a href="http://code.google.com/speed/page-speed/" title="Page Speed Home | Google Code">best practices for improving the rate at which web pages load</a> for a number of years.  This week they introduced <a href="http://code.google.com/speed/page-speed/docs/module.html" title="mod_pagespeed Overview | Google Code">mod_pagespeed</a>: an Apache web server module that brings these practices to bear by rewriting HTML, JavaScript, and Cascading Style Sheets on-the-fly.  Since Google now includes the <a href="http://googlewebmastercentral.blogspot.com/2010/04/using-site-speed-in-web-search-ranking.html" title="Using site speed in web search ranking | Google Webmaster Central Blog">speed at which pages are rendered in a browser as a factor in ranking search results</a>, this would seem to be a good module to explore for anyone running an Apache web server with public content.  (<a href="http://friendfeed.com/edsu/3fa00167/official-google-webmaster-central-blog-make" title="Official Google Webmaster Central Blog: Make your... - Ed Summers - FriendFeed">Hat tip to Ed Summers</a>.)</p><p><div class="wp-caption alignright" style="width: 290px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://asert.arbornetworks.com/2010/10/google-breaks-traffic-record/" title="Google Sets New Internet Traffic Record | Security to the Core | Arbor Networks Security"><img alt="A graph showing a rising percentage from roughly one percent in June 2007 to six percent in October 2010" src="http://m.friendfeed-media.com/945015fd271257990df9350241a36c80ea490542" title="Google as a percentage of all internet traffic" width="280" height="175" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Google as a percentage of all internet traffic</p></div><h2>Google Sets Internet Traffic Record</h2></p><blockquote><p>Google now represents an average 6.4% of all Internet traffic around the world. This number grows even larger (to as much as 8-12%) if I include estimates of traffic offloaded by the increasingly common Google Global Cache (GGC) deployments and error in our data due to the extremely high degree of Google edge peering with consumer networks. Keep in mind that these numbers represent increased market share — Google is growing considerably faster than overall Internet volumes which are already increasing 40-45% each year.</p></blockquote><p>Craig Labovitz of Arbor Networks <a href="http://asert.arbornetworks.com/2010/10/google-breaks-traffic-record/" title="Google Sets New Internet Traffic Record | Security to the Core | Arbor Networks Security">notes</a> that if Google were an internet service provider, it would now be &#8220;the second largest carrier on the planet.&#8221;  Wow!  That is a lot of data sloshing around on its own internal network!</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/thursday-threads-2010w44/feed/</wfw:commentRss> <slash:comments>9</slash:comments> </item> <item><title>Views on Sharing (or, What Do We Want From OCLC?)</title><link>http://dltj.org/article/views-on-sharing/</link> <comments>http://dltj.org/article/views-on-sharing/#comments</comments> <pubDate>Wed, 29 Sep 2010 01:51:16 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[Carl Grant]]></category> <category><![CDATA[cooperatives]]></category> <category><![CDATA[discovery]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[privacy]]></category> <category><![CDATA[SkyRiver/Innovative versus OCLC lawsuit]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1681</guid> <description><![CDATA[Within the span of a recent week we&#8217;ve had two views of the OCLC cooperative. In one we have a proposition that OCLC has gone astray from its core roots and in the other a celebration of what OCLC can &#8230; <a href="http://dltj.org/article/views-on-sharing/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1681"></abbr><p>Within the span of a recent week we&#8217;ve had two views of the OCLC cooperative.  In one we have a proposition that OCLC has gone astray from its core roots and in the other a celebration of what OCLC can do.  One proposes a new mode of cooperation while the other extols the virtues of the existing cooperative.  Both writers claim &#8212; independently &#8212; to &#8220;talk to librarians&#8221; and represent the prevailing mood of the profession.  Can these two viewpoints be reconciled?</p><p><h2>&#8220;Too Many Cooks?&#8221;</h2><br />The pro-establishment view first.  In a <a href="http://community.oclc.org/cooperative/2010/09/too-many-cooks.html" title="Too many cooks? - The OCLC Cooperative Blog">post</a> by <a href="http://www.oclc.org/speakers/bios/nilges_chip.htm" title="William &#038;039;Chip&#038;039; Nilges [OCLC]">Chip Nilges</a> on the <a href="http://community.oclc.org/cooperative/" title="The OCLC Cooperative Blog: Insights and information from OCLC staff on topics that are fundamental to your cooperative.">OCLC Cooperative Blog</a>, we get the view that the backing of the wider librarian community is key to OCLC being able to <a href="http://www.oclc.org/news/releases/2010/201049.htm" title="H.W. Wilson databases indexed in WorldCat Local [OCLC]">negotiate with content vendors like H.W. Wilson</a>.  Chip&#8217;s &#8220;talk to librarians&#8221; quote is:<br /><blockquote>I spend quite a bit of time talking both to librarians and industry partners&#8211;publishers, booksellers, Web-technology providers, search engine companies&#8211;all kinds of people doing interesting things in our space. And in those talks, there is often a discussion of one of the following: content, technology or community. What I&#8217;ve come to realize, though, is that the best results come from places where all three come together.</p></blockquote><p> Chip&#8217;s post is short but clear in its view that the community of OCLC members is something special and that it adds value to member libraries.</p><p><h2>&#8220;The Cooperative We Need&#8221;</h2><br />The other perspective comes from <a href="http://www.exlibrisgroup.com/?catid=%7B795BD8B6-47DE-4722-8D5D-B664EEEFB34C%7D" title="Bio: Carl Grant">Carl Grant</a> in a <a href="http://commentary.exlibrisgroup.com/2010/09/cooperative-we-need-open-collaborative.html" title="The cooperative we need: Open &amp; Collaborative Library Content" class="broken_link" rel="nofollow">post</a> on his <a href="http://commentary.exlibrisgroup.com/" title="Commentary from Carl Grant" class="broken_link" rel="nofollow">Ex Libris blog</a>.  His thesis is that OCLC has an important role to play in adding value to bibliographic data, but that its motives are too intertwined with for-profit interests to carry out this role effectively.  Carl&#8217;s &#8220;talk to librarians&#8221; quote is:<br /><blockquote>It appears to me that the interests of the OCLC we know today do not appear to be in total alignment with the needs and interests of its overall actual membership. Perhaps they are in alignment with the interests of the Board, Council, and other governing and administrative arms, but the feeling I get in talks with librarians is that it is not in alignment with what they want. As I talk to librarians, across the country today, I hear that what they want is an organization, a cooperative that is focused on developing and providing open and collaborative library content and services that are widely accessible by all in order that they (the librarians) can focus on re-establishing and/or maintaining the value of libraries in our society.</p></blockquote><p> Carl goes on to propose the creation of a utility that aggregates the ratings and rankings of individual users into a database that can enhance the relevance ranking of the emerging generation of discovery layer products.</p><p><h2>My Thoughts</h2><br />This &#8220;talk to librarians&#8221; thread through the two posts makes me reflect on a question I asked earlier on <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i>: <a href="http://dltj.org/article/oclc-social-contract/" title="What Does It Mean to Be a Member of OCLC? | Disruptive Library Technology Jester">&#8220;What Does it Mean to be a Member of OCLC?&#8221;</a> Although I probably haven&#8217;t talked to nearly the number of librarians as Chip and Carl, in my discussions within the profession I still haven&#8217;t come to a resolution to this basic question.  That question itself is tied to another question coming through in the contrast between these two posts:  What Do We Want From OCLC?</p><p>Carl describes the problem in his post.  When a not-for-profit vendor acquires a significant number of for-profit companies (and spins them back out again), how can we (members, vendors, and the library community in general) understand how the mix of commercial and non-commercial interests are playing out at the management level?  Can the OCLC that is the bibliographic utility, the metadata <a href="http://orweblog.oclc.org/archives/001611.html" title="Platforming a library network: destination and switch - Lorcan Dempsey's Weblog">switch</a> between bibliographic-based services, and the <acronym title="Research and Development">R&#038;D</acronym> braintrust co-exist with the for-profit businesses, motivations, and operations?  Or, to put it more sharply, does the negotiation of H.W. Wilson content for use on the subscription-based WorldCat database hinder the evolution of discovery layers that being developed by companies that don&#8217;t have the tax-advantaged not-for-profit status?  (And don&#8217;t forget about the allegations of anti-competitive behavior in the <a href="http://www.librarytechnology.org/web/breeding/skyriver-vs-oclc/" title="http://www.librarytechnology.org/web/breeding/skyriver-vs-oclc/">SkyRiver/Innovative-versus-OCLC lawsuit</a>.)</p><p>In closing this section, I want to pull out and emphasize another quotation from Carl&#8217;s post:<br /><blockquote>In the end, all of these business initiatives, and now resulting lawsuit, strongly work against OCLC being able to do what it does best—building collaboration, content, and related services as a non-profit entity to serve the larger profession.</p></blockquote><p> Agreed.</p><p><h2>Carl&#8217;s Grand Idea</h2><br />What might get lost if you only closely read the first half of Carl&#8217;s post &#8212; as it initially did for me &#8212; is the second half where he describes the concept for enhancing WorldCat in a manner that benefits all&#8230;both library members and commercial entities.  He does this by noting that the &#8220;valuable points of open source software&#8221; can be applied &#8212; in a social media fashion &#8212; to a service that aggregates usage, ratings, and comments in a way that advances relevance ranking of discovery tools.  Now initially the mind swirls with concerns of privacy and informed user consent in gathering this data in one central pool.  I don&#8217;t think we know enough yet in the library community about building privacy-robust systems that meet an American librarian&#8217;s information privacy ethos.  But done right it also has the ability to build a reputation-based social feedback loop that adds important new information to the bibliographic utility.  And because of its better-when-bigger characteristic, only a neutral party like the not-for-profit OCLC cooperative could serve as an aggregator and distributor of this data.</p><p>I highly recommend reading <a href="http://commentary.exlibrisgroup.com/2010/09/cooperative-we-need-open-collaborative.html" title="The cooperative we need: Open &amp; Collaborative Library Content" class="broken_link" rel="nofollow">Carl&#8217;s post</a> and thinking about ways of answering the question &#8220;What Do We Want From OCLC?&#8221;  I commend Carl for his courage and vision in articulating his points and proposing something new for the profession to drive towards.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/views-on-sharing/feed/</wfw:commentRss> <slash:comments>11</slash:comments> </item> <item><title>Early September Summary of the SkyRiver/Innovative vs. OCLC Case</title><link>http://dltj.org/article/skyoclc-september-2010/</link> <comments>http://dltj.org/article/skyoclc-september-2010/#comments</comments> <pubDate>Sat, 18 Sep 2010 22:12:16 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[Innovative Interfaces Inc.]]></category> <category><![CDATA[legal]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[SkyRiver]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1674</guid> <description><![CDATA[On September 9th, OCLC filed its first substantial response with the court to the antitrust lawsuit file by SkyRiver and Innovative Interfaces. And in a motion where OCLC requests a change of venue from the Northern District of California to &#8230; <a href="http://dltj.org/article/skyoclc-september-2010/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1674"></abbr><p>On September 9th, <a href="http://www.oclc.org/" title="OCLC homepage" rel="homepage">OCLC</a> filed its first substantial response with the court to the <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/1/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 1 - :: Justia Docs:">antitrust lawsuit</a> file by <a href="http://theskyriver.com/" title="SkyRiver Technology Solutions" rel="homepage">SkyRiver</a> and <a href="http://iii.com/" title="Innovative Interfaces Inc." rel="homepage">Innovative Interfaces</a>.  And in a motion where OCLC requests a change of venue from the <a href="http://www.cand.uscourts.gov/" title="United States District Court - Northern District of California" rel="homepage">Northern District of California</a> to the <a href="http://www.ohsd.uscourts.gov/" title="United States District Court - Southern District of Ohio" rel="homepage">Southern District of Ohio</a> &mdash; something seemingly mundane &mdash; they certainly pulled no punches:<br /><blockquote>Through a lengthy recitation of inaccurate facts, Plaintiffs allege six claims against OCLC. In short, Plaintiffs allege that OCLC, a forty-year old non-profit entity, is making it difficult for Innovative and its one-year old sister-company, SkyRiver, to compete and gain market share in the ILL, ILS, and the online cataloging library world. Through a variety of uncited references in their Complaint to &ldquo;prominent library-related internet blogs,&rdquo; unnamed commentators, and unattributed articles and reports, as well as through creating <a href="http://www.choiceforlibraries.com/" title="Choice for Libraries">an anti-OCLC website</a>, Plaintiffs have levied a propaganda war on OCLC simply because Plaintiffs have been unable to compete successfully with OCLC&rsquo;s membership base and bibliographic data which OCLC earned through forty years of dedicated service to its member libraries.<cite>SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Filing: 16. Page 4. <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/16/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 16 - :: Justia Docs:">Retrieved from Justia Docs</a> on 18-Sep-2010. (link added)</cite></p></blockquote><p> The question at hand seems to be a bit more than a propogada war between SkyRiver and OCLC.  But the court is not yet at the meat of the matter.</p><p>The text accompanying the motion for change of venue, though, does not deal with the issues raised in the lawsuit.  Instead, it requests the California court &#8220;transfer this action from this District to the Southern District of Ohio, Eastern Division, located in Columbus, Ohio.&#8221;<br /><blockquote>&ldquo;Litigation should proceed where the case finds its center of gravity.&rdquo;  The &ldquo;center of gravity&rdquo; is determined by the location of key witnesses and documents. Here, the &ldquo;center of gravity&rdquo; is plainly the Southern District of Ohio, for these reasons:</p><ul><li>OCLC&rsquo;s headquarters and virtually all of the key witnesses and documentary evidence are located in or near Central Ohio.</li><li>OCLC has a relatively small presence in California, as compared to its much larger and longer-established presence in Ohio.</li><li>OCLC made all decisions and actions operative to the allegations of Plaintiffs SkyRiver Technology Solutions, LLC (&ldquo;SkyRiver&rdquo;) and Innovative Interfaces, Inc. (&ldquo;Innovative&rdquo;) (collectively &ldquo;Plaintiffs&rdquo;) in Ohio.</li><li>The State of California does not have an interest in this lawsuit beyond the fact that Plaintiffs are residents of California, whereas the State of Ohio has a great interest in this lawsuit because Plaintiffs have alleged that one of Ohio&rsquo;s non-profit entities is abusing its non-profit status, an allegation that can impact other Ohio non-profit entities.</li></ul><p>For these and other reasons discussed in more detail below, all parties and the Court will be better served by transferring this case to the Southern District of Ohio, Eastern Division. In making this Motion, OCLC reserves any defenses that it may have against Plaintiffs&rsquo; claims.<cite>SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Filing: 16. Pages 1-2. <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/16/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 16 - :: Justia Docs:">Retrieved from Justia Docs</a> on 18-Sep-2010. (Legal citations removed from text.)</cite></p></blockquote><p>The last point is probably the most interesting to the layperson watching this epic battle unfold.  Pages 12 and 13 contain these statements:<br /><blockquote>The state of Ohio&rsquo;s interest in adjudicating this matter within its borders also militates towards transferring this case. OCLC&rsquo;s relevant policies and practices were developed and implemented in Ohio and the most important witnesses and evidence are located there.</p><p>Further, Plaintiffs have stated serious, albeit unfounded, allegations regarding OCLC&rsquo;s non-profit status.  Plaintiffs have stated that OCLC is &ldquo;abusing its status as a tax exempt, non-profit entity and unfairly competes with for-profit companies, such as Innovative and SkyRiver, by using its non-profit status as leverage to	monopolize the library services industry&#8230;.&rdquo;  These baseless allegations could create serious implications for other Ohio non-profit entities, and Ohio courts have a greater interest in litigating these issues. While Ohio courts also have more experience applying Ohio&rsquo;s laws, more importantly, they also have more experience with the routine customs and practices of non-profit entities in Ohio.	In addition, the Ohio Attorney General has oversight over Ohio non-profit entities and would likewise have an interest in this lawsuit.</p><p>In contrast, California does not have an interest in litigating this action because none of the operative actions occurred in California. Though Plaintiffs asserted California state law claims, as explained below, Ohio courts will be equally skilled at applying and interpreting those laws.<cite>SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Filing: 16. Pages 12-13. <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/16/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 16 - :: Justia Docs:">Retrieved from Justia Docs</a> on 18-Sep-2010. (Legal citations removed from text.)</cite></p></blockquote><p> The question of OCLC&#8217;s tax exempt status is one that bubbles up on occasion.  It would seem like OCLC&#8217;s legal team is willing to take this head on.</p><p>Accompaying the motion to transfer is a <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/17/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 17 - :: Justia Docs:">declaration by Bruce Crocco</a>, Vice President, Library Services for the Americas for OCLC, and this is a more interesting document.  It goes into the history of OCLC &mdash; its founding in 1967 on the Ohio State University campus, how OCLC revolutionized the production of paper cards for card catalogs and the movement into online catalogs, and the evolution of the WorldCat brand name.</p><p>Earlier this week, the judge in the case set this schedule for hearing from the parties on the motion:<br /><blockquote>This matter is set for a hearing on October 29, 2010 on Defendant OCLC Online Computer Library Center’s motion to transfer venue. The Court HEREBY ORDERS that an opposition to the motion shall be filed by no later than September 27, 2010 and a reply brief shall be filed by no later than October 4, 2010.</p><p>If the Court determines that the matter is suitable for resolution without oral argument, it will so advise the parties in advance of the hearing date. If the parties wish to modify this schedule, they may submit for the Court’s consideration a stipulation and proposed order demonstrating good cause for any modification requested.</p></blockquote><p>To refresh your memory, the lawsuit was filed on July 28, 2010, and was <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/8/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 8 -  :: Justia Docs">assigned</a> to <a href="http://en.wikipedia.org/wiki/Jeffrey_White" title="Jeffrey White - Wikipedia">Judge Jeffrey S. White</a> on August 6, 2010.  On August 12th, Judge White <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/9/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 9 -  :: Justia Docs">set a Case Management Conference</a> for January 14, 2011, and on August 13th lawyers for OCLC filed a <a href="http://docs.justia.com/cases/federal/district-courts/california/candce/3:2010cv03305/230152/10/" title="SkyRiver Technology Solutions, LLC et al v. OCLC Online Computer Library Center, Inc. Document 10 - :: Justia Docs:">notice</a> with their intent to request a change of venue.  Stay tuned for an update as the case moves on&#8230;</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/skyoclc-september-2010/feed/</wfw:commentRss> <slash:comments>10</slash:comments> </item> <item><title>Interesting Google Book Search Settlement Bits in Advance of Thursday&#8217;s Fairness Hearing</title><link>http://dltj.org/article/interesting-gbs-bits/</link> <comments>http://dltj.org/article/interesting-gbs-bits/#comments</comments> <pubDate>Tue, 16 Feb 2010 02:22:04 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[policy]]></category> <category><![CDATA[digitization]]></category> <category><![CDATA[Google Book Search]]></category> <category><![CDATA[Judge Denny Chin]]></category> <category><![CDATA[legal]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1529</guid> <description><![CDATA[Thursday will be a big day in the Google Book Search lawsuit settlement: the parties to the lawsuit, along with the objectors, supporters, and friends-of-the-court, will be in the courtroom of United States District Judge Denny Chin offering oral arguments &#8230; <a href="http://dltj.org/article/interesting-gbs-bits/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1529"></abbr><p>Thursday will be a big day in the Google Book Search lawsuit settlement:  the parties to the lawsuit, along with the objectors, supporters, and friends-of-the-court, will be in the courtroom of United States District Judge Denny Chin offering oral arguments in the final settlement/fairness hearing. <a href="http://docs.justia.com/cases/federal/district-courts/new-york/nysdce/1:2005cv08136/273913/930/0.html" title="The Author's Guild et al v. Google Inc. Document 930 - :: Justia Docs">In his order</a>, Judge Chin recognized 26 parties that will speak for up to five minutes each on their positions in the settlement (21 in opposition, 5 in favor).  The U.S. Department of Justice will also speak at the hearing.  But I think we&#8217;re all eagerly awaiting to hear what the judge himself will say about the settlement agreement.</p><p>In the lead-up to the hearing, Associate Professor <a href="http://james.grimmelmann.net/" title="James Grimmelmann homepage" rel="homepage">James Grimmelmann</a> at the New York Law School has continued <a href="http://laboratorium.net/" title="The Laboratorium">his efforts</a>, along with <a href="http://thepublicindex.org/about" title="About The Public Index">the students from the Institute for Information Law and Policy</a> at New York Law School, to make the documents and proceedings of the lawsuit accessible and understandable to non-lawyers.  In the most recent court filings leading up to Thursday&#8217;s hearing are some interesting nuggets.<br /><span id="more-1529"></span><br />In his <a href="http://laboratorium.net/archive/2010/02/15/gbs_a_little_on_the_fee_motion" title="The Laboratorium: GBS: A Little on the Fee Motion">posting</a> on the <a href="http://thepublicindex.org/docs/amended_settlement/Motion_for_fees.pdf" title="Notice of Motion and Motion for Approval of Attorneys' Fees and Reimbursement of Costs">motion for attorneys fees</a>, he notes that &#8220;counsel for the author sub-class are asking for the full $30 million in fees and reimbursement of their out-of-pocket costs.&#8221;  The filing contains information about the number of hours and the billing rate for some of the lawyers working on the case.  Some of the stuff is just really interesting, like <a href="http://thepublicindex.org/docs/amended_settlement/Dumain_Declaration.pdf" title="Declaration of Sanford P. Dumain in Support of Final Settlement Approval and Application of Counsel for the Author Sub-Class for Award of Fees and Reimbursement of Costs">one filing</a> that included everything from 18 hours by a partner of a firm (who is also a law professor at <acronym title="New York University">NYU</acronym>) at rate of $995/hour to an itemization of 51¢ for long distance calls by the firm related to the case.  Whew!</p><p>More interesting to <acronym title="Disruptive Library Technology Jester"><i>DLTJ</i></acronym> readers would be Grimmelmann&#8217;s <a href="http://laboratorium.net/archive/2010/02/15/gbs_some_highlights_of_dan_clancys_declaration" title="GBS: Some Highlights of Dan Clancy's Declaration">highlights</a> of <a href="http://thepublicindex.org/docs/amended_settlement/dan_clancy_declaration.pdf" title="Declaration of Daniel Clancy in Support of Motion for Final Approval of Amended Settlement Agreement">Dan Clancy&#8217;s declaration</a> in support of the agreement. <a href="http://www.computerhistory.org/events/index.php?spkid=0&amp;ssid=1246406058" title="Dan Clancy - Computer History Museum - Events">Dan Clancy</a> is engineering director of the Google Book Search project, so he has a unique insight into the inner workings.  Grimmlemann notes that Clancy states:<ul type="square"><li>To date, Google has Digitized over twelve million books, and intends to continue Digitizing books in the future.</li><li>Google has received metadata from 48 libraries.</li><li>Google pays approximately $2.5 million per year to license metadata from 21 commercial databases of information about books.</li><li>Google has gathered 3.27 billion records about Books, and analyzed them to identify more than 174 million unique works.</li></ul><p>The third bullet is interesting in that I think we can eliminate one of the &#8220;commercial databases&#8221; from the list.  I can&#8217;t find it in my notes from ALA Midwinter, but I seem to recall hearing <a href="http://www.oclc.org/about/trustees/members/jay_jordan.htm" title="Jay Jordan [OCLC - 2009-2010 Board members]">Jay Jordan</a> (<a href="http://www.linkedin.com/pub/jay-jordan/0/495/86" title="Jay Jordan - LinkedIn">OCLC President</a>) say something along the lines that OCLC was not receiving a monetary return from the sharing of bibliographic data with Google; the value OCLC gets for its membership comes from the links back to WorldCat from Google services.  If I got this wrong, I hope someone from OCLC will call me out on it.</p><p>The last bullet is interesting, too:  Google has identifying 174 million works in analyzing all of the sources of data coming into it.  I tried to find some numbers in the descriptions of WorldCat to compare that to, but didn&#8217;t have any luck this evening.  (There isn&#8217;t anything about statistics available on <a href="http://www.worldcat.org/" title="WorldCat Homepage" rel="homepage">http://worldcat.org/</a>?)</p><p>To Grimmelmann&#8217;s highlights I would add this statement that seems strangely out-of-place.</p><ul type="square"><li>Google has no interest in censorship. Indeed, Google&#8217;s mission is to organize the world&#8217;s information and make it universally accessible and useful.</li></ul><p>Has anyone brought censorship into the discussion yet?  Privacy for sure, but censorship?</p><p>Also:<ul type="square"><li>Google has developed algorithms to compare these numerous sources of metadata and identify the most accurate data about each book.</li></ul><p>They certainly seem to have invested a lot of effort in this area.  More info can be found in <a href="http://dltj.org/article/mashups-of-bib-data/">my summary of Kurt Groetsch&#8217;s presentation at ALA Midwinter 2010</a>.</p><div class='series_links'><a href='http://dltj.org/article/revised-gbs-settlement/' title='Revised Google Book Search Settlement from a Library Perspective'>Previous in series</a> <a href='http://dltj.org/article/gbs-settlement-rejected/' title='Google Book Search Settlement Rejected'>Next in series</a></div>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/interesting-gbs-bits/feed/</wfw:commentRss> <slash:comments>9</slash:comments> </item> <item><title>Mashups of Bibliographic Data: A Report of the ALCTS Midwinter Forum</title><link>http://dltj.org/article/mashups-of-bib-data/</link> <comments>http://dltj.org/article/mashups-of-bib-data/#comments</comments> <pubDate>Wed, 27 Jan 2010 21:14:52 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2010]]></category> <category><![CDATA[Association for Library Collections and Technical Services]]></category> <category><![CDATA[Dewey Decimal Classification]]></category> <category><![CDATA[Google Book Search]]></category> <category><![CDATA[Internet Archive]]></category> <category><![CDATA[MARC]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[onix]]></category> <category><![CDATA[Open Library]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1478</guid> <description><![CDATA[This year the ALCTS Forum at ALA Midwinter brought together three perspectives on massaging bibliographic data of various sorts in ways that use MARC, but where MARC is not the end goal. What do you get when you swirl MARC, &#8230; <a href="http://dltj.org/article/mashups-of-bib-data/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1478"></abbr><p>This year the <a href="http://connect.ala.org/node/91406" title="ALCTS Forum: Mix and Match: Mashups of Bibliographic Data | ALA Connect"><acronym title="Association for Library Collections and Technical Services">ALCTS</acronym> Forum at <acronym title="American Library Association">ALA</acronym> Midwinter</a> brought together three perspectives on massaging bibliographic data of various sorts in ways that <em>use</em> <acronym title="Machine Readable Cataloging">MARC</acronym>, but where MARC is not the end goal.  What do you get when you swirl MARC, <acronym title="ONline Information eXchange">ONIX</acronym>, and various other formats of metadata in a big pot?  Three projects:  ONIX Enrichment at OCLC, the Open Library Project, and Google Book Search metadata.<br /><span id="more-1478"></span><br />Below is a summary of how these three projects are messin&#8217; with metadata, as told by the Forum panelists.  I also recommend reading Eric Hellman&#8217;s <a href="http://go-to-hellman.blogspot.com/2010/01/google-exposes-book-metadata-privates.html" title="Google Exposes Book Metadata Privates at ALA Forum | Go-to-Hellman">Google Exposes Book Metadata Privates at ALA Forum</a> for his recollection and views of the same meeting.</p><p><h2 id="post-1478-h2-OCLC-ONIX">ONIX Enrichment at OCLC</h2></p><p><span class="removed_link" title="http://www.oclc.org/speakers/bios/register_renee.htm">Renee Register</span>, Global Product Manager for OCLC Cataloging and Metadata Services, was the first to present on the panel.  Her talk looked at a new and evolving product at OCLC on the enhancement of ONIX records with WorldCat records, and vice versa. <sup><a href="http://dltj.org/article/mashups-of-bib-data/#footnote_0_1478" id="identifier_0_1478" class="footnote-link footnote-identifier-link" title="For those not familiar with ONIX, it is a suite of standards promulgated by EDItEUR for the interchange of information on books and serial publications.  It is primarily used as the communication channel between the publishing industry through distribution chains to retail establishments.">1</a></sup></p><p>As libraries, Renee said &#8220;our instincts are collaborative&#8221; but &#8220;our data and workflow silos encourage redundancy and inhibit interoperability.&#8221;  Beyond the obvious differences in metadata formats, the workflows of libraries differ dramatically from other metadata providers and consumers. In libraries (with the exception of <acronym title="Cataloging in Print">CIP</acronym> and brief on-order records) the major work of bibliographic production is performed at the end of the publication cycle and ends with the receipt of the published item.  In the publisher supply chain, bibliographic data evolves over time, usually beginning months before publication and continuing to grow for months and years (sales information, etc.) after publication.  Renee had a graphic showing the current flow of metadata around the broader bibliographic universe that highlighted the isolation of library activity relative to publisher, wholesaler, and retailer activity.</p><p><div id="attachment_1484" class="wp-caption alignright" style="width: 310px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://www5.oclc.org/downloads/presentations/MDS4Pubs_August_Webinar_200908.ppt" title="Slides from Publisher Supply Chain Webinar, August 2009"><img src="http://cdn.dltj.org/wp-content/uploads/2010/01/ONIX-enhancement-300x225.jpg" alt="" title="Diagram of the Process of Enhancing ONIX Records" width="300" height="225" class="size-medium wp-image-1484" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Diagram of the Process of Enhancing ONIX Records, from OCLC Services for the Publisher Supply Chain Webinar, August 2009</p></div>Renee when on to describe a &#8220;next generation cataloging data flow&#8221; where OCLC facilitates the inclusion of publisher data into <a href="http://www.worldcat.org/" title="WorldCat homepage" rel="homepage">WorldCat</a> and enhances publisher data with information extracted from WorldCat.  To the right is a version of the graphic she used at Midwinter taken from an earlier presentation on the same topic.  It show ONIX-formatted metadata coming into WorldCat, being cross-walked and matched with existing MARC data in WorldCat, and finally extracted and cross-walked back to ONIX resulting in <a href="http://publishers.oclc.org/en/metadata/default.htm" title="OCLC Metadata Services for Publishers"> enhanced ONIX metadata</a> for publishers to use in their supply chain.  If there is an exact match for the incoming ONIX record in WorldCat, the WorldCat record is enhanced with certain fields from the ONIX record (descriptions, author biographies, web links) &#8212; being careful not to override authority work being done by libraries, but adding enhancements that libraries may not otherwise input.  In turn, enhancements from exact match record and FRBR work set records (hardcover versus softcover versus audiobook, etc.) are added to the ONIX record (non-English subject headings, adding a Dewey Decimal Classification (DDC) field from another similar record if one doesn&#8217;t already exist, change the author field to an authority-controlled version).  If there is not an exact match for the ONIX record in WorldCat, a new WorldCat record is built from the ONIX record and it is subsequently enhanced by metadata found in the FRBR work set records.  In doing so, we are &#8220;increasing the goodness of metadata in the marketplace,&#8221; as Renee put it in her presentation.  OCLC is also creating a mapping between <a href="http://www.bisg.org/what-we-do-20-73-bisac-subject-headings-2009-edition.php" title="Standards &amp; Best Practices | Classification Schemes | BISAC Subject Headings 2009 Edition | Book Industry Study Group">BISAC Subject Headings</a><sup><a href="http://dltj.org/article/mashups-of-bib-data/#footnote_1_1478" id="identifier_1_1478" class="footnote-link footnote-identifier-link" title="By the way, it seems like BISAC is an acronym for &amp;#8220;Book Industry Systems Advisory Committee&amp;#8221;, the former name of the Book Industry Study Group.">2</a></sup> and the DDC system.  This allows the enhancement of ONIX with suggestions of BISAC Subject Terms and the enhancement of WorldCat records with generic DDC fields given an incoming BISAC Subject Term value from the ONIX record.</p><p>In her experience, Renee said that libraries need ways to enable our metadata to evolve over time and allow for publisher-created metadata to merge effectively with library-created metadata.  The bibliographic record needs to be a &#8220;living, growing&#8221; thing throughout the lifecycle of a title and beyond.  In concluding her remarks, she offered several resources to explore for further information:  the OCLC/NISO study on <a href="http://www.niso.org/publications/white_papers/StreamlineBookMetadataWorkflowWhitePaper.pdf" title="Streamlining Book Metadata Workflow">Streamlining Book Metadata Workflow</a>, the U.K. Research Information Network report on <a href="http://rin.ac.uk/creating-catalogues" title="Creating Catalogues: Bibliographic Records in a Networked World">Creating Catalogues: Bibliographic Records in a Networked World</a>, the Library of Congress <a href="http://www.loc.gov/bibliographic-future/news/" title="News, Press Releases and Reports - Working Group on the Future of Bibliographic Control (Library of Congress)">Study of the North American MARC Records Marketplace</a>, the Library of Congress <a href="http://cip.loc.gov/onixpro.html" title="LC ONIX Pilot Project" class="broken_link" rel="nofollow">CIP/ONIX Pilot Project</a>, and the <a href="http://publishers.oclc.org/en/default.htm" title="OCLC Publisher Supply Chain Website">OCLC Publisher Supply Chain Website</a>.</p><p><h2 id="post-1478-h2-Open-Library">From MARC to Wiki with Open Library</h2><br />The second presenter on the panel was <a href="http://kcoyle.net/" rel="homepage" title="Karen Coyle's home page">Karen Coyle</a>, talking about the mashup of metadata at the <a href="http://openlibrary.org/" title="Open Library project homepage" rel="homepage">Open Library</a> project at the <a href="http://archive.org/" title="Internet Archive homepage" rel="homepage">Internet Archive</a>.  The slides from her presentation are <a href="http://kcoyle.net/presentations/ol_boston.pdf" title="Open Library - Mix and Match Metadata presentation slides [PDF]">available from her website</a>.</p><p>Karen said right at the start that the Open Library project is different from most of what happens in libraries &#8212; it is &#8220;someone outside the library world making use of library data&#8221; &#8212; although the goal is arguably the same as others &#8212; &#8220;<a href="http://openlibrary.org/about" title="About Us (Open Library)">One web page for every book ever published</a>.&#8221;  As such, the Open Library isn&#8217;t a library catalog as librarians think of it in that it is not a representation of a libraries inventory. It has metadata for every book it can know about and a pointer to places where the book can be found, including all of the electronic books in Internet Archive (<a href="http://www.opencontentalliance.org/" rel="homepage" title="Open Content Alliance (OCA)">Open Content Alliance</a>, Google Public Domain, etc.) as well as pointers back to OCLC WorldCat.  Karen&#8217;s role for the project is that of &#8220;Library Data Informant.&#8221; The Internet Archive decided that they needed someone who understood library data in order to try to use it.  From Karen&#8217;s perspective, she is trying to be a resource for project but not give them any guidance on how to implement the service.  She is curious to see what the project would do when bibliographic data is viewed from a non-librarian perspective.  If they have questions, or if they have assumptions about data that are wrong, then she intervenes.</p><p>Karen went on to briefly describe the Open Library system.  Open Library doesn&#8217;t have records; rather, it has field types and data properties.  In this way, it uses semantic web concepts.  &#8220;Author&#8221; is a type, &#8220;Author birthdate&#8221; is another type, and so forth.  There are no set field types, so if the project gets data from source for which a type doesn&#8217;t yet exist, it can create a new one.  Each type can have data properties such as string, boolean, text, link, etc.  Nothing is required and everything is repeatable.  Everything &#8212; types, properties, and values &#8212; gets a <acronym title="Uniform Resource Identifier">URI</acronym> (a URI is an identifier like a URL, but conceptually a superset of the universe of URLs).  Titles, authors, subjects, author birthdates, and so on have URIs.  Lastly, the underlying data structures are based on wiki principles: all edits are saved and viewable, anyone can edit any value, anyone can add new types or properties, anyone can develop their own displays, etc.</p><p>The data that is now in Open Library came from a variety of sources.  They started with a copy of books from the Library of Congress, and continue to receive the weekly updates. They performed a crawl of Amazon&#8217;s book data.  They have gotten some from publishers, libraries, and individual users.  The last is perhaps the most interesting because it is mainly people outside the western world who are otherwise having trouble getting their works recognized.</p><p><h3 id="post-1478-h3-Problems-Issues">Problems, Issues, Challenges, and Opportunities with the Data</h3><br />People who use library data without the biases or assumptions of librarians come up with interesting ways to view the data.  Karen described a few of them.</p><dl class="inlineClass"><dt>Names -</dt><dd>&#8220;These library forms of names? Honestly no one but us can stand them.&#8221;  Even something as simple as the form of last-name-comma-first-name is troublesome.  No one else uses this form of the name: Amazon, Wikipedia, etc.  In processing these, any information between parenthesis has been deleted, birth and death dates move into separate field types.</dd><dt>Titles -</dt><dd>In working with the Open Library developers, this is one place that Karen tried insisting on applying a library practice:  knowing the initial article.  For us, this is important for sorting books in alphabetical order.  The developer response &#8212; why do we have to sort in alphabetical order?  &#8220;Where else but library catalogs to we see things sorted in alphabetical order?  Not in Google, not in Amazon, not anywhere.  Alphabetical order is not in the mindset anymore.&#8221;  They also found that the title might include extraneous data.  Amazon, for instance, appends the series title in parenthesis to the main title.  This is a demonstration of how other communities are not as concerned about strongly typing and separating information into fields. Amazon, of course, has reasons for series information into the main title: it helps sell books.</dd><dt>Product dimensions -</dt><dd>Publishers and distributors need to know characteristics of an item such as height, width, depth, and weight; they, of course, need to put it in a box and ship it.  Libraries, concerned about placing the item on the shelf, record just height.  Recording pagination is different, too: libraries use odd notations &#8220;ill. (some col)&#8221; and &#8220;xv, 200p.&#8221; versus simply &#8220;200 pages.&#8221;</dd><dt>Birthdates -</dt><dd>Librarians use birthdates to distinguish names; if there is no need to distinguish a name, birth and death dates are not added.  Someone looking at this from the outside would ask &#8216;Why don&#8217;t all authors have birth and death dates?&#8217;  This can be useful information for viewing the context of an item, not just to distinguish author names.  Open Library ran author names against Wikipedia to pick up not only birth and death years, but also the actual dates.</dd><dt>Subject headings -</dt><dd>Open Library using Library of Congress Subject Headings was out of the question. In processing the data, the Open Library developers just broke them apart into segments and used them. But because they were able to do data mining on the subject field types, they did find statistical relationships between the disassembled precoordinated headings and were able to present those to the user.</dd><dt>The View of the Data -</dt><dd>Rather than a traditional library view of long lists of author-title, the Open Library (in its next version coming in February) will have several different views into the mass of data: Authors; Books (what we would call <acronym title="Functional Requirements for Bibliographic Records">FRBR</acronym> &#8216;manifestations&#8217;); Works; Subjects; and eventually places, publishers, etc.  For example, when searching for an author one would get the author page.  On it would be all of the works from the author as well as other biographical information.  It looks similar to a WorldCat identities page, except it is the actual user interface built into the system.  Similarly, every work will have a page, and at the bottom of it one will see all of the editions of the work.  Also, each subject will have a page, and one will see a list of works with that subject as well as authors who write on that subject.  As Karen said, &#8220;The subject itself becomes an object of interest in the database, not just something that is just tacked on to the bottom of the library record.&#8221;</dd><dt>Data mining -</dt><dd>With the data in this format, it is possible to perform data mining actions against it. For instance, simple data mining such as country of publication, popular places that appear, etc.  When they had the problem of author names &#8212; knowing when to reverse surname and forname &#8212; they ran the names against Amazon and Wikipedia and retained the ones where they found the order of the entry was the same. The Open Library developers are also experimenting with data mining to find publisher names.  Publisher names, of course, vary dramatically, but by using ISBN prefixes they can pull together related items into a &#8220;publisher&#8221; view.</dd></dl><p>Karen suggested watching the <a href="http://edwardbetts.com/ol/" title="Index of /ol">Edward Betts&#8217;s site</a>, one of the developers of the Open Library project with an eye on the data mining aspects.  She said it is fun to look at our data when it can be viewed from this different point-of-view.  She also said to watch out for a new version of the <a href="http://openlibrary.org/" title="Open Library (Open Library)">Open Library website</a> coming in February.</p><p><h2 id="post-1478-h2-Google-Book-Search-Metadata">Google Book Search Metadata</h2><br />The final presenter was <a href="http://www.google.com/profiles/kurt.groetsch" title="Kurt Groetsch's Google Profile">Kurt Groetsch</a>, Technical Collections Specialist at Google where he works to provide understanding and insight into library partner collections and the digitized books from Google.  Kurt said that &#8220;Google has been fairly circumspect over the years about what we do on the Book Search project.&#8221;  He said it was a bit of a cultural legacy from the rest of the company and also possibly an artifact of the copyright litigation, but he is hoping to change that.  His presentation looked at how Google works with book metadata from three vantage points &#8212; the inputs into Google&#8217;s system, parsing by Google&#8217;s algorithms, and analysis and output into the public interfaces.</p><p>On the input side, Google is getting bibliographic metadata from over 100 sources in a variety of formats. MARC records are coming from libraries, union catalogs, commercial providers (OCLC), publishers/retails (one publisher supplies records in MARC format).  Google also gets ONIX records from commercial providers (such as Ingram and Bowker), publishers, and retailers.  Google is especially interested in data from non-U.S. retailers because it is a source of information about books published outside the United States; it helps facilitate discovery of items that they may not otherwise encounter in the <a href="https://books.google.com/partner/">publisher</a> and <a href="http://www.google.com/googlebooks/library.html" title="Google Books Library Project">library</a> programs.  Google also receives records in a variety of &#8220;idiosyncratic formats&#8221; &#8212; for example, publisher-contributed metadata (via the Publisher Partner Program); information associating books with jacket images; name authority records (from LC); reviews; popularity signals (sales data as well as <a name="anonymized_circulation_data">anonymized circulation data</a> from some library partners, useful for feeding into the relevancy ranking algorithm); and internally-generated metadata (for instance, whether a book is commercially available or not).  Google processes all of this information to come up with a single record that describes a book.  At this point they have over 800 million bibliographic records and one trillion bits of information in those records.</p><p>All of these records from all of these sources are processed and remixed with Google&#8217;s parsing algorithms about twice a week.  The first step is to transform the incoming records into a &#8220;less verbose format&#8221; for storage and processing.  It is a SQL-like structure that allows elements of the metadata to be queried.  Records are then parsed to extract specific bits of information, transform the bits as necessary, and write the information to an internal &#8220;resolved records&#8221; data structure (a subset of the data coming from the input formats).  In the presentation, Kurt had examples of how making inferences from data coming from both MARC and ONIX can be troublesome.  Parsing also involves extracting &#8220;bibkeys&#8221; from the records to aid in matching across sources of data.  Four types of identifiers are extracted from bibliographic records: OCLC numbers, <acronym title="Library of Congress Control Numbers">LCCN</acronym>s, ISBNs, and ISSNs.  They provide usually useful signals when matching bibliographic and help with assertions that two records describe the same manifestation.  Google also tries to parse item data when present in records representing multi-volume works, enumeration and chronology.  They will also treat barcode as a form of a &#8220;bibkey&#8221; if they get it from a library.  The parsing algorithm will also split records containing multiple ISBNs representing different product forms (e.g. hardback, paperback, etc.).</p><p>With all of this data parsed into records, Google starts its clustering process where records are examined and attached to each other.  Bibkeys provide significant evidence for relating records to each other, but bibkeys are not always present in a record (non-U.S. records and older records frequently contain no bibkeys).  The algorithms then fall back on text similarity matching using title, subtitle, contributor and other fields such as publisher and publication year.  The results are clusters of records representing the same manifestation. An algorithm then attempts to derive the &#8220;best-of&#8221; record for a single cluster from all of the parsed input records.  This is done in a field-by-field voting process based on the trustworthiness of individual fields from record sources.</p><p>Kurt went into some of the challenges facing the team building the clustering and best-of record creation algorithms.  For instance, in dealing with multivolume works they know of 5 numbering schemas with 3 number types in 15 different languages.  Enumeration is now showing in the public display, but the development team is still working with unparsable item data due to inconsistent cataloging practices between institutions&#8230;and sometimes inconsistencies within an institution.  Another problem is non-unique identifiers. In the current data set ISBN 7899964709 is shared by 75 books and ISBN 7533305353 is associated with 1413 books. There are also poor quality or &#8220;junk records&#8221;.  Kurt said his favorite was &#8220;The Mosaic Navigator&#8221; by Sigmund Freud published in 1939.  These are hard to identify with an algorithm, and they rely on reports of problems that enable the developers to go in and &#8220;kill&#8221; the troublesome record.  Another example is a book by Virginia Woolf where the incoming record had conflicting information; it had two 260 fields that contained different dates (1961, correct, and 1900) with fixed field information that strongly suggested that 1900 was the single date of publication.  When the data problem is systematic, they can identify it and compensate for it.  Kurt&#8217;s example for this case was &#8220;The United States Since 1945&#8243; published in 1899.  This one was highlighted in <a href="http://chronicle.com/article/Googles-Book-Search-A/48245/" title="Google's Book Search: A Disaster for Scholars - The Chronicle Review - The Chronicle of Higher Education">Geoffrey Nunberg&#8217;s criticism of Google Books metadata</a>.  In this case, there was a source of metadata from Brazil that when they didn&#8217;t know the date of publication would use 1899.  When Google went back and looked at the date distribution of books there was a huge spike in 1899.  Once Google knew about it they were able to go in and kill that information from that source of records. <sup><a href="http://dltj.org/article/mashups-of-bib-data/#footnote_2_1478" id="identifier_2_1478" class="footnote-link footnote-identifier-link" title="A side note: Google isn&amp;#8217;t the only one tripped up by this.  If one searches for the ISBN of the item, 0195038487, you get to more than one site that has the same incorrect publication date.  At least Google is attempting to clean up the data!">3</a></sup></p><p>In closing, Kurt said that Google is committed to engaging with the library community on improving metadata and metadata processing.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://www.niso.org/publications/white_papers/Stream lineBookMetadataWorkflowWhitePaper.pdf to http://www.niso.org/publications/white_papers/StreamlineBookMetadataWorkflowWhitePaper.pdf on January 19th, 2011.</p><p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://www.oclc.org/speakers/bios/register_renee.htm on February 11th, 2011.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_1478" class="footnote">For those not familiar with <a href="http://www.editeur.org/8/ONIX/" title="ONIX Overview">ONIX</a>, it is a suite of standards promulgated by <a href="http://www.editeur.org/" title="EDItEUR homepage" rel="homepage">EDItEUR</a> for the interchange of information on books and serial publications.  It is primarily used as the communication channel between the publishing industry through distribution chains to retail establishments.</li><li id="footnote_1_1478" class="footnote">By the way, it seems like BISAC is an acronym for &#8220;Book Industry Systems Advisory Committee&#8221;, the former name of the <a href="http://www.bisg.org/" title="Book Industry Study Group homepage" rel="homepage">Book Industry Study Group</a>.</li><li id="footnote_2_1478" class="footnote">A side note: Google isn&#8217;t the only one tripped up by this.  If one searches for the ISBN of the item, 0195038487, you get to <a href="http://www.biggerbooks.com/book/9780195038484" title="The United States Since 1945 at BiggerBooks.com -  Leuchtenburg, 9780195038484, History">more</a> <a href="http://www.chegg.com/details/the-united-states-since-1945/0195038487/" title="Chegg.com: The United States Since 1945 by Leuchtenburg">than</a> <a href="http://www.amazon.co.uk/The-United-States-Since-1945/dp/0195038487" title="The United States Since 1945: Amazon.co.uk: Books">one</a> site that has the same incorrect publication date.  At least Google is attempting to clean up the data!</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/mashups-of-bib-data/feed/</wfw:commentRss> <slash:comments>23</slash:comments> </item> <item><title>More on What Does It Mean to Be a Member of OCLC</title><link>http://dltj.org/article/oclc-social-contract-2/</link> <comments>http://dltj.org/article/oclc-social-contract-2/#comments</comments> <pubDate>Mon, 18 Jan 2010 15:35:45 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[L/IS Profession]]></category> <category><![CDATA[ALA Midwinter Conference 2010]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1467</guid> <description><![CDATA[Jay Jordan&#8217;s remarks during the OCLC Update Breakfast and the discussion at the Developers Network table at that breakfast generated further fuel for my previous philosophical thoughts on &#8220;Who is a member of the OCLC Cooperative?&#8221; In the context of &#8230; <a href="http://dltj.org/article/oclc-social-contract-2/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1467"></abbr><p>Jay Jordan&#8217;s remarks during the <a href="http://dltj.org/article/alamw10-oclc-update/">OCLC Update Breakfast</a> and the discussion at the <a href="http://www.worldcat.org/devnet/wiki/Main_Page" title="Main Page - WorldCat Developers' Network">Developers Network</a> table at that breakfast generated further fuel for my previous philosophical thoughts on &#8220;<a href="http://dltj.org/article/oclc-social-contract/">Who is a member of the OCLC Cooperative?</a>&#8221;  In the context of things like Developer Network API keys<sup><a href="http://dltj.org/article/oclc-social-contract-2/#footnote_0_1467" id="identifier_0_1467" class="footnote-link footnote-identifier-link" title="&amp;#8220;API&amp;#8221; is an acronym for Application Programming Interface.  In summary, an API is the set of rules by which one program can task another program for data or to perform a service.  An &amp;#8220;API key&amp;#8221; is the mechanism through which a requesting application establishes the right to be able to use data and services of another application. In this context, the holder of an OCLC Developer Network API key can access the wealth of data and services being offered by OCLC.">1</a></sup> this question of who is a member of OCLC the cooperative and who is not meets the on-or-off, ones-and-zeros nature of computers.  One can&#8217;t &#8220;kinda&#8221; have an API Key unless that capability is programmed into the software (or a human chooses to override the established rules for who has a key).</p><p>The discussion around the table after Jay&#8217;s remarks tested some of the &#8220;edge cases&#8221; to the established hard-and fast rules.  OCLC Governance, at least as I understand it, surrounds institutions who are members of the OCLC Cooperative.  By implication, the benefits (and also the responsibilities) of membership in the OCLC Cooperative transfer to staff employed at those member institutions.  People who work for governing (institutional) members of OCLC can get Developer Network API keys to create services for their library.  And except for the fuzzy notion explored previously whether individuals are members of the OCLC cooperative, this all seems pretty clear.</p><p>But what about individuals in the employment of a library that is not a member of OCLC that want to create applications that benefit library users.  Further, following Jay Jordan&#8217;s definition of partners OCLC wants to work with (as expressed during the <a href="http://dltj.org/article/alamw10-record-use-policy/">OCLC Record Use Policy Council Update</a>), that non-institutional-member individual can return value back to the cooperative (by contributing the ideas to the Developer&#8217;s Network Showcase, by contributing working code to a software repository, etc.).  Isn&#8217;t that returning value to the cooperative? Should this person be granted a Developer Network API key even though their institution is not a member?  Can we conceive of a process and guidelines by which this could happen?  (Has it perhaps already happened in the human-created fuzz of OCLC-the-stewards overriding the established rules?)</p><p>I&#8217;ll state here that I would support the creating of a process by which anyone &#8212; from a member library or a consultant doing work on behalf of a member library or a 12-year-old kid) can request a Developer Network API key. What is needed is a clear understanding that those who get such keys gain some kind of adjunct membership in the OCLC-the-cooperative, and as so are expected to return value to the cooperative through their work with the API key.  (There would need to be transparency from OCLC-the-steward on who in this category is getting keys, who is not, and for what reasons &#8212; at least in summary.)  The area of consultants or other agents doing work on behalf of a library is an area that would need further exploration.  I would offer that it is okay for one such agent to use an API key to help a particular client, but I also agree with Jay that it is not okay for that same agent to use that key to derive revenue from helping non-OCLC members with the same key.  The latter fails the renumeration-back-to-the-cooperative  test (either financial or intangible benefit) that Jay remarked on during <a href="http://dltj.org/article/alamw10-record-use-policy/">Saturday&#8217;s Record Use Council session</a> and one that the Record Use Council seems to be struggling with.</p><p>These are tough but interesting issues.  We&#8217;ve left the days where the benefits of membership were tied to the delivery of shelf-ready cards and dedicated terminals for online original and copy cataloging.  OCLC seems to be transforming itself from a library-used service to a world-used service.  I give OCLC-the-stewards credit for listening to OCLC-the-membership about appropriate ways to make use of the shared resource that is WorldCat, and credit to OCLC-the-membership for pushing OCLC-the-stewards into creating ways to expand access to that shared resource.<p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://dltj.org/article/alamw10-oclc-record-use to http://dltj.org/article/alamw10-record-use-policy/ on December 31st, 2010.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://dltj.org/article/alamw10-oclc-record-use to http://dltj.org/article/alamw10-record-use-policy/ on December 31st, 2010.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_1467" class="footnote">&#8220;API&#8221; is an acronym for <a href="http://en.wikipedia.org/wiki/Application_programming_interface" title="Application programming interface - Wikipedia, the free encyclopedia">Application Programming Interface</a>.  In summary, an API is the set of rules by which one program can task another program for data or to perform a service.  An &#8220;API key&#8221; is the mechanism through which a requesting application establishes the right to be able to use data and services of another application. In this context, the holder of an <a>OCLC Developer Network API key</a> can access the wealth of data and services being offered by OCLC.</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/oclc-social-contract-2/feed/</wfw:commentRss> <slash:comments>3</slash:comments> </item> <item><title>Interesting Bits from the OCLC Update Breakfast</title><link>http://dltj.org/article/alamw10-oclc-update/</link> <comments>http://dltj.org/article/alamw10-oclc-update/#comments</comments> <pubDate>Mon, 18 Jan 2010 13:10:46 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2010]]></category> <category><![CDATA[HathiTrust]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1458</guid> <description><![CDATA[I think it is a statistical anomaly that many of the meetings I attended during ALA Midwinter were somehow related to OCLC. That statistical anomaly has certainly played out in postings here on DLTJ of my impressions of Midwinter meetings. &#8230; <a href="http://dltj.org/article/alamw10-oclc-update/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1458"></abbr><p>I think it is a statistical anomaly that many of the meetings I attended during ALA Midwinter were somehow related to OCLC.  That statistical anomaly has certainly played out in postings here on <acronym title="Disruptive Library Technology Jester"><i>DLTJ</i></acronym> of my impressions of Midwinter meetings.  Continuing with this thread of OCLC events, I attended the OCLC Update Breakfast Sunday morning for a membership-dues-paid croissant and orange juice, and to listen to Jay Jordon&#8217;s biannual update on the past, present and future of OCLC.  What follows are highlights that I found interesting in the course of his remarks, but certainly not a comprehensive report of what was said.  Video of Jay&#8217;s remarks where recorded and are to be posted at some point on the OCLC website (roughly six to eight weeks from now, if my memory of past events can be any guide).</p><p><h2>WorldCat Growth since 1998</h2><br />When Jay started in 1998 there were 39 million records in WorldCat.  At the start of this year, there were 170 million records representing 1.5 billion holding statements.  When I heard counts of the number of records in WorldCat, I&#8217;ve wondered if they were inclusive of all of the non-monograph activities happening in WorldCat, and as it happens it is not.  The slides showed that there are an additional 325 million electronic database records representing licensed digital content (including 4.5 million records of JSTOR items that were recently added).</p><p><h2>New &#8220;Search Engines&#8221;</h2><br />Jay set the stage for his remarks by talking about what is happening with information searching beyond the library community. &#8220;Google is king,&#8221; he remarks &#8220;but there are new launches&#8221; of systems that produce fewer but more highly relevant results. Microsoft&#8217;s <a href="http://bing.com/" title="Bing Homepage">Bing</a> and <a href="http://wolframalpha.com/" title="Wolfram|Alpha Homepage">Wolfram|Alpha</a> are probably well known, but he also mentioned &#8220;<a href="http://www.hakia.com/" title="hakia">hakia</a>&#8221; &#8212; known for indexing just selected content on the web and presenting search results &#8220;galleries&#8221; in a tabbed form &#8212; and &#8220;<a href="http://www.yebol.com/" title="Yebol.com">yebol</a>&#8221; &#8212; a knowledge-based semantic engine.  He brought it home to the cooperative&#8217;s community, though, with the description of the planning stages of &#8220;Reference Extract&#8221; &#8212; a grant-funded effort of Syracuse Univ, the Univ of Washington, and OCLC to create a search engine based on the citations and recommendations of reference librarians.</p><p><h2>OCLC Services in the Cloud</h2><br />Jay then reflected on how the current exploration of &#8220;cloud computing&#8221; elsewhere has threads &#8212; for our community &#8212; all the way back to Fred Kilgour&#8217;s vision for library services.  Portions of the <a href="http://www.oclc.org/productworks/webscale.htm" title="Web-scale Management Services">WorldCat web-scale management services</a>, where one relocates aspects of the technology supporting back-room library operations into a service provided by OCLC, continued development. A number of institutions &#8212; the CPC Regional Libraries in North Carolina, the Idaho Commission for Libraries, the Orbis Cascade Alliance and Linfield College Libraries, and Pepperdine University &#8212; are now testing the circulation component of this suite of back-room services.  Jay also remarked on the deployment of an <a href="http://www.worldcat.org/blogs/archives/2009/12/worldcat-in-redlaser-iphone-ap.htm" title="WorldCat in RedLaser iPhone App - WorldCat Blog">application</a> for iPhones and Droid smartphones that enables a user to scan the <acronym title="Universal Product Code">UPC</acronym> barcode on the back of any book and be directed to holdings information at a home library or at a library closest to the user&#8217;s location. <a href="http://www.oclc.org/navigator/" title="WorldCat Navigator [OCLC - Resource Sharing and Delivery]">WorldCat Navigator</a> &#8212; OCLC&#8217;s product to enhance <acronym title="Interlibrary Loan">ILL</acronym> with integration into the local circulation system &#8212; is being rolled out through the Texas State Library and Archives Commission to 500 public libraries; members of the Boston Library Consortium are in the process of implementing WorldCat Local and WorldCat Navigator. <a href="http://www.questionpoint.org/" title="Home [QuestionPoint]">QuestionPoint</a>, OCLC&#8217;s remote reference service, can also now be imbedded into Facebook, MySpace and a Text-a-Librarian widget.</p><p>OCLC is also looking to help with collection management as a cloud-available tool.  Working with the New York University Libraries, OCLC is bringing analytics to bear on collection management and space allocation decisions by helping with data about the location of items in the campus library, in the library&#8217;s &#8220;ReCAP&#8221; remote storage, and what is available digitally in HathiTrust.  And speaking of <a href="http://www.hathitrust.org/" title="Welcome to the Shared Digital Future | www.hathitrust.org">HathiTrust</a>, the public interface to 7.5 million volumes digitized largely through the Google Book Search partnership, OCLC is working with project participants to aid the metadata description of items in HathiTrust, to ensure that items in HathiTrust have records in WorldCat, and to add WorldCat Local as an interface to the HathiTrust collection.</p><p><h2>Recent problems for Cataloging Partners</h2><br />I have to give OCLC credit for owing up to issues with the membership.  At most recent OCLC update meetings, it was the uproar about the proposed-the-withdrawn OCLC Record Use Policy.  At this update there was mention of problems in cataloging system disruptions (October) and problems with generating labels (December).  Remediation for these problems has received dedicated effort to resolve.  The systems are fixed and the backlogs that resulted from the problems are now being worked through.</p><div class="zemanta-pixie"><img class="zemanta-pixie-img" alt="" src="http://cdn.dltj.org/wp-content/uploads/2010/01/pixy.gif?x-id=bd0c4cf9-299c-40c9-b423-4102c9387864" /><span class="zem-script more-related pretty-attribution"><script type="text/javascript" src="http://static.zemanta.com/readside/loader.js" defer="defer"></script></span></div>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/alamw10-oclc-update/feed/</wfw:commentRss> <slash:comments>16</slash:comments> </item> <item><title>Notes from the OCLC Record Use Policy Council discussion</title><link>http://dltj.org/article/alamw10-record-use-policy/</link> <comments>http://dltj.org/article/alamw10-record-use-policy/#comments</comments> <pubDate>Mon, 18 Jan 2010 13:04:08 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[policy]]></category> <category><![CDATA[ALA Midwinter Conference 2010]]></category> <category><![CDATA[Dewey Decimal Classification]]></category> <category><![CDATA[licensing]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[WorldCat]]></category><guid isPermaLink="false">http://dltj.org/?p=1454</guid> <description><![CDATA[On Saturday morning of ALA Midwinter 2010, Dr. Jennifer Younger moderated a session on the progress of the OCLC Record Use Policy Council. The meeting started with an introduction to the reasons behind the creation of the Record Use Council, &#8230; <a href="http://dltj.org/article/alamw10-record-use-policy/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1454"></abbr><p>On Saturday morning of ALA Midwinter 2010, Dr. Jennifer Younger moderated a session on the progress of the <a href="http://www.oclc.org/worldcat/catalog/policy/council/default.htm" title="Record Use Policy Council [OCLC - Policy for Use and Transfer of WorldCat Records]">OCLC Record Use Policy Council</a>.  The meeting started with an introduction to the reasons behind the creation of the Record Use Council, the charge of the Council from the board of trustees, and how the framing of the discussion of the policy is guided by the values and history of OCLC the cooperative. There wasn&#8217;t much new here for those that have been following the progress of the policy discussion, so I am skipping over it most of it with the exception of a few notable topics. After that,  I&#8217;m focusing on the lengthy question and answer session that followed Dr. Younger&#8217;s background presentation.</p><p><h2>Highlights of the Background Presentation</h2><br />Dr. Younger said that the review council is on track to get the proposed policy to the <a href="http://www.oclc.org/about/trustees/default.htm" title="Board of Trustees [OCLC - About OCLC]">OCLC Board of Trustees</a> in May in time for it to be reviewed at the Board&#8217;s June meeting.  They haven&#8217;t started putting pen to paper on a draft policy statement, but are close; next week the members of the Council will be in Dublin for a two day meeting, and coming out of that will be a draft of the policy.  From there, the draft policy will be reviewed by the various governance bodies of OCLC &#8212; the regional council, the global council, and the board of trustees &#8212; and there will be an extensive discussion about the draft policy at the global council meeting in April.</p><p>WorldCat itself is now made up of 170 million bibliographic records and 1.5 billion statements of holdings from libraries.  A policy is needed to create a viable business plan for sustaining this resource.</p><p>What the policy will cover:  rights and responsibilities of members that have created WorldCat &#8212; the rights of members to use elements of WorldCat and the shared responsibilities to the members of the cooperative that go along with the rights; identifying acceptable use by third parties; what are OCLC&#8217;s rights to use the records on behalf of the members; and a process for collective participation in reviewing and modifying the policy over time.  It will also have a &#8220;rather robust&#8221; preamble that answers the question of why a policy is needed, what problem is the policy is trying to solve, and what it is about WorldCat that necessitates a policy.</p><p><h2>An Aside:  What&#8217;s In a Name &#8212; OCLC-the-membership and OCLC-the-stewards</h2><br />The discussion of the record use policy is intertwined with the conversations of governance of the cooperative, and I think it is important to be aware that there are many facets to the OCLC name as it is commonly used.  In some cases we use &#8220;OCLC&#8221; to mean the cooperative, or &#8212; more specifically &#8212; the members of the cooperative.  To be more precise, I will usually refer to this group as &#8220;OCLC-the-membership.&#8221;  In other cases, it means the conglomeration of staff, hardware/software, and services centered at buildings in Dublin, OH.  Previously I have called the latter &#8220;OCLC-the-corporate&#8221; but in the course of the record use policy council discussion, Jay Jordon took issue with this phrase and said he preferred &#8220;OCLC-the-steward.&#8221;  Names carry nuances, and I agree with Jay that OCLC-the-steward is a better name to call the entity that is serving OCLC-the-membership.</p><p><h2>The View from the Database Level</h2><br />What the representatives from the review council said their work focused on WorldCat as a database of records that OCLC-the-steward is managing on behalf of OCLC-the-membership.  The council has gotten away from the discussion of individual records in favor of the value of the WorldCat database &#8212; its data, services, and infrastructure &#8212; as a whole.  They recognize that the value and use of WorldCat is not only to know about a book (its metadata) but where it is located (the attached holdings). More specifically, the review council identified three kinds of value from WorldCat:<ol start="1" type="1"><li>As a supply of bibliographic records.</li><li>The ability to represent library holdings &#8212; the collective collection of libraries and the capability to reveal what libraries have in places like Google Book Search.</li><li>Knowledge organization pieces:  taking the shared contribution of libraries and makes something more from it using authority control, terminologies, the <a class="zem_slink" href="http://en.wikipedia.org/wiki/Dewey_Decimal_Classification" title="Dewey Decimal Classification" rel="wikipedia">Dewey Decimal classification</a> system, FRBR work sets, etc.</li></ol><p> It was interesting to note a non-U.S. perspective that the council has heard regarding the value of WorldCat. While most North American libraries strongly value WorldCat as a supply of bibliographic records (copy cataloging), the national libraries outside of North America are joining because adding their records to WorldCat gives greater visibility to their holdings.  So the second and third value propositions above carry more weight than the first, which is arguably the most valuable aspect for North American libraries.</p><p>The challenge the Council said it is facing is to put enough controls in place to protect the value and viability of WorldCat while allowing enough flexibility for members, non-members, and OCLC to experiment and derive new, valuable services. One of the questions the review council is grappling with is how can the Cooperative use &#8220;community norms&#8221; to ensure the responsibilities assigned to the members are followed so we govern ourselves.</p><p>In taking this database-wide view, the council has set aside issues of individual record ownership and copyright of data in records and focused on what is valuable about the collection of records as a whole to the membership. WorldCat as a whole collective is copyrighted.  As explained in the follow-up discussion with members of the council, the intellectual property law surrounding WorldCat records extends across many juristictions, so the council chose to focus at the database level.</p><p><h2>Third Parties</h2><br />The review council heard of the need for clarification on how libraries must be able to extend rights to third party agents acting on behalf of a member library, and acknowledged the need to outline the responsibilities of members as they work with OCLC WorldCat records using non-OCLC-member third parties and agents of member libraries.  There are efforts in the policy council to structure the resulting policy such that OCLC-the-steward would take the responsibility for policing third-party data activities (presuming, of course, the OCLC member notifies OCLC-the-steward that the activity is taking place).  It was stated that there are companies that want to get WorldCat records with OCLC enhancements &#8212; the control number, the fields upgraded by the internal WorldCat auditing software, etc. &#8212; by paying for them once, or not paying for them at all, and resell them to other customers.  These are viewed as attempts to profit off of what the cooperative has built without giving anything back to the cooperative.  The policy is intended to help OCLC-the-steward prevent this from happening, not to limit what member libraries themselves can do.</p><p>In licensing WorldCat data to others, OCLC-the-steward is looking for remuneration of some sort for OCLC-the-cooperative.  If the business use by a non-member third party is one that will harm the value and viability of the WorldCat Network, then the policy council wants to see it governed in some way.  Remuneration can be in monetary form, where that external party pays a fee for the data.  Or it can be in a non-monetary form, such as the <i>quid pro quo</i> with the internet search engines that have WorldCat data and in return drive traffic back to local libraries through linkage on WorldCat.org.  As Jay Jordan put it sucinctly, &#8220;I&#8217;ll do a contract with anyone that returns value to the cooperative.  Oftentimes, that is not cash.&#8221;  It was stated that there have been attempts to download the entire WorldCat database.  In order to be able to legally stop that, there must be a policy in place that prohibits it.</p><p><h2>WorldCat as Linked Data</h2><br />I asked a question about whether anyone was advocating for the benefits to the world in general, and the specific example was putting WorldCat data into the semantic web. Significant portions of WorldCat data is freely available in a human-readable form, but not in a way that makes it easy for a machine to process and make relationships to other data &#8212; a form of data representation commonly called &#8220;linked data.&#8221;  For example, Google as an entity can come and negotiate for the rights and responsibilities to use WorldCat data as part of its services.  There isn&#8217;t a corresponding entity in the semantic web world to come along and negotiate for the dissemination of basic facts about items in WorldCat to the linked data universe.  The council has talked about the distinction between &#8220;public good&#8221; and &#8220;club (member) good.&#8221;  Some of this distinction is intended to be explained in the preamble.  Linked data is a form of innovation that the council doesn&#8217;t want to shut down.  They are trying to find how this get encouraged without shutting it down in the policy.</p><p><h2>Questions</h2><br />In reflecting on these notes and what else happened in the course of the meeting, I came up with other questions that might be valuable for the Record Use Policy Council to think about.</p><ol start="1" type="1"><li>In Jennifer&#8217;s introduction, she talked about not only the value of the bibliographic records but also the value of the holdings.  Has the Council looked a differing policies for bibliographic information versus holdings?</li><li>The discussion of linked data was incomplete due to time constraints.  Has there been a discussion about a differentiation of value for different types or views of data?  Machine access version human-oriented access?  Linked data of some portion of the bibliographic record?  Is the representation of the benefit of the world in general being taken into account in drafting policy guidelines?</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/alamw10-record-use-policy/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> </channel> </rss>
<!-- Served from: dltj.org @ 2012-05-24 17:40:13 by W3 Total Cache -->
