<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"><channel><title>Disruptive Library Technology Jester &#187; Linking Technologies</title> <atom:link href="http://dltj.org/category/linking/feed/" rel="self" type="application/rss+xml" /><link>http://dltj.org</link> <description>We&#039;re Disrupted, We&#039;re Librarians, and We&#039;re Not Going to Take It Anymore</description> <lastBuildDate>Mon, 06 Feb 2012 20:04:22 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <cloud domain='dltj.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' /> <creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/us/</creativeCommons:license> <item><title>EBSCOhost Connection Records Found In-The-Wild</title><link>http://dltj.org/article/ebscohost-connection/</link> <comments>http://dltj.org/article/ebscohost-connection/#comments</comments> <pubDate>Wed, 16 Sep 2009 02:36:46 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[Ebsco]]></category> <category><![CDATA[Google]]></category> <category><![CDATA[screencast]]></category><guid isPermaLink="false">http://dltj.org/?p=1272</guid> <description><![CDATA[EBSCOhost Connect was announced in the spring of 2006 as near as I can recall. (I can&#8217;t find the press release about it on the EBSCO website. As close as I can come to a date is from an announcement &#8230; <a href="http://dltj.org/article/ebscohost-connection/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1272"></abbr><p>EBSCO<i>host</i> Connect was announced in the spring of 2006 as near as I can recall. (I can&#8217;t find the press release about it on the EBSCO website.  As close as I can come to a date is from <a href="http://old.oslis.org/index.php?page=archive040306" title="OSLIS - Oregon School Library Information System" class="broken_link" rel="nofollow">an announcement at the Oregon School Library Information System</a>.)  After three years, I&#8217;ve finally seen an EBSCO<i>host</i> Connect in Google web search results.  This screencast and accompanying transcript (below) show what I&#8217;ve found.<br /><span id="more-1272"></span><br /><h2>Screencast Video</h2><br /><object type="application/x-shockwave-flash" data="http://dltj.org/wp-content/plugins/pb-embedflash/swf/mediaplayer.swf?width=768&amp;height=596" width="768" height="596" class="embedflash"><param name="movie" value="http://dltj.org/wp-content/plugins/pb-embedflash/swf/mediaplayer.swf?width=768&amp;height=596" /><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="flashvars" value="file=http://media.dltj.org/ebscohost-connection.mp4&amp;searchbar=false" /><small>(Please open the article to see the flash file or player.)</small></object></p><p><h2>Video Transcript with Links</h2><br />This video is a screencast demonstration of EBSCO&#8217;s records in the Google web index in a service known as EBSCO<i>host</i> Connection.  The description of this service from their website says that:<br /><blockquote>EBSCO<i>host</i> Connection is designed to bridge the gap between a search of Google and other search engines and the valuable content of EBSCO<i>host</i> that is available to you, courtesy of your library. EBSCO<i>host</i> Connection is intended to promote and expedite access to this quality content by infusing brief citation-only records from EBSCO<i>host</i> databases within result lists generated by these search engines. As such, you can click on the EBSCO<i>host</i> record in these result lists, and be appropriately directed to the database pages within your library&#8217;s EBSCO<i>host</i> resources.</p></blockquote><p>This sounds like a good idea, but until I saw a <a href="http://www.walkingpaper.org/2140" title="User Hostility: EBSCOhost Connection">post</a> last week from Aaron Schmidt on his Walking Paper blog, I hadn&#8217;t really seen any of these EBSCO<i>host</i> Connect records in Google web search results.  Aaron found that a Google search for &#8220;triumph triple connection&#8221; will show an EBSCO<i>host</i> Connect record.  In my testing, I found it to be in either the sixth or seventh position on the first screen of results.  Your experiences will, of course, vary by your geographic location and past search history, but &#8212; in any case &#8212; I made this screencast to show you what it looks like from both inside and outside an IP address range recognized as subscribing to EBSCO<i>host</i>.</p><p>In the simplest case, your machine is using an IP address configured as part of an EBSCO<i>host</i> subscription.  Starting at Google, we&#8217;ll do a search for &#8220;triumph triple connection&#8221; and scroll down to see the EBSCO<i>host</i> Connect record.  Clicking on it gives this brief citation display with a note at the top to access the complete article on EBSCO<i>host</i> courtesy of my place of work.  Selecting that link indeed takes you to the full record display with all of the EBSCO<i>host</i> functionality.</p><p>It is a little more complicated if your machine is using an IP address that is not recognized by EBSCO<i>host</i> as coming from a subscribing institution.  To demonstrate this, we&#8217;ll start again at Google and search for &#8220;triumph triple connection&#8221;.  This time when we select the search result from Google we get a page with with this heading where the link directly into EBSCO<i>host</i> used to be.  We need to come over to the left sidebar to find our institution &#8212; either by institution name, by ZIP code, or by using a narrowing down browse process.  I&#8217;ll search for &#8220;Ohio&#8221; as part of the institution name.  And indeed my place of work shows up as a choice.</p><p>It is at this point that things start to go wrong.  As Aaron noted in his post, he couldn&#8217;t get past this screen.  I had the benefit of looking at the EBSCO<i>admin</i> configuration for my place of work, and unfortunately that still didn&#8217;t help.  What we get is this rather unhelpful, generic EBSCO<i>host</i> login.  Even though our consortial configuration in EBSCO<i>admin</i> is set to use our EZproxy server to authenticate users, we still get this generic login prompt.  The EBSCO Support knowledge bank entry for <a href="http://support.epnet.com/knowledge_base/detail.php?id=2717" title="EBSCO Support:">How do I set up EBSCOhost Connection?</a> describes an authentication setting: &#8211; &#8220;From the drop-down list, select the method of authentication you want to use: User ID/Password, CPID, or Patron ID.&#8221;  The problem is that I don&#8217;t want to use any of those.  I want to send our users through our proxy server and EBSCO<i>host</i> Connect seems to be ignoring the proxy profile setting, and that option isn&#8217;t even listed in the documentation.</p><p>This might be worth a call to EBSCO to clear up, but frankly if it has taken this long for me to stumble across one of these Google search hits I&#8217;m not sure how often our users actually get this far.  I&#8217;ll probably put that call on the &#8220;if-I-have-nothing-else-better-to-do&#8221; list, and if I ever get to it or this somehow magically gets resolved, I&#8217;ll post an update on this blog post.  Thanks for listening.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/ebscohost-connection/feed/</wfw:commentRss> <slash:comments>8</slash:comments> <enclosure url="http://media.dltj.org/ebscohost-connection.mp4" length="39495126" type="video/mp4" /> </item> <item><title>Analysis of PubGet &#8212; An Expedited Fulltext Service for Life Science Journal Articles</title><link>http://dltj.org/article/analysis-of-pubget-an-expedited-fulltext-service-for-life-science-journal-articles/</link> <comments>http://dltj.org/article/analysis-of-pubget-an-expedited-fulltext-service-for-life-science-journal-articles/#comments</comments> <pubDate>Wed, 05 Aug 2009 02:23:03 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[ejournal]]></category> <category><![CDATA[openurl]]></category> <category><![CDATA[pubmed]]></category><guid isPermaLink="false">http://dltj.org/?p=1008</guid> <description><![CDATA[In June, a new service that speeds access to life sciences literature reached a milestone. Called PubGet, it is a service that reduces the number of clicks to the full text of an article, and the milestone was activating the &#8230; <a href="http://dltj.org/article/analysis-of-pubget-an-expedited-fulltext-service-for-life-science-journal-articles/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1008"></abbr><p>In June, a new service that speeds access to life sciences literature reached a milestone.  Called <a href="http://pubget.com/search" title="Pubget homepage">PubGet</a>, it is a service that reduces the number of clicks to the full text of an article, and the milestone was <a href="http://pubgetteam.blogspot.com/2009/06/50.html" title="Pubget Team: 50">activating the 50th institution using its service</a>.  Using its own proprietary &#8220;pathing engine&#8221;, it links directly to the full text on the publisher&#8217;s website.  PubGet does this by understanding the link structure for each journal of each publisher and constructing the link to the full-text based on information from the citation.  The PubGet service <a href="http://pubget.com/site/contact/whats_pubget" title="What's Pubget?">focuses</a> on the life sciences journals indexed in <a href="http://www.ncbi.nlm.nih.gov/pubmed/" title="PubMed Home">PubMed</a> &#8212; hence the play on names:  PubMed to PubGet.<br /><h2>How It Works</h2><br /><div id="attachment_1205" class="wp-caption alignright" style="width: 310px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://cdn.dltj.org/wp-content/uploads/2009/08/OLINKS-screen-for-Christensen-article.png"><img style=' display: block; margin-right: auto; margin-left: auto;'  src="http://cdn.dltj.org/wp-content/uploads/2009/08/OLINKS-screen-for-Christensen-article-300x216.png" alt="OLINKS screen for Christensen article" title="OLINKS screen for Christensen article" width="300" height="216" class="aligncenter size-medium wp-image-1205" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Link Resolver screen for Christensen article</p><p><a href="http://cdn.dltj.org/wp-content/uploads/2009/08/Disruptive-Innovation-for-Social-Change_1249426587943.png"><img style=' display: block; margin-right: auto; margin-left: auto;'  src="http://cdn.dltj.org/wp-content/uploads/2009/08/Disruptive-Innovation-for-Social-Change_1249426587943-300x213.png" alt="EBSCOhost screen for Christensen article" title="EBSCOhost screen for Christensen article" width="300" height="213" class="aligncenter size-medium wp-image-1206" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">EBSCOhost screen for Christensen article</p><p><a href="http://cdn.dltj.org/wp-content/uploads/2009/08/Nature-The-shale-revolution_1249432025376.png"><img style=' display: block; margin-right: auto; margin-left: auto;'  src="http://cdn.dltj.org/wp-content/uploads/2009/08/Nature-The-shale-revolution_1249432025376-300x222.png" alt="Typical View of a PubGet Article Display" title="Typical View of a PubGet Article Display" width="300" height="222" class="aligncenter size-medium wp-image-1207" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Typical View of a PubGet Article Display</p></div> In a typical interaction, a user would start at a web page with a journal article citation that has a link to the user&#8217;s OpenURL resolver.  Contained in that link is the citation metadata that identifies the specific article.  Clicking on that link takes you to the OpenURL resolver web page for that specific article.  That web page contains links to any online versions of the article, and might also include links to library catalog records for physical copies, and options to search for similar articles.  An example of one of these pages is <a href="http://olinks.ohiolink.edu/olinks.php?id=doi:&amp;genre=article&amp;isbn=&amp;issn=00178012&amp;title=Harvard+Business+Review&amp;volume=84&amp;issue=12&amp;date=20061201&amp;aulast=Christensen%2c+Clayton+M.&amp;atitle=Disruptive+Innovation+for+Social+Change.&amp;spage=94&amp;pages=94-101" title="OLINKs page for Christensen article">this one from my place of work for an article by Clayton Christensen in the Harvard Business Review</a>.  (When you are coming from an OhioLINK member institution, it looks like the screen image to the right.)  Clicking on that link that says &#8220;<a href="http://search.ebscohost.com/login.aspx?direct=true&#038;site=ehost-live&#038;scope=site&#038;db=bth&#038;AN=23081431" title="EBSCOhost page for Christensen article">Full text of this article at EBSCO</a>&#8221; takes you to yet another page &#8212; this time from EBSCOhost &#8212; that has the citation data again and the options for viewing or taking other actions on the article.  Once there it is one more click to the HTML or PDF full text of the article.  From the perspective of the creators of PubGet, that is two clicks and two screens too many.  PubGet&#8217;s pathing engine knows about the structure of links on the publishers website, and so it creates a link directly from the citation in the search results list to the article PDF.</p><p>The pathing engine is one of three components that make up the service.  The other two are a search engine and a personalization feature.  The search engine indexes the citation and abstract fields; it is not nearly as sophisticated as the thesauri-driven search engine native to PubMed, but it does the job for cases when you have a known citation.  The personalization feature allows you to tie your account on PubGet to an institution, and with that knowledge the PubGet service can know exactly what digital rights your institution has for each journal and can create links to the full-text article that go through your institution&#8217;s proxy server.  The account system also enables you to have new articles matching your search criteria sent to you and to mark articles in the search results for later bulk downloading (via a <a href="http://ianconnor.blogspot.com/2009/03/download-all-papers-from-firefox-within.html" title="Download all papers from Firefox within Pubget">Firefox plugin</a>).<sup><a href="http://dltj.org/article/analysis-of-pubget-an-expedited-fulltext-service-for-life-science-journal-articles/#footnote_0_1008" id="identifier_0_1008" class="footnote-link footnote-identifier-link" title="Since the articles are not held within the PubGet service itself, the bulk article downloading function requires a Firefox plugin so that the article requests come from your browser to the publisher&amp;#8217;s site.">1</a></sup></p><p><h2>Thinking About PubGet in a Wider Information Ecosystem</h2><br />One quandary I have with PubGet is that it bypasses <a href="http://en.wikipedia.org/wiki/OpenURL" title="OpenURL - Wikipedia">OpenURL</a> as the open standard for linking to full-text content.  In order to take advantage of PubGet&#8217;s unique characteristic &#8212; the pathing engine to get straight to the article text &#8212; you need to start at the PubGet site itself in order to get the direct URLs to the articles.  This is a pretty significant downside to the service.  You can&#8217;t get the pathing engine along with the powerful PubMed search engine.</p><p>It would be nice if PubGet could be set up as an OpenURL target, and when it receives a request translates it to the direct link to the full-text using its pathing engine.  That way I could set up PubGet as the OpenURL resolver in my PubMed account, and the article links in PubMed would automatically go to the full text.  I don&#8217;t know how this would work as a business model for PubGet, though, because as an OpenURL resolver is this manner, it makes the PubGet website invisible &#8212; through a series of browser redirects I&#8217;d go from PubMed to PubGet to the publisher site.  (If the point is to <a href="http://pubget.com/site/contact/how_we_make_money" title="Pubget: How We Make Money">make money selling advertising for related industries</a>, a configuration that completely by-passes any visible signs of PubGet would cut into that revenue source.)</p><p>As I was sharing background on PubGet with Thomas Dowling, a colleague at OhioLINK, he pointed out something I didn&#8217;t know about OpenURL:  it is within the standard to specify a &#8220;service type&#8221; in the OpenURL Context Object.  Section 5.1 of the <a href="http://www.niso.org/kst/reports/standards?step=2&amp;gid=&amp;project_key=d5320409c5160be4697dc046613f71b9a773cd9e" title="ANSI/NISO Z39.88 - The OpenURL Framework for Context-Sensitive Services">NISO standard for OpenURL</a> says a service type is &#8220;The resource that defines the type of service (pertaining to the Referent) that is requested.&#8221;  And there are indeed <a href="http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=GetRecord&amp;metadataPrefix=oai_dc&amp;identifier=info:ofi/fmt:xml:xsd:sch_svc" title="XML Metadata Format for Scholarly ServiceTypes">service types registered</a> as part of the SAP2 Community Profile: abstract, citation, fulltext, holdings, ill, and any.  So the recipient of an OpenURL request, using the &#8220;fulltext&#8221; Service Type, should be able to replicate the proprietary PubGet pathing engine using a standard OpenURL structure.  In a brief bit of experimentation, though, I was not able to find an OpenURL resolver that a) knew how to handle a Service Type parameter, and/or b) knew how to honor that parameter by getting directly to the full text.  As OpenURL undergoes its 5-year review this year, it might be worthwhile to emphasize this part of the standard with examples and descriptions of best practices so it is more widely adopted.</p><p><h2>Other Articles on PubGet</h2></p><ul type="disc"><li><a href="http://www.bio-itworld.com/news/06/10/09/pubget-full-text-PDF-search.html" title="Got PubMed? Pubget Searches and Delivers Scientific PDFs">Got PubMed? Pubget Searches and Delivers Scientific PDFs</a>, Bio-IT World, June 10, 2009, By Kevin Davies</li><li><a href="http://scharrlibrary.blogspot.com/2009/07/recommended-website-of-month-pubget.html" title="Recommended Website of the month - Pubget">Recommended Website of the month &#8211; Pubget</a>, Information Service based at ScHARR at The University of Sheffield, June 9, 2009</li><li><a href="http://researchblogging.org/news/?p=126" title="Pubget -- useful, growing resource for anyone interested in research">Pubget &#8212; useful, growing resource for anyone interested in research</a>, ResearchBlogging.org news, June 10, 2009, By Dave Munger</li><li><a href="http://www.xconomy.com/boston/2009/06/23/pubget-speeds-up-science-journal-searches-provides-marketing-tools/" title="Pubget Speeds Up Science Journal Searches, Provides Marketing Tools">Pubget Speeds Up Science Journal Searches, Provides Marketing Tools</a>, Xconomy, June 23, 2009, By Ryan McBride</li><li><a href="http://bitesizebio.com/2009/06/25/get-pdfs-asap-with-pubget/" title="Get PDFs ASAP with Pubget">Get PDFs ASAP with Pubget</a>, Bitesize Bio, June 25, 2009, By Carrie Iwema</li></ul><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_1008" class="footnote">Since the articles are not held within the PubGet service itself, the bulk article downloading function requires a Firefox plugin so that the article requests come from your browser to the publisher&#8217;s site.</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/analysis-of-pubget-an-expedited-fulltext-service-for-life-science-journal-articles/feed/</wfw:commentRss> <slash:comments>18</slash:comments> </item> <item><title>Collocating Serial Formats Via &#8220;Linking ISSN&#8221;</title><link>http://dltj.org/article/issn-l/</link> <comments>http://dltj.org/article/issn-l/#comments</comments> <pubDate>Fri, 06 Jun 2008 16:12:55 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[identifier]]></category> <category><![CDATA[ISSN]]></category> <category><![CDATA[metadata]]></category> <category><![CDATA[openurl]]></category><guid isPermaLink="false">https://dltj.org/?p=374</guid> <description><![CDATA[Earlier this week I received an e-mail from the director of the ISSN International Center announcing a session at the ALA Annual Conference in Anaheim to talk about the &#8220;linking ISSN&#8221;. Abbreviated ISSN-L, this is a new addition to the &#8230; <a href="http://dltj.org/article/issn-l/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="https://dltj.org/?p=374"></abbr><p>Earlier this week I received an e-mail from the director of the <acronym title="International Standard Serials Number">ISSN</acronym> International Center announcing a session at the <acronym title="American Library Association">ALA</acronym> Annual Conference in Anaheim to talk about the &#8220;linking ISSN&#8221;.  Abbreviated ISSN-L, this is a new addition to the revised ISSN standard (ISO 3297, published last August) that allows for the collocation of separate ISSNs under a single ISSN-L.  The ISSN standard now explicitly states that an <q>ISSN is a unique identifier for a specific serial in a defined medium.</q> In other words, separate ISSN should be assigned to each different medium version of a serial.  The ISSN-L table brings these separate ISSNs together.</p><p>The FAQ I received that goes along with the e-mail announcement has several interesting statements.  I&#8217;ve extracted the most interesting, at least from my perspective, here:</p><blockquote><p>The purpose of the linking ISSN is to provide a tool for grouping, or collocating, the various medium versions of a resource, for instance, the print and online versions of a journal. &#8230; Those involved in the production, distribution, management of, and access to serial resources have expressed the strong desire that the ISSN system meet two different needs:</p><ul type="disc"><li>The need for the ISSN to identify the various medium versions of a continuing resource, for product management purposes. To meet this need, separate ISSN are assigned to the various medium versions of a resource.</li><li>The need for a collocating, or grouping mechanism that would bring together various medium versions, and thus facilitate content management. The linking ISSN (ISSN-L) has been defined to meet this currently unmet need.</li></ul><p>The linking ISSN is a new function for the ISSN system &#8212; not a new identifier. &#8230; The first ISSN assigned in the ISSN Register to any medium version of a continuing resource is designated by default to function also as the linking ISSN and applies to all other medium versions of that resource identified in the ISSN Register.  The linking ISSN is labelled (for eye-readable purposes) as “linking ISSN” or “ISSN-L”.   A linking ISSN is designated for each continuing resource identified in the ISSN Register, even if the continuing resource is issued in only one medium. Only one linking ISSN is designated regardless of how many different medium versions of a continuing resource exist.</p><p>The ISSN-L should always be treated, for computer processing purposes, as a separate data element. For example, in MARC formats, which are used in the ISSN Register, ISSN-L is input in a separately tagged subfield in each of the ISSN Register metadata records to which it pertains. It is important to note that ISSN-L should be processed and used separately from medium-specific ISSN. In applications based on tables or indexes, ISSN-L should not be included in the same tables or indexes as medium-specific ISSN.</p><p>The designated ISSN-L is made available in several different ways:</p><ul type="disc"><li>via a table which lists the ISSN-L and the corresponding ISSN linked to the ISSN-L. This table is available free of charge on the ISSN IC web site. <i>[See note below.]</i></li><li>via the ISSN Register: (each metadata record in the ISSN Register includes the medium-specific ISSN assigned to the resource described in the record, and the designated linking ISSN, as separate data elements);</li><li>via the ISSN National Centres, which communicates to publishers the ISSN-L designated for newly-assigned ISSN,</li><li>via the resources themselves, provided that publishers print or display this information according to the recommendations in  the standard.</li></ul><p>In order for the ISSN-L to work effectively, publishers will need to clearly indicate when they are using an ISSN-L as opposed to an ISSN.  The ISO standard’s recommendations for printing and displaying ISSN-L are as follows: <q>the linking ISSN shall be clearly distinguished as such by use of the label ISSN-L. In such cases, the label ISSN-L shall be written in uppercase and a space shall precede the 8 digits of the linking ISSN. Example: &#8216;ISSN-L 0251-1479&#8242;.</q></p><p>No programmatic method can be used to determine ISSN-L on the basis of one of the medium-specific ISSN, nor is there a programmatic way to determine the group of medium-specific ISSN associated with one ISSN-L. This is due to fundamental characteristics of the ISSN system: ISSN have no inherent meaning, and they are distributed sequentially. ISSN are assigned by national centres around the world, and a new medium version may appear at any time, perhaps published in a different country; this cannot be predicted and thus there is no programmatic way to associate an ISSN with its corresponding ISSN-L.</p><p>The assignment policy of the Standard now explicitly specifies that <q>ISSN is a unique identifier for a specific serial in a defined medium</q>. Therefore separate ISSN are assigned to each particular medium version of a serial. &#8230; If some publishers use the same ISSN for different medium versions of a serial, they deprive their users of the means to identify the medium- specific versions of that serial for ordering, claiming, etc. However, this should not interfere with the ISSN-L. The single ISSN will become the ISSN-L that can be used for collocating functions.</p><p>ISSN-L is a tool that aims at facilitating collocating functions. To perform these functions adequately, ISSN-L and the related ISSN have to be present in OpenURL knowledge bases. At this time, various scenarios can be envisaged. The desired end result is described as follows: <q>The request must use the data available in the citation. It is the job of the resolver to match the identifier to the appropriate resource. It is the resolver that will make use of ISSN-L to relate the various medium of an ISSN to each other and find one to satisfy the request.  One should be able to put in a request (for example an OpenURL) using any of the ISSN and separately and additionally request that the electronic copy is desired.</q></p></blockquote><p>One note: I can&#8217;t find the ISSN-L table on the <a href="http://www.issn.org/" title="ISSN International Centre homepage">http://www.issn.org/</a> website.  In a follow up discussion with Françoise Pellé (Director of the ISSN International Centre) I learned that it isn&#8217;t there yet, but they intend to put it there.</p><p>This is an interesting, well, kluge.  It provides a neat amount of backwards compatibility &#8212; for publications that only have one medium, that publication&#8217;s ISSN automatically becomes its ISSN-L.  The publishers (presumably) would need to take proactive action to tell the ISSN registrar that two ISSN are two mediums of the same publication; hopefully the word will get out and all affected publishers will do so.</p><p>It doesn&#8217;t effectively replace the <a href="http://xissn.worldcat.org/xissnadmin/index.htm" title="WorldCat Web service: xISSN [OCLC - WorldCat Affiliate tools]: Home">xISSN service from OCLC</a> because the ISSN-L table only gives <em>current</em> links.  The description of xISSN says <q>ISSNs are related in two different ways: different editions of same serial (such as print and online editions) and historical relationships (ISSN changes that result from title changes, mergers, splits, etc.).</q> ISSN-L only handles the former (different editions) relationship type, not the latter (historical) relationship type.  The ISSN-L table is (reportedly) free, however, and the xID service require an OCLC network membership.  (While this posting was in draft form, Tim McCormick of OCLC announced that <q>effective immediately, xID services from OCLC &#8212; that is, xISBN, and the forthcoming xISSN &#8212; will be included at no additional cost with all OCLC cataloguing subscriptions.</q></p><p>I hope to hear more after ALA Annual.  Unfortunately, I think I have a conflict with the proposed meeting on Friday afternoon of ALA, so if anyone else hears anything please blog about it.  (Trackbacks to here would be appreciated.)</p><p><h2>Updates from Information at ALA</h2><br />I heard some more about linking ISSN during the <span class="removed_link" title="http://ala.org/ala/lita/litamembership/litaigs/igstandards/standards.cfm"><acronym title="Library and Information Technology Association">LITA</acronym> Standards interest group</span> meeting on Saturday afternoon.</p><p>MARC codes were approved by MARBI a few months ago.  Current ISSN-L identifiers will be put in MARC21 field 022 subfield &#8216;l&#8217;.  Cancelled ISSN-L identifiers will be put in MARC21 field 022 subfield &#8216;m&#8217;.  (Every record will have a subfield &#8216;l&#8217; or a subfield &#8216;m&#8217;, even if it is in a single format.)</p><p>Retrospective designation uses lowest ISSN from the cluster linked via 776 field.  As on-going ISSN assignments are made, the ISSN-L will be the first assigned out of any media.  (May not always be the lowest in numerical order because of how ISSNs are allocated to ISSN centers.)</p><p>The crossreferencing linking table for ISSN-L has not yet been published.  The international ISSN center is also considering a web service that would send back all of the associated ISSNs for an ISSN-L.</p><p>Launch is at IFLA in August 2008.  Expected implementation before end of 2008.  (There is of course some catch-up time as library automation vendors add this capability to individual systems.)<p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://ala.org/ala/lita/litamembership/litaigs/igstandards/standards.cfm on June 9th, 2011.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/issn-l/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> <item><title>OAI-ORE Alpha Specifications Updated</title><link>http://dltj.org/article/oai-ore-03/</link> <comments>http://dltj.org/article/oai-ore-03/#comments</comments> <pubDate>Mon, 14 Apr 2008 14:11:07 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[atompub]]></category> <category><![CDATA[Open Archives Initiative]]></category><guid isPermaLink="false">https://dltj.org/?p=350</guid> <description><![CDATA[As a result of discussions coming from the OAI-ORE open meeting in Baltimore in the first week of March, the document editors released a new version of the ORE alpha specifications (labeled &#8220;0.3&#8243;) earlier this month to coincide with the &#8230; <a href="http://dltj.org/article/oai-ore-03/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="https://dltj.org/?p=350"></abbr><p>As a result of discussions coming from the <abbrev title="Open Archives Initiative">OAI</abbrev>-<abbrev title="Object Reuse and Exchange">ORE</abbrev> open meeting in Baltimore in the first week of March, the document editors released <a href="http://www.openarchives.org/ore/0.3/toc" title="ORE Specification and User Guide - Table of Contents">a new version of the ORE alpha specifications</a> (labeled &#8220;0.3&#8243;) earlier this month to coincide with the open meeting at Southampton, UK.  In a message to the technical committee, Herbert Van de Sompel summarized the changes as:</p><ul type="square"><li><a href="http://www.openarchives.org/ore/0.3/datamodel" title="ORE Specification - Abstract Data Model">Data Model</a>: changes in the relationship between Resource Map and Aggregation; introduction of a solution regarding the reference in context; revised approach regarding nesting aggregations.</li><li><a href="http://www.openarchives.org/ore/0.3/atom" title="http://www.openarchives.org/ore/0.3/atom">Atom serialization</a>: Significant revision of the mapping from the ORE Model to Atom.</li><li>Introduction of <a href="http://www.openarchives.org/ore/0.3/primer" title="ORE User Guide - Primer">an ORE Primer</a> (formerly called the data model overview)</li><li>Introduction of a <a href="http://www.openarchives.org/ore/0.3/http" title="ORE User Guide - HTTP Implementation and Multiple Serializations">document describing implementation issues regarding HTTP URIs for Aggregations and Resource Maps</a>.</li><li><a href="http://www.openarchives.org/ore/0.3/rdfsyntax" title="http://www.openarchives.org/ore/0.3/rdfsyntax">RDF serialization document</a>: This document was first made available in alpha 0.2. It deals with RDF/XML but also RDFa that can, for example be used, in splash pages.</li></ul><p>Comments are still welcome, and the <a href="http://groups.google.com/group/oai-ore" title="OAI-ORE discussion list on Google Groups">OAI-ORE discussion list</a> has been quite active the past few weeks.  The document editors are aiming for a beta release of the specification in mid-May.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/oai-ore-03/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>OAI-ORE Open Meeting, March 3 2008, Johns Hopkins University</title><link>http://dltj.org/article/ore-open-meeting/</link> <comments>http://dltj.org/article/ore-open-meeting/#comments</comments> <pubDate>Sat, 10 Nov 2007 01:36:32 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[conference]]></category> <category><![CDATA[metadata]]></category> <category><![CDATA[Object Reuse and Exchange]]></category> <category><![CDATA[Open Repositories 2008]]></category> <category><![CDATA[standards]]></category> <category><![CDATA[web architecture]]></category><guid isPermaLink="false">http://dltj.org/2007/11/ore-open-meeting/</guid> <description><![CDATA[Here is the press release describing the event:FOR IMMEDIATE RELEASE:Open Archives Initiative Announces Public Meeting on March 3, 2008 to Release Object Reuse and Exchange SpecificationsIthaca, NY and Los Alamos, NM, October 31, 2007 &#8211; On March 3, 2008 the &#8230; <a href="http://dltj.org/article/ore-open-meeting/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/11/ore-open-meeting/"></abbr><p>Here is the press release describing the event:</p><blockquote><p>FOR IMMEDIATE RELEASE:<br /><h2>Open Archives Initiative Announces Public Meeting on March 3, 2008 to Release Object Reuse and Exchange Specifications</h2><br /><i>Ithaca, NY and Los Alamos, NM, October 31, 2007</i> &#8211; On March 3, 2008 the Open Archives Initiative (OAI) will hold a public meeting at Johns Hopkins University in Baltimore, MD to introduce the Object Reuse and Exchange (ORE) specifications. The ORE specifications are developed in response to a significant challenge that has emerged in eScholarship.  In contrast to the paper publications of traditional scholarship, or even their digital counterparts, the artifacts of eScholarship are complex aggregations.  These aggregations consist of multiple resources with varying media types, semantics types, network locations, and intra- and inter-relationships. The future scholarly communication, research, and higher education infrastructure requires standardized approaches to identify, describe, and exchange these new outputs of scholarship.</p><p>The ORE specifications address this challenge with the ORE data model that defines how to associate an identifier, a URI, with aggregations of web resources.  By referring to these identifiers, aggregations can then be linked to, cited, and described with metadata, in the same manner as any web resource.  The ORE data model also makes it possible to describe the structure and semantics of these aggregations. The ORE specifications define how these descriptions can then be packaged in the XML-based Atom syndication format or in RDF/XML, making them available to a variety of applications.</p><p>In addition to their utility in eScholarship, the ORE specifications also apply to our everyday web use where we often encounter aggregations such as multi-page HTML documents, and collections of multi-format images on sites like flickr.  OAI-ORE descriptions of these aggregations can be used to improve search engine behavior, provide input for browser-based navigation tools, and develop automated web services to analyze and preserve this information.</p><p>The March 3 meeting at Hopkins is intended for information managers and strategists, and implementers of networked information systems.  It will be led by the two coordinators of OAI- ORE, Carl Lagoze of Cornell University and Herbert Van de Sompel of Los Alamos National Laboratory.  Attendees will learn about the ORE data model.  They will also learn about the translation of this data model to the XML-based ATOM syndication format.  In addition, they will hear the results of initial experiments with the specifications by OAI-ORE community members. There will be ample time for discussion and questions and to meet other members of the OAI- ORE community.  Detailed information for the meeting is at the registration page at <a href="http://www.regonline.com/oai-ore" title="ORE Open Meeting Registration">http://www.regonline.com/oai-ore</a> (NOTE: attendees must register in advance and attendance is limited).  A subsequent meeting with similar content will be held in the UK in connection with the <a href="http://or08.ecs.soton.ac.uk/" title="Open Repositories 2008">Open Repositories 2008 Conference</a>. An announcement will be made when details are settled.</p><p><b>About the Open Archives Initiative:</b> The Open Archives Initiative (OAI) develops and promotes interoperability standards that aim to facilitate the efficient dissemination, sharing, and reuse of web-based content.  OAI-ORE work is supported by the Andrew W. Mellon Foundation, Microsoft Corporation, and the National Science Foundation (IIS-0430906).  More information is available at <a href="http://www.openarchives.org/" title="Open Archives Initiative">http://www.openarchives.org</a>.</p></blockquote><div class="vevent" id="hcalendar-OAI-ORE-Open-Meeting"> <img src="http://cdn.dltj.org/wp-content/uploads/2007/11/microformat_hcalendar.png" alt="hCalendar Encoded Microformat" width="80" height="15" style="float: left; padding: 0.5em 1.5em 3em 0" /><a class="url" href="http://www.openarchives.org/ore/documents/ore-hopkins-press-release.pdf" title="Press Release for ORE Open Meeting"><abbr class="dtstart" title="20080303">March 3th, 2008</abbr> &mdash; <span class="summary">OAI-ORE Open Meeting</span> at <span class="location">Johns Hopkins University</span></a></p><div class="description">On March 3, 2008 the Open Archives Initiative (OAI) will hold a public meeting at Johns Hopkins University in Baltimore, MD to introduce the Object Reuse and Exchange (ORE) specifications.</div></div>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/ore-open-meeting/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>&#8220;Using Access Data for Paper Recommendations&#8221;</title><link>http://dltj.org/article/using-co-access/</link> <comments>http://dltj.org/article/using-co-access/#comments</comments> <pubDate>Sat, 28 Apr 2007 02:46:28 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[arXiv]]></category> <category><![CDATA[digital libraries]]></category> <category><![CDATA[Joint Conference on Digital Libraries 2007]]></category><guid isPermaLink="false">http://dltj.org/2007/04/using-co-access/</guid> <description><![CDATA[Here is a pair of papers that I&#8217;d like a chance to digest at some point. The first is &#8220;Recommending Related Papers Based on Digital Library Access Records&#8221; by Stefan Pohl, Filip Radlinski, and Thorsten Joachims. According to the notes &#8230; <a href="http://dltj.org/article/using-co-access/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/04/using-co-access/"></abbr><p>Here is a pair of papers that I&#8217;d like a chance to digest at some point.  The first is <a href="http://arxiv.org/abs/0704.2902" title="Recommending Related Papers Based on Digital Library Access Records | arXiv">&#8220;Recommending Related Papers Based on Digital Library Access Records&#8221;</a> by <a href="http://arxiv.org/find/cs/1/au:+Pohl_S/0/1/0/all/0/1" title="arXiv.org Search Results for Stefan Pohl">Stefan Pohl</a>, <a href="http://arxiv.org/find/cs/1/au:+Radlinski_F/0/1/0/all/0/1" title="arXiv.org Search Results for Filip Radlinski">Filip Radlinski</a>, and <a href="http://arxiv.org/find/cs/1/au:+Joachims_T/0/1/0/all/0/1" title="arXiv.org Search Results for Thorsten Joachims">Thorsten Joachims</a>.  According to the notes on the paper, it is to appear in proceedings of JCDL&#8217;07.  The abstract:</p><blockquote><p>An important goal for digital libraries is to enable researchers to more easily explore related work. While citation data is often used as an indicator of relatedness, in this paper we demonstrate that digital access records (e.g. http-server logs) can be used as indicators as well. In particular, we show that measures based on co-access provide better coverage than co-citation, that they are available much sooner, and that they are more accurate for recent papers.</p></blockquote><p>This is a two-page summary, with the meatier version being <a href="http://arxiv.org/abs/0704.2963" title="Using Access Data for Paper Recommendations on ArXiv.org | arXiv">&#8220;Using Access Data for Paper Recommendations on ArXiv.org&#8221;</a>, a masters thesis written by Stefan Pohl.  This one is about 70 pages. The abstract is:</p><blockquote><p>This thesis investigates in the use of access log data as a source of information for identifying related scientific papers. This is done for arXiv.org, the authority for publication of e-prints in several fields of physics.</p><p>Compared to citation information, access logs have the advantage of being immediately available, without manual or automatic extraction of the citation graph. Because of that, a main focus is on the question, how far user behavior can serve as a replacement for explicit meta-data, which potentially might be expensive or completely unavailable. Therefore, we compare access, content, and citation-based measures of relatedness on different recommendation tasks. As a final result, an online recommendation system has been built that can help scientists to find further relevant literature, without having to search for them actively.</p></blockquote><p>Stefan&#8217;s work would seem to bring the old adage &#8220;Do as I Do (access), Not as I Say (cite)&#8221; to bear on information retrieval.  More fluid and dynamic than using PageRank &mdash; or web citation &mdash; &#8220;co-access&#8221; would seem to allow the wisdom of the crowds to become more apparent.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/using-co-access/feed/</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>A Report on Namespaces Used by OAI-PMH Repositories</title><link>http://dltj.org/article/oai-pmh-namespaces/</link> <comments>http://dltj.org/article/oai-pmh-namespaces/#comments</comments> <pubDate>Tue, 20 Mar 2007 21:00:43 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[Raw Technology]]></category> <category><![CDATA[digital libraries]]></category> <category><![CDATA[Dublin Core]]></category> <category><![CDATA[libraries]]></category> <category><![CDATA[MARC]]></category> <category><![CDATA[metadata]]></category> <category><![CDATA[oai-pmh]]></category> <category><![CDATA[standards]]></category><guid isPermaLink="false">http://dltj.org/2007/03/oai-pmh-namespaces/</guid> <description><![CDATA[I had a need for a survey of the metadata namespaces used by OAI-PMH repositories, so I wrote up a quick shell script and XSLT style sheet to parse through the list of Registered Data Providers at the OpenArchives.org website. &#8230; <a href="http://dltj.org/article/oai-pmh-namespaces/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/03/oai-pmh-namespaces/"></abbr><p>I had a need for a survey of the metadata namespaces used by <a href="http://www.openarchives.org/pmh/" title="Open Archives Initiative Protocol for Metadata Harvesting homepage">OAI-PMH</a> repositories, so I wrote up a quick shell script and XSLT style sheet to parse through the list of <a href="http://www.openarchives.org/Register/BrowseSites" title="Registered OAI-PMH Data Providers">Registered Data Providers</a> at the OpenArchives.org website.  The <span class="removed_link" title="http://dltj.org/misc/oai-pmh-namespace-report.html">results of this effort</span> are pretty interesting.  Some of them:</p><ul><li>Dublin Core is, as you would expect, the highest-used descriptive metadata standard.  Every service &mdash; or at least those that reported using any namespace at all &mdash; reported Dublin Core as a record harvesting option.  For some, it was the <em>only</em> option (which I find rather sad).  One problem, though, comes in with the variety of namespace URIs declared that all appear to be semantically the same thing: <tt>http://www.openarchives.org/OAI/2.0/oai_dc/</tt>, <tt>http://www.openarchives.org/OAI/2.0/oai_dc</tt> (note the missing trailing slash), <tt>http://purl.org/dc/elements/2.0/</tt> (used exclusively by the <span class="removed_link" title="http://www.umi.com/umi/digitalcommons/">ProQuest Digital Commons product</span>, it would seem), and <tt>http://purl.org/dc/elements/1.1/</tt> (the difference between 2.0 and 1.1 is not clear to me).  In order to be processable, there must be an exact string match of the namespace URI &#8212; so even that missing trailing slash is significant!</li><li>The next most popular namespace URI is <tt>http://info.internet.isi.edu:80/in-notes/rfc/files/rfc1807.txt</tt>, which semantically would seem to identify the <a href="http://www.faqs.org/rfcs/rfc1807.html" title="RFC 1807 (rfc1807) - A Format for Bibliographic Records">IETF RFC 1807 on a Format for Bibliographic Records</a>.  You can <a href="http://etd.library.pitt.edu/ETD-db/NDLTD-OAI2/oai.pl?verb=GetRecord&#038;metadataPrefix=oai_rfc1807&#038;identifier=oai%3APITETD%3Aetd-11272006-155805" title="">see what one of these things looks like</a> &#8212; although RFC1807 predates XML (it was approved by the IETF in mid-1995), it looks like someone turned the metadata format into XML along the way.  Very interesting&#8230;</li><li>The next most popular is <tt>http://www.ndltd.org/standards/metadata/etdms/1.0/</tt> &mdash; corresponding to the <a href="http://www.ndltd.org/standards/metadata/etd-ms-v1.00-rev2.html/" title="Networked Digital Library of Theses and Dissertations Metadata Standard">Interoperability Metadata Standard for Electronic Theses and Dissertations</a> &mdash; followed closely by <tt>http://www.openarchives.org/OAI/1.1/oai_marc</tt> &mdash; which <a href="http://www.openarchives.org/OAI/2.0/guidelines-marcxml.htm" title="OAI-PMH Implementation Guidelines: A recommended XML Schema to represent MARC21 records">fell out of favor years ago</a> with the publication of <a href="http://www.loc.gov/standards/marcxml/" title="MARC 21 XML Schema">MARC21</a> by the Library of Congress (which goes by the namespace <tt>http://www.loc.gov/MARC21/slim</tt>).  Unfortunately, it doesn&#8217;t seem to have been picked up by the majority of OAI-PMH data providers that used the older oai_marc schema.</li><li>As you get towards the bottom of the first list, there are all sorts of interesting variants on qualified Dublin Core and other one-off schemas.</li></ul><p>Your thoughts and observations?  I&#8217;ve filed away the UNIX script and XSLT style sheet.  If there is interest in seeing something like this in the future, let me know and I can dig them out.<p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://dltj.org/misc/oai-pmh-namespace-report.html on December 30th, 2010.</p><p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://www.umi.com/umi/digitalcommons/ on January 19th, 2011.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://www.ndltd.org/standards/metadata/current.html to http://www.ndltd.org/standards/metadata/etd-ms-v1.00-rev2.html/ on January 19th, 2011.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/oai-pmh-namespaces/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>The Intersection of the Web Architecture with Scholarly Communication</title><link>http://dltj.org/article/ore-model-services/</link> <comments>http://dltj.org/article/ore-model-services/#comments</comments> <pubDate>Fri, 16 Feb 2007 20:01:32 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[Raw Technology]]></category> <category><![CDATA[digital libraries]]></category> <category><![CDATA[library 2.0]]></category> <category><![CDATA[Object Reuse and Exchange]]></category> <category><![CDATA[web architecture]]></category><guid isPermaLink="false">http://dltj.org/2007/02/ore-model-services/</guid> <description><![CDATA[Two previous posts on dltj.org have described the OAI Object Reuse and Exchange (ORE) project and the theory behind what has become known as the &#8216;Web Architecture&#8217;. These two areas meet up now in this post which describes the issues &#8230; <a href="http://dltj.org/article/ore-model-services/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/02/ore-model-services/"></abbr><p>Two previous posts on <i>dltj.org</i> have described the <a href="http://dltj.org/2007/02/ore-introduction/">OAI Object Reuse and Exchange (ORE) project</a> and the <a href="http://dltj.org/2007/02/web-architecture/">theory behind what has become known as the &#8216;Web Architecture&#8217;</a>.  These two areas meet up now in this post which describes the issues surrounding the raw Web Architecture as applied to a web of scholarly communication and a basic outline of what the ORE project hopes to accomplish.</p><p><h2>Problems With the Web Architecture</h2></p><p>The concepts behind the Web Architecture are clearly successful.  I believe it is safe to assert that the genius behind the creation of Tim Berners-Lee and his colleagues is the simplicity with which the vast web of world wide connections has sprung into existence with relatively little coordination.  That said, some of the fundamental concepts behind the Web Architecture do not fit well with the web of interactions known as &#8220;scholarly communication.&#8221;</p><p>The first issue is aggregation.  The Web Architecture does not provide a way to describe a finite set of Resources and relationships as a citable complex digital object resource structure.  As scholarly communication becomes more than just papers &mdash; it can also now include data sets, supplementary graphics, primary source material as well as references to previously publish objects &mdash; this concept of aggregation becomes important.</p><p>Second, the relationships between Resources are usually untyped and link type ontologies are not well defined.  (Capital-R &#8220;Resources&#8221; carries the meaning of this term as defined by the standards related to the Web Architecture; see the previous posting for a definition and examples.) Is this link within the text of a document a citation?  A data set?  An explanatory graphic?  In general, it is not good practice to try to guess the relationship based on the contents of the URI itself.  In fact, the Web Architecture technical report suggests &#8220;agents making use of URIs SHOULD NOT attempt to infer properties of the referenced resource.&#8221; <sup><a href="http://dltj.org/article/ore-model-services/#footnote_0_184" id="identifier_0_184" class="footnote-link footnote-identifier-link" title="&amp;#8220;Architecture of the World Wide Web, Volume One&amp;#8221; paragraph #98.  Available from http://www.w3.org/TR/2004/REC-webarch-20041215/#p98 Accessed Feb 16 2007.">1</a></sup> In the absence of additional standards (such as OAI-ORE) layered on top of the core Web Architecture, this notion of &#8220;URI Opacity&#8221; encourages independence between an identifier in one document and the Representation of another object.  One thing the ORE work seeks to accomplish is to build a framework for the semantics of links between objects in a scholarly communication environment.</p><p><h2>The Problems From a Scholarly Communication Perspective</h2></p><div style="float: right; margin: 0 0 1em 1.5em; padding 0 0 1em 1.5em; border: 2px solid grey;"><a id="p185" rel="attachment" class="imagelink" href="http://dltj.org/2007/02/ore-model-services/compound-digital-object-modeled-using-the-web-architecture/" title="Compound digital object modeled using the Web Architecture"><img id="image185" style="width: 200px;" src="http://cdn.dltj.org/wp-content/uploads/2007/02/pre-ore-model.gif" alt="Compound digital object modeled using the Web Architecture" /></a></div><p>Take, for example, a paper in an example repository as described by this graphic.  The article, identified by the number &#8220;012345&#8243; has six Resources with five Representations:  an HTML splash page generated by the repository software (Resource #1), the article in PDF format (Resource #2), the article in Postscript format (Resource #4), metadata in Dublin Core XML (Resource #5), metadata in BibTex format (Resource #6), and the article in a format decided by agent/server content negotiation (#3). <sup><a href="http://dltj.org/article/ore-model-services/#footnote_1_184" id="identifier_1_184" class="footnote-link footnote-identifier-link" title="The Web Architecture allows for the Representation of a Resource to be decided through content negotiation between the agent/browser and the server.">2</a></sup>.  Keep in mind that views of digital object must be bound to Resources in order to be reference-able (e.g. they must have URIs).</p><p>Although it may be possible to infer that all six Resources are related by comparing the leading fragment of the URIs, the Web Architecture principle of URI opacity dictates that we shouldn&#8217;t make those assumptions.  Furthermore, even if we could determine that they are related based on examining the URIs, we do not have a consistent vocabulary to <em>define</em> that relationship.  Is &#8220;&#8230;meta/bibtex&#8221; the citation data for <em>this</em> article or is it the <em>list</em> of citations used in the article?</p><p><h2>Modeling Complex Objects</h2></p><div style="float: right; margin: 0 0 1em 1.5em; padding 0 0 1em 1.5em; border: 2px solid grey;"><a id="p186" rel="attachment" class="imagelink" href="http://dltj.org/2007/02/ore-model-services/compound-digital-object-modeled-using-ore-concepts/" title="Compound digital object modeled using ORE concepts"><img id="image186" style="width: 200px;" src="http://cdn.dltj.org/wp-content/uploads/2007/02/ore-model.gif" alt="Compound digital object modeled using ORE concepts" /></a></div><p>Because the Web Architecture does not allow for the definition of a boundary for a compound digital object, the ORE project proposes the definition of a Resource &mdash; called the ORE Model, for lack of a better name at the moment &mdash; that formally expresses a bounded aggregation of resources and relationships that corresponds to a compound digital object.  Put another way, an instantiation of the ORE Model is a map of other resources that expresses the boundaries of the compound digital object.  A URI identifies the compound digital object &mdash; the ORE Resource &mdash; and a service request on that URI returns a Representation that is some serialization of the ORE Model.</p><p>The preliminary version of the model describes two types of relationships:  intra-aggregation relationships (inside the boundaries of the compound digital object) and inter-aggregation relationships (to Resources outside the boundary of this compound digital object).  The intra-aggregation relationships come in two forms:  hasPart (where one Resource contains other Resources, such as books contain chapters or journal issues contain articles) and hasView (where the target Resource is a semantically equivalent presentation format, such as Word and PDF versions of an article).  The inter-aggregation relationship has only one verb, &#8220;hasRelationshipTo,&#8221; which simply means the target of the relationship is considered outside the boundaries of the complex digital object.  From a base verb of &#8220;hasRelationshipTo&#8221; other communities can apply specialized relationships.</p><p>The result describes a connected sub-graph with a finite set of resources and relationships among those resources to form a compound digital object plus relationships to resources that are external to the aggregation.  With that in place, we can consider services that can be applied to portions of the graph.</p><p><h2>ORE Services</h2></p><p>One half of the work of the ORE project is to define a model for compound digital objects in a Web Architecture environment.  The other half of the work is to define the meaning of services that exchange instances of the model to form the basis of a Web Architecture-aware scholarly communication environment.</p><p>Conceived based on the experiences with the OAI Protocol for Metadata Harvesting (PMH), there are three archetypes of services.</p><ul><li>Harvest: a request for a batch of instances that correspond to the ORE model from a set of ORE Resources.</li><li>Obtain: A request for an instance that corresponds to the ORE Model from a specific ORE Resource.</li><li>Register: A request to add new nodes or relationships to an ORE aggregation.</li></ul><p>Service requests against the ORE Resource URI are the access points for these activities.</p><p><h2>For more information&#8230;</h2></p><p>This is a basic introduction to the work of the technical committee so far.  For a more in-depth view into the outcomes of the first face-to-face meeting, including expanded definitions and examples of what was outlined here, see the <a href="http://www.openarchives.org/ore/documents/OAI-ORE-TC-Meeting-200701.pdf" title="">Report of the January 2007 ORE-TC Meeting</a>.  In addition, there is a <a href="http://connect.educause.edu/blog/mpasiewicz/an_interview_with_herbert_van_de_sompel/15519?time=1171490236" title="An Interview with Herbert van de Sompel | EDUCAUSE CONNECT" class="broken_link" rel="nofollow">interview with Herbert van de Sompel</a> recorded at the CNI 2006 Fall Task Force to go along with <a href="http://www.cni.org/tfms/2006b.fall/abstracts/PB-oai-sompel.html" title="Project Briefing-Fall 2006 Task Force Meeting">a project briefing</a> presented at that meeting.  (Keep in mind that these were recorded and presented before the first technical committee meeting, so some of the concepts of the implementation have changed.)  Pete Johnson, a member of the ORE technical committee, posted his thoughts on the topic on his blog: <a href="http://efoundations.typepad.com/efoundations/2007/01/ore.html" title="eFoundations: Prospecting for ORE">Prospecting for ORE</a>, <a href="http://efoundations.typepad.com/efoundations/2007/01/more_ore.html" title="eFoundations: More ORE">More ORE</a>, and <a href="http://efoundations.typepad.com/efoundations/2007/02/more_rumination.html" title="eFoundations: More ruminations on compoundness and complexity (and metadata)">More ruminations on compoundness and complexity (and metadata)</a>.  The <a href="http://www.openarchives.org/ore/documents/OR07.pdf" title="http://www.openarchives.org/ore/documents/OR07.pdf">presentation slides</a> from Carl Lagoze&#8217;s talk at Open Repositories 2007 are also available, of which <a href="http://cwilper.blogspot.com/2007/01/resources-representations-repositories.html" title="Your Metadata Sucks: Resources, Representations, Repositories, and RDF">Chris Wilper</a> and <a href="http://wwmm.ch.cam.ac.uk/blogs/downing/?p=69" title="Unilever Centre for Molecular Informatics, Cambridge - Jim Downing  &amp;raquo; Blog Archive   &amp;raquo; Open Repositories 2007 Plenary Session 5: Interoperability">Jim Downing</a> posed summaries and reactions.  Also keep an eye on the <span class="removed_link" title="http://technorati.com/tag/OAI-ORE">OAI-ORE tag on Technorati</span> for more updates and reactions.<p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://technorati.com/tag/OAI-ORE on January 19th, 2011.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_184" class="footnote">&#8220;Architecture of the World Wide Web, Volume One&#8221; paragraph #98.  Available from <a href="http://www.w3.org/TR/2004/REC-webarch-20041215/#p98" title="Architecture of the World Wide Web, Volume One">http://www.w3.org/TR/2004/REC-webarch-20041215/#p98</a> Accessed Feb 16 2007.</li><li id="footnote_1_184" class="footnote">The Web Architecture allows for the Representation of a Resource to be decided through content negotiation between the agent/browser and the server.</li></ol><div class='series_links'><a href='http://dltj.org/article/web-architecture/' title='Working With the Web Architecture'>Previous in series</a> <a href='http://dltj.org/article/thoughts-on-compound-documents/' title='OAI-ORE Thoughts on Compound Documents'>Next in series</a></div>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/ore-model-services/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>Working With the Web Architecture</title><link>http://dltj.org/article/web-architecture/</link> <comments>http://dltj.org/article/web-architecture/#comments</comments> <pubDate>Fri, 16 Feb 2007 15:29:09 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[Raw Technology]]></category> <category><![CDATA[digital libraries]]></category> <category><![CDATA[Object Reuse and Exchange]]></category> <category><![CDATA[web architecture]]></category> <category><![CDATA[web standards]]></category><guid isPermaLink="false">http://dltj.org/2007/02/web-architecture/</guid> <description><![CDATA[As you may have noticed, the web has evolved a set of common principles that are a mix of ratified standards and ad hoc practices. The notion of a Web Architecture was codified in a W3C technical report called &#8220;Architecture &#8230; <a href="http://dltj.org/article/web-architecture/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/02/web-architecture/"></abbr><p>As you may have noticed, the web has evolved a set of common principles that are a mix of ratified standards and ad hoc practices.  The notion of a Web Architecture was codified in a W3C technical report called &#8220;Architecture of the World Wide Web&#8221; <a href="http://www.w3.org/TR/2004/REC-webarch-20041215/" title="Architecture of the World Wide Web, Volume One">http://www.w3.org/TR/2004/REC-webarch-20041215/</a> or simply &#8216;Web Architecture.&#8217;  Those projects and protocols that align with the &#8216;Web Architecture&#8217; are more likely to be picked up and used than those that do not.  As a result, the <a href="http://www.openarchives.org/ore/" title="Open Archives Initiative Protocol - Object Exchange and Reuse">OAI Object Reuse and Exchange</a> (ORE) project seeks to provide an infrastructure for web-based information systems that exploit and enhance the Web Architecture, and therefore overlay cleanly on the existing web.</p><p>Given that we want to align closely with this &#8216;Web Architecture&#8217; how far does the Web Architecture report go to define what is needed to make an ORE environment happen?  The answer lies in the definition of three terms and the interaction of these three concepts.</p><ul><li>Resource: &#8220;A network data object or service that can be identified by a URI&#8230;. Resources may be available in multiple representations (e.g. multiple languages, data formats, size, and resolutions) or vary in other ways.&#8221; <sup><a href="http://dltj.org/article/web-architecture/#footnote_0_182" id="identifier_0_182" class="footnote-link footnote-identifier-link" title="&amp;#8220;Hypertext Transfer Protocol &amp;#8212; HTTP/1.1&amp;#8243; RFC 2616. Available from http://www.ietf.org/rfc/rfc2616 Accessed Feb 15 2007.">1</a></sup></li><li>Uniform Resource Identifier (URI):  &#8220;A compact string of characters for identifying an abstract or physical resource.&#8221; <sup><a href="http://dltj.org/article/web-architecture/#footnote_1_182" id="identifier_1_182" class="footnote-link footnote-identifier-link" title="&amp;#8220;Uniform Resource Identifiers (URI): Generic Syntax&amp;#8221; RFC 2396. Available from http://www.rfc-editor.org/rfc/rfc2396.txt Accessed Feb 15 2007.">2</a></sup></li><li>Representation: &#8220;An entity included with a response that is subject to content negotiation&#8230;. There may exist multiple representations associated with a particular response status.&#8221; <sup><a href="http://dltj.org/article/web-architecture/#footnote_0_182" id="identifier_2_182" class="footnote-link footnote-identifier-link" title="&amp;#8220;Hypertext Transfer Protocol &amp;#8212; HTTP/1.1&amp;#8243; RFC 2616. Available from http://www.ietf.org/rfc/rfc2616 Accessed Feb 15 2007.">1</a></sup></li></ul><div style="float: right; margin: 0 0 1em 1.5em; padding 0 0 1em 1.5em; border: 2px solid grey;"><a id="p183" rel="attachment" class="imagelink" href="http://dltj.org/2007/02/web-architecture/illustration-shows-the-relationship-between-identifier-resource-and-representation/" title="Illustration shows the relationship between identifier, resource, and representation."><img id="image183" style="width: 200px;" src="http://cdn.dltj.org/wp-content/uploads/2007/02/uri-res-rep.png" alt="Illustration shows the relationship between identifier, resource, and representation." /></a></div><p>This is perhaps best explained by this graphic from the &#8220;Architecture of the World Wide Web&#8221; document.  All three terms are included:  a URI identifies a resource which is in turn expressed as one representation.  The key part of how the web works, though, lies in the definition of &#8220;representation&#8221; &mdash; <em>that there may exist multiple representations</em> for a single URI.  Believe it or not, you already know this.  The representation of the resource identified by the URI &#8216;<tt>cnn.com</tt>&#8216; at noon today is different from the one that existed at noon yesterday.  You might say, &#8220;well so what&#8230;it is a dynamic website,&#8221; and I would agree &mdash; what is important here is that the web architecture does not give you a way to identify with a URI that representation of the <tt>cnn.com</tt> resource at noon yesterday.  Put another way, in the words of the Web Architecture technical report, &#8220;Agents [web browsers and the like] may use a URI to access the referenced resource; this is called dereferencing the URI.&#8221; <sup><a href="http://dltj.org/article/web-architecture/#footnote_2_182" id="identifier_3_182" class="footnote-link footnote-identifier-link" title="&amp;#8220;Architecture of the World Wide Web, Volume One&amp;#8221; paragraph #117.  Available from http://www.w3.org/TR/2004/REC-webarch-20041215/#p117 Accessed Feb 15 2007.">3</a></sup> The representation comes into being as a result of a service request by an agent for a resource via a URI.</p><p>The Web Architecture technical report lists four factors that determine which representation(s) are retrieved as a result of a service request: <sup><a href="http://dltj.org/article/web-architecture/#footnote_3_182" id="identifier_4_182" class="footnote-link footnote-identifier-link" title="&amp;#8220;Architecture of the World Wide Web, Volume One&amp;#8221; paragraph #122.  http://www.w3.org/TR/2004/REC-webarch-20041215/#p117 Accessed Feb 15 2007.">4</a></sup></p><ol><li>Whether the URI owner makes available any representations at all;</li><li>Whether the agent making the request has access privileges for those representations&#8230;;</li><li>If the URI owner has provided more than one representation (in different formats such as HTML, PNG, or RDF; in different languages such as English and Spanish; or transformed dynamically according to the hardware or software capabilities of the recipient), the resulting representation may depend on negotiation between the user agent and server.</li><li>The time of the request; the world changes over time, so representations of resources are also likely to change over time.</li></ol><p>When a URI is accessed by a browser, one goes through a content negotiation to get a representation.  Representations may vary by device or time or IP address or authorization or any number of factors.  In a graph or type-based thinking, a resource is a first class object:  it is linkable &mdash; one can cite a resource. Representations, on the other hand, are second class objects:  identified only in the context of a resource. A representation is not linkable, there may be many representations per resource, and a representation only comes about as a result of an action.</p><p><h2>Observations</h2><br />This notion of the &#8216;Web Architecture&#8217; is clearly dominant now, so what does the Web Architecture &mdash; resources, URIs, and representations &mdash; mean in the context of the OAI Object Reuse and Exchange work?  One would be well advised to use its existing capabilities where they are appropriate and build specialized extensions that sit on top in such a way as to not contradict its fundamental aspects.  This means cleanly layering new capabilities that meet the needs of our problem space.  In a subsequent posting, I&#8217;ll outline the need for some <a href="http://dltj.org/2007/02/ore-model-services">ORE-specific extensions to the Web Architecture</a>.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_182" class="footnote">&#8220;Hypertext Transfer Protocol &#8212; HTTP/1.1&#8243; RFC 2616. Available from <a href="http://www.ietf.org/rfc/rfc2616" title="RFC 2616 &#039;Hypertext Transfer Protocol -- HTTP/1.1&#039;">http://www.ietf.org/rfc/rfc2616</a> Accessed Feb 15 2007.</li><li id="footnote_1_182" class="footnote">&#8220;Uniform Resource Identifiers (URI): Generic Syntax&#8221; RFC 2396. Available from <a href="http://cdn.dltj.org/wp-content/uploads/2007/02/rfc2396.txt.gzip" title="RFC 2396 &#039;Uniform Resource Identifiers (URI): Generic Syntax&#039;">http://www.rfc-editor.org/rfc/rfc2396.txt</a> Accessed Feb 15 2007.</li><li id="footnote_2_182" class="footnote">&#8220;Architecture of the World Wide Web, Volume One&#8221; paragraph #117.  Available from <a href="http://www.w3.org/TR/2004/REC-webarch-20041215/#p117" title="Architecture of the World Wide Web, Volume One">http://www.w3.org/TR/2004/REC-webarch-20041215/#p117</a> Accessed Feb 15 2007.</li><li id="footnote_3_182" class="footnote">&#8220;Architecture of the World Wide Web, Volume One&#8221; paragraph #122. <a href="http://www.w3.org/TR/2004/REC-webarch-20041215/#p117" title="Architecture of the World Wide Web, Volume One">http://www.w3.org/TR/2004/REC-webarch-20041215/#p117</a> Accessed Feb 15 2007.</li></ol><div class='series_links'><a href='http://dltj.org/article/ore-introduction/' title='Introducing the OAI Object Reuse and Exchange Initiative'>Previous in series</a> <a href='http://dltj.org/article/ore-model-services/' title='The Intersection of the Web Architecture with Scholarly Communication'>Next in series</a></div>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/web-architecture/feed/</wfw:commentRss> <slash:comments>5</slash:comments> </item> <item><title>Introducing the OAI Object Reuse and Exchange Initiative</title><link>http://dltj.org/article/ore-introduction/</link> <comments>http://dltj.org/article/ore-introduction/#comments</comments> <pubDate>Tue, 13 Feb 2007 03:49:20 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Linking Technologies]]></category> <category><![CDATA[digital libraries]]></category> <category><![CDATA[Object Reuse and Exchange]]></category> <category><![CDATA[standards]]></category><guid isPermaLink="false">http://dltj.org/2007/02/ore-introduction/</guid> <description><![CDATA[In the past few months a new group has formed to tackle the problem of representing and exchanging complex digital objects in a web-based environment. I am proud to serve on the technical committee for this group and over the &#8230; <a href="http://dltj.org/article/ore-introduction/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/02/ore-introduction/"></abbr><p>In the past few months a new group has formed to tackle the problem of representing and exchanging complex digital objects in a web-based environment.  I am proud to serve on the technical committee for this group and over the next few postings I&#8217;m aiming to introduce the library community to the work of the Open Archives Initiative Object Exchange and Reuse group and seek the feedback of the wisdom of this crowd.</p><p><h2>Vision and Scope</h2></p><p><a href="http://www.openarchives.org/ore/" title="Open Archives Initiative - Object Exchange and Reuse home page">OAI Object Reuse and Exchange</a> (ORE) is a new effort conducted under the umbrella of the Open Archives Initiative.  The summary vision statement is to develop, identify, and profile extensible standards and protocols to allow repositories, agents, and services to interoperate in the context of use and reuse of compound digital objects beyond the boundaries of the holding repositories.  A key aspect of this statement is that it refers to working with objects, not about metadata only.  In that way, the ORE work is set apart from the previous OAI work, the <a href="http://www.openarchives.org/pmh/" title="Open Archives Initiative Protocol for Metadata Harvesting home page">Protocol for Metadata Harvesting</a> (PMH).</p><p>The aim of the ORE effort is to promote (through creation or endorsement) effective and consistent mechanisms:</p><ul><li>to facilitate discovery of compound digital objects;</li><li>to reference (or &#8216;link to&#8217;) these objects (as well as parts thereof);</li><li>to obtain a variety of disseminations of these objects;</li><li>to aggregate and disaggregate objects; and</li><li>to enable processing of objects by automated agents.</li></ul><p>Although these mechanisms may apply to more general web activities, the use cases we are working from are firmly bounded to the needs of the academic community.  Generally speaking, those use cases seek to establish the basis for a digital scholarly communication system composed of two types of systems:  a) applications that manage content (such as institutional repositories); and b) applications that leverage managed content (such as search engines, personal productivity tools, and data and text analysis services).  Of course, other application domains are possible, but like the initial OAI-PMH, the intent is to start with a domain with which we are familiar with an eye towards more general applications as appropriate.</p><p><h2>Compound Digital Objects</h2></p><p>Key to understanding the ORE vision and scope is a definition of the phrase &#8220;compound digital object.&#8221;  In the case, a compound digital object is content with multiple components that vary on:</p><ul><li>Content, or semantic, types (including: text, datasets, simulations, software, dynamic knowledge representations, machine readable chemical structures, bibliographic and other types of metadata);</li><li>Media types (including IANA registered MIME types and other type/format registries such as GDFR);</li><li>Network locations (including content from institutional repositories, scientific data repositories, social networking sites and the general web); and</li><li>Relationships (where the digital object is part of a complex graph of objects related by lineage, versions, and derivations).</li></ul><p>In the abstract, this definition is understandably hard to grasp.  There are some conceptual examples the technical committee uses to keep focus on the task at hand.  One example is a paper in the arXiv repository with different disseminations.  Although the primary artifact in this example might be simply the paper itself, in even this case there is a compound digital object surrounding that artifact with components that represent the paper in PDF and Postscript formats, Dublin Core descriptive metadata, and an HTML &#8220;splash page&#8221; for the paper.  Another example is that of an issue of an overlay journal built from distributed ePrints from different repositories.  The nature of &#8220;the paper&#8221; itself is changing: in e-science the text of the paper is combined with data sets and simulations; in e-humanities, the text of the paper is combined with primary content (such as scanned items) and the scholar&#8217;s derived content.</p><p><h2>Description of Work</h2></p><p>So an accurate perspective on the OAI-ORE work is that it seeks to enrich the content sharing landscape.  ORE is about enabling digital objects to float between systems that manage content and systems that leverage managed content.  In the first category are applications such as institutional repositories, research-group and managed personal (ePortfolio) repositories, discipline-oriented repositories, publisher repositories, dataset repositories, cultural heritage repositories, learning object repositories, and digitized book and manuscript collections.  In the second category are applications such as search engines, authoring tools, citation management, collaborative environments, social network applications, data/text mining applications, relationship graph analysis tools, preservation services, workflow tools, and report generation tools.</p><p>A key point to remember is that OAI-ORE is not necessarily about transferring the digital assets from one system to another.  It is the goal of the technical work to enable new, complex objects to be built without necessarily transferring all of the component parts from disparate content repositories to a single system.  (Reflect for a moment on the overlay journal concept &#8212; the papers that make up the issue of the overlay journal could certainly remain dispersed in the repositories where they are originally located; the overlay journal pulls together the contents in a virtual construct that represents an issue of that overlay journal.)  In some use cases, transfer of the digital object content is required; preservation mirroring is one such example.  In many cases, however, full transfer is not permitted (by terms of use), impractical (as in a dataset that is terabytes in size) or simply superfluous.</p><p>At its first meeting, the technical committee identified some motivating use cases to guide our work.  At this point they are little more than general statements of the activities that can be done better with an ORE framework in place.  Over the course of the next few weeks the technical committee will elaborate on these general statements and turn them into stories.</p><ul><li>Find, collect, analyze, relate, and publish data-oriented scholarly objects</li><li>Preserve compound digital objects</li><li>Remote submission of compound digital objects</li><li>Citation management</li><li>Object equivalence recognition (de-duping) to aid resource discovery</li><li>Graph-based quality assessment of data-centric scholarship</li></ul><p><h2>OAI-ORE Project Organization</h2></p><p>The character of the project organization is similar to that of the Protocol for Metadata Harvesting effort.  Carl Lagoze (Cornell University) and Herbert Van de Sompel (Los Alamos National Laboratory) are the principle investigators on the project.  They coordinate the efforts of an international group of volunteers that form an Advisory Committee, a Technical Committee (of which I am a member) and a Liaison Committee.  The membership lists for these committees are available on the <a href="http://www.openarchives.org/ore/" title="Open Archives Initiative Protocol - Object Exchange and Reuse">OAI-ORE website</a>.  It is worth noting that the participants are not exclusively from the library domain.  In particular, there is an emphasis not just on text/image/video objects but also scientific data objects.  The <a href="http://www.openarchives.org/ore/documents/ORE-Announcement.html" title="For immediate release October 13">Andrew W. Mellon Foundation is funding the work</a> for a 24-month period that began in October 2006 with additional support from the National Science Foundation.</p><p>The impetus behind the OAI-ORE effort was a meeting in April 2006 of representatives from institutional repository projects, scholarly content repositories, registry projects, and various other projects that touch on interoperability.  See <a href="http://msc.mellon.org/Meetings/Interop/" title="http://msc.mellon.org/Meetings/Interop/">http://msc.mellon.org/Meetings/Interop/</a> or more information.</p><p>The ORE work does not imply that the OAI-PMH specification is being dropped or replaced.  OAI-PMH will continue to exist as one approach to interoperability.  OAI-ORE will complement OAI-PMH when richer functionality is desired as part of a multi-level interoperability stack. In fact, one might consider OAI-ORE to be resource centric in contrast to OAI-PMH&#8217;s metadata-centric approach.</p><p>The technical committee has met once (the <a href="http://www.openarchives.org/ore/documents/OAI-ORE-TC-Meeting-200701.pdf" title="http://www.openarchives.org/ore/documents/OAI-ORE-TC-Meeting-200701.pdf">report of the meeting</a> is available from the ORE website) and will be conducting a conference call this week leading up to a second face-to-face meeting in May.  Right now we are fleshing out the use cases as a tool for testing models that we create or adapt from other uses.  We want to make sure what we&#8217;re specifying will really work in our application domain.  Once we have a good sense of what a model of the ORE scope of work entails, we&#8217;ll review existing related technologies with the intent of adapting what is currently available to meet the needs of the ORE model and only creating new specifications and protocols when it is really necessary.  Some early candidates of related technologies are OAI-PMH, RSS/ATOM, OpenURL, and METS/DIDL.</p><p>The technical committee has agreed amongst itself to use a tag of &#8216;oaiore&#8217; in the various social web tools (<span class="removed_link" title="http://technorati.com/tag/oaiore">Technorati</span>, <a href="http://www.connotea.org/tag/oaiore" title="Pages tagged with &amp;quot;oaiore&amp;quot; on Connotea">Connotea</a>, and <a href="http://del.icio.us/tag/oaiore" title="Pages tagged with &amp;quot;oaiore&amp;quot; on del.icio.us">del.icio.us</a> for example) as a way to co-locate material on this topic.  Others are encouraged to do the same.  Fellow technical committee member <a href="http://efoundations.typepad.com/efoundations/2007/01/ore.html" title="eFoundations: Prospecting for ORE">Pete Johnson</a> (with <a href="http://efoundations.typepad.com/efoundations/2007/01/more_ore.html" title="eFoundations: More ORE">follow up</a>) has already started a conversation, and you can listen to <a href="http://connect.educause.edu/blog/mpasiewicz/an_interview_with_herbert_van_de_sompel/15519" title="An Interview with Herbert van de Sompel via EDUCAUSE CONNECT" class="broken_link" rel="nofollow">an interview with Herbert van de Sompel</a> via EDUCAUSE.</p><p>In a subsequent postings, I&#8217;ll go into some detail about the <a href="http://dltj.org/2007/02/web-architecture/">inner workings of &#8220;The Web Architecture&#8221;</a> and how it is both <a href="http://dltj.org/2007/02/ore-model-services">a help and a hindrance to the interaction of compound digital objects in our domain</a>, and how it is a force too powerful to be ignored in either case.<p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://technorati.com/tag/oaiore on January 19th, 2011.</p><div class='series_links'> <a href='http://dltj.org/article/web-architecture/' title='Working With the Web Architecture'>Next in series</a></div>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/ore-introduction/feed/</wfw:commentRss> <slash:comments>3</slash:comments> </item> </channel> </rss>
<!-- Served from: dltj.org @ 2012-02-11 12:23:22 by W3 Total Cache -->
