Jester's Cap

Disruptive Library Technology Jester

We're Disrupted, We're Librarians, and We're Not Going to Take It Anymore

Main menu

Skip to primary content
Skip to secondary content
  • About the Blog
  • About the Author
  • About the Tagline
  • Comment Policy
  • Contact

Post navigation

← Previous Next →

A Report on Namespaces Used by OAI-PMH Repositories

Posted on March 20, 2007 by Peter Murray
This entry was posted in Linking Technologies, Raw Technology and tagged digital libraries, Dublin Core, libraries, MARC, metadata, oai-pmh, standards by Peter Murray. Bookmark the permalink.

I had a need for a survey of the metadata namespaces used by OAI-PMH repositories, so I wrote up a quick shell script and XSLT style sheet to parse through the list of Registered Data Providers at the OpenArchives.org website. The results of this effort are pretty interesting. Some of them:

  • Dublin Core is, as you would expect, the highest-used descriptive metadata standard. Every service — or at least those that reported using any namespace at all — reported Dublin Core as a record harvesting option. For some, it was the only option (which I find rather sad). One problem, though, comes in with the variety of namespace URIs declared that all appear to be semantically the same thing: http://www.openarchives.org/OAI/2.0/oai_dc/, http://www.openarchives.org/OAI/2.0/oai_dc (note the missing trailing slash), http://purl.org/dc/elements/2.0/ (used exclusively by the ProQuest Digital Commons product, it would seem), and http://purl.org/dc/elements/1.1/ (the difference between 2.0 and 1.1 is not clear to me). In order to be processable, there must be an exact string match of the namespace URI — so even that missing trailing slash is significant!
  • The next most popular namespace URI is http://info.internet.isi.edu:80/in-notes/rfc/files/rfc1807.txt, which semantically would seem to identify the IETF RFC 1807 on a Format for Bibliographic Records. You can see what one of these things looks like — although RFC1807 predates XML (it was approved by the IETF in mid-1995), it looks like someone turned the metadata format into XML along the way. Very interesting…
  • The next most popular is http://www.ndltd.org/standards/metadata/etdms/1.0/ — corresponding to the Interoperability Metadata Standard for Electronic Theses and Dissertations — followed closely by http://www.openarchives.org/OAI/1.1/oai_marc — which fell out of favor years ago with the publication of MARC21 by the Library of Congress (which goes by the namespace http://www.loc.gov/MARC21/slim). Unfortunately, it doesn’t seem to have been picked up by the majority of OAI-PMH data providers that used the older oai_marc schema.
  • As you get towards the bottom of the first list, there are all sorts of interesting variants on qualified Dublin Core and other one-off schemas.

Your thoughts and observations? I’ve filed away the UNIX script and XSLT style sheet. If there is interest in seeing something like this in the future, let me know and I can dig them out.

The text was modified to remove a link to http://dltj.org/misc/oai-pmh-namespace-report.html on December 30th, 2010.

The text was modified to remove a link to http://www.umi.com/umi/digitalcommons/ on January 19th, 2011.

The text was modified to update a link from http://www.ndltd.org/standards/metadata/current.html to http://www.ndltd.org/standards/metadata/etd-ms-v1.00-rev2.html/ on January 19th, 2011.

(This post was updated on 19-Jan-2011.)

Links in "A Report on Namespaces Used by OAI-PMH Repositories"

Tags for "A Report on Namespaces Used by OAI-PMH Repositories"

Find Related Content:within DLTJTechnoratidel.icio.usWikipedia
digital librariesFind posts tagged 'digital libraries' in DLTJFind posts tagged 'digital libraries' in TechnoratiFind posts tagged 'digital libraries' in del.icio.usFind posts tagged 'digital libraries' in Wikipedia (English)
Dublin CoreFind posts tagged 'Dublin Core' in DLTJFind posts tagged 'Dublin Core' in TechnoratiFind posts tagged 'Dublin Core' in del.icio.usFind posts tagged 'Dublin Core' in Wikipedia (English)
librariesFind posts tagged 'libraries' in DLTJFind posts tagged 'libraries' in TechnoratiFind posts tagged 'libraries' in del.icio.usFind posts tagged 'libraries' in Wikipedia (English)
MARCFind posts tagged 'MARC' in DLTJFind posts tagged 'MARC' in TechnoratiFind posts tagged 'MARC' in del.icio.usFind posts tagged 'MARC' in Wikipedia (English)
metadataFind posts tagged 'metadata' in DLTJFind posts tagged 'metadata' in TechnoratiFind posts tagged 'metadata' in del.icio.usFind posts tagged 'metadata' in Wikipedia (English)
oai-pmhFind posts tagged 'oai-pmh' in DLTJFind posts tagged 'oai-pmh' in TechnoratiFind posts tagged 'oai-pmh' in del.icio.usFind posts tagged 'oai-pmh' in Wikipedia (English)
standardsFind posts tagged 'standards' in DLTJFind posts tagged 'standards' in TechnoratiFind posts tagged 'standards' in del.icio.usFind posts tagged 'standards' in Wikipedia (English)

Related Posts on Disruptive Library Technology Jester

  • Specifications for Object Reuse and Exchange (ORE) published
  • OAI-ORE Open Meeting, March 3 2008, Johns Hopkins University
  • Mashups of Bibliographic Data: A Report of the ALCTS Midwinter Forum
  • Introducing the OAI Object Reuse and Exchange Initiative
  • “Object Reuse and Exchange” Beta Specifications Now Available

Track and Share With Others

• Technorati iconTechnorati Cosmos

• TrackBack URI


Logging In...

Profile cancel

Sign in with Twitter Sign in with Facebook
or

Not published

  • 2 Replies
  • 2 Comments
  • 0 Tweets
  • 0 Facebook
  • 0 Pingbacks
Last reply was 1891 days ago
  1. Sarah Shreeves
    View 1891 days ago

    Have you seen the work that Tom Habing has done at the University of Illinois on a Registry of OAI data providers? He’s added all sorts of interesting reports on OAI data providers and has probably the biggest list of OAI data providers since he pulls in data providers that are not registered at openarchives.org.

    See http://gita.grainger.uiuc.edu/registry/.

    sarah

    Reply
  2. the jester
    View 1891 days ago

    Thank you, Sarah! Tom’s Distinct Metadata Schemas is much more comprehensive, and more useful, than my quick scripting. I’m grateful for the pointer to his work.

    Reply

Home

Search

Recent Posts

  • Unglue.It — a service to crowdsource book licensing fees — launches
  • WorldCat May Become Available as Library Linked Data under ODC-BY
  • But Is It a Library? — Reflections on ‘Little Free Libraries’
  • Thursday Threads: Developer Genders, Facebook Release Engineering, Alcohol Among Technologists
  • Thursday Threads: Open Source Advocates Twitch at Blackboard’s Strategy and Effect of Copyright/DRM on Access
  • Thursday Threads: Research Works Act, Amazon Kindle Give and Take, OCLC’s Website for Small Libraries

Archives

  • 2012: J F M A M J J A S O N D
  • 2011: J F M A M J J A S O N D
  • 2010: J F M A M J J A S O N D
  • 2009: J F M A M J J A S O N D
  • 2008: J F M A M J J A S O N D
  • 2007: J F M A M J J A S O N D
  • 2006: J F M A M J J A S O N D
  • 2005: J F M A M J J A S O N D

Feeds and Such

  • Link to Podcast (RSS feed) for this blog
    Add Podcast to iTunes subscription
    Receive DLTJ by e-mail:


    Delivered by FeedBurner
  • View Peter Murray's profile on LinkedIn

Copyright

This work by Peter Murray is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States.

Creative Commons License
© 2012 | Theme based on Twenty Eleven by Wordpress.org | DLTJ strives for Standards Compliant XHTML & CSS | RSS Posts & Comments
From the Disruptive Library Technology Jester (http://dltj.org/), printed on Wednesday the 23rd of May 2012 at 7:52:24 PM UTC (+0000). The URL to this page is

[Creative Commons Logo] This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/us/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
This work by Peter Murray is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States.