Jester's Cap

Disruptive Library Technology Jester

We're Disrupted, We're Librarians, and We're Not Going to Take It Anymore

Main menu

Skip to primary content
Skip to secondary content
  • About the Blog
  • About the Author
  • About the Tagline
  • Comment Policy
  • Contact

Post navigation

← Previous Next →

A Report on Namespaces Used by OAI-PMH Repositories

Posted on March 20, 2007 by Peter Murray
This entry was posted in Linking Technologies, Raw Technology and tagged digital libraries, Dublin Core, libraries, MARC, metadata, oai-pmh, standards by Peter Murray. Bookmark the permalink.

I had a need for a survey of the metadata namespaces used by OAI-PMH repositories, so I wrote up a quick shell script and XSLT style sheet to parse through the list of Registered Data Providers at the OpenArchives.org website. The results of this effort are pretty interesting. Some of them:

  • Dublin Core is, as you would expect, the highest-used descriptive metadata standard. Every service — or at least those that reported using any namespace at all — reported Dublin Core as a record harvesting option. For some, it was the only option (which I find rather sad). One problem, though, comes in with the variety of namespace URIs declared that all appear to be semantically the same thing: http://www.openarchives.org/OAI/2.0/oai_dc/, http://www.openarchives.org/OAI/2.0/oai_dc (note the missing trailing slash), http://purl.org/dc/elements/2.0/ (used exclusively by the ProQuest Digital Commons product, it would seem), and http://purl.org/dc/elements/1.1/ (the difference between 2.0 and 1.1 is not clear to me). In order to be processable, there must be an exact string match of the namespace URI — so even that missing trailing slash is significant!
  • The next most popular namespace URI is http://info.internet.isi.edu:80/in-notes/rfc/files/rfc1807.txt, which semantically would seem to identify the IETF RFC 1807 on a Format for Bibliographic Records. You can see what one of these things looks like — although RFC1807 predates XML (it was approved by the IETF in mid-1995), it looks like someone turned the metadata format into XML along the way. Very interesting…
  • The next most popular is http://www.ndltd.org/standards/metadata/etdms/1.0/ — corresponding to the Interoperability Metadata Standard for Electronic Theses and Dissertations — followed closely by http://www.openarchives.org/OAI/1.1/oai_marc — which fell out of favor years ago with the publication of MARC21 by the Library of Congress (which goes by the namespace http://www.loc.gov/MARC21/slim). Unfortunately, it doesn’t seem to have been picked up by the majority of OAI-PMH data providers that used the older oai_marc schema.
  • As you get towards the bottom of the first list, there are all sorts of interesting variants on qualified Dublin Core and other one-off schemas.

Your thoughts and observations? I’ve filed away the UNIX script and XSLT style sheet. If there is interest in seeing something like this in the future, let me know and I can dig them out.

The text was modified to remove a link to http://dltj.org/misc/oai-pmh-namespace-report.html on December 30th, 2010.

The text was modified to remove a link to http://www.umi.com/umi/digitalcommons/ on January 19th, 2011.

The text was modified to update a link from http://www.ndltd.org/standards/metadata/current.html to http://www.ndltd.org/standards/metadata/etd-ms-v1.00-rev2.html/ on January 19th, 2011.

Link to this post!

Share this:

(This post was updated on 19-Jan-2011.)

Links in "A Report on Namespaces Used by OAI-PMH Repositories"

Tags for "A Report on Namespaces Used by OAI-PMH Repositories"

Find Related Content: within DLTJ Technorati del.icio.us Wikipedia
digital libraries Find posts tagged 'digital libraries' in DLTJ Find posts tagged 'digital libraries' in Technorati Find posts tagged 'digital libraries' in del.icio.us Find posts tagged 'digital libraries' in Wikipedia (English)
Dublin Core Find posts tagged 'Dublin Core' in DLTJ Find posts tagged 'Dublin Core' in Technorati Find posts tagged 'Dublin Core' in del.icio.us Find posts tagged 'Dublin Core' in Wikipedia (English)
libraries Find posts tagged 'libraries' in DLTJ Find posts tagged 'libraries' in Technorati Find posts tagged 'libraries' in del.icio.us Find posts tagged 'libraries' in Wikipedia (English)
MARC Find posts tagged 'MARC' in DLTJ Find posts tagged 'MARC' in Technorati Find posts tagged 'MARC' in del.icio.us Find posts tagged 'MARC' in Wikipedia (English)
metadata Find posts tagged 'metadata' in DLTJ Find posts tagged 'metadata' in Technorati Find posts tagged 'metadata' in del.icio.us Find posts tagged 'metadata' in Wikipedia (English)
oai-pmh Find posts tagged 'oai-pmh' in DLTJ Find posts tagged 'oai-pmh' in Technorati Find posts tagged 'oai-pmh' in del.icio.us Find posts tagged 'oai-pmh' in Wikipedia (English)
standards Find posts tagged 'standards' in DLTJ Find posts tagged 'standards' in Technorati Find posts tagged 'standards' in del.icio.us Find posts tagged 'standards' in Wikipedia (English)

Related Posts on Disruptive Library Technology Jester

  • Specifications for Object Reuse and Exchange (ORE) published
  • OAI-ORE Open Meeting, March 3 2008, Johns Hopkins University
  • Mashups of Bibliographic Data: A Report of the ALCTS Midwinter Forum
  • Introducing the OAI Object Reuse and Exchange Initiative
  • “Object Reuse and Exchange” Beta Specifications Now Available

Track and Share With Others

• Technorati iconTechnorati Cosmos

• TrackBack URI


Logging In...

Profile cancel

Sign in with Twitter Sign in with Facebook
or

Not published

  • 2 Replies
  • 2 Comments
  • 0 Tweets
  • 0 Facebook
  • 0 Pingbacks
Last reply was March 20, 2007
  1. Sarah Shreeves
    View March 20, 2007

    Have you seen the work that Tom Habing has done at the University of Illinois on a Registry of OAI data providers? He’s added all sorts of interesting reports on OAI data providers and has probably the biggest list of OAI data providers since he pulls in data providers that are not registered at openarchives.org.

    See http://gita.grainger.uiuc.edu/registry/.

    sarah

    Reply
  2. the jester
    View March 20, 2007

    Thank you, Sarah! Tom’s Distinct Metadata Schemas is much more comprehensive, and more useful, than my quick scripting. I’m grateful for the pointer to his work.

    Reply

Home

Search

Recent Posts

  • Code4Lib Journal Issue #20 Published; My Editorial: “It is Volunteers All the Way Down…”
  • Notes on the Code4Lib Virtual Lightning Talks
  • Interlibrary Loan Standards Undergoing Revision at the ISO Level
  • Vote for an ALA2013 Ignite Session on Open Source Communities
  • A Great iPad Keyboard/Case Combination: New Trent Airbender
  • ResourceSync Specification Draft Published for Comment

Archives

  • 2013: J F M A M J J A S O N D
  • 2012: J F M A M J J A S O N D
  • 2011: J F M A M J J A S O N D
  • 2010: J F M A M J J A S O N D
  • 2009: J F M A M J J A S O N D
  • 2008: J F M A M J J A S O N D
  • 2007: J F M A M J J A S O N D
  • 2006: J F M A M J J A S O N D
  • 2005: J F M A M J J A S O N D

Feeds and Such

  • Link to Podcast (RSS feed) for this blog
    Add Podcast to iTunes subscription
    Receive DLTJ by e-mail:


    Delivered by FeedBurner
  • View Peter Murray's profile on LinkedIn

Copyright

This work by Peter Murray is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States.

Creative Commons License
© 2013 | Theme based on Twenty Eleven by Wordpress.org | DLTJ strives for Standards Compliant XHTML & CSS | RSS Posts & Comments
From the Disruptive Library Technology Jester (http://dltj.org/), printed on Sunday the 26th of May 2013 at 8:26:05 AM UTC (+0000). The URL to this page is

[Creative Commons Logo] This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/us/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
This work by Peter Murray is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States.