Article-Level OAI-PMH Harvest Available from DOAJ

Earlier this year the DOAJ began offering a new schema for registered articles that significantly improves the value of OAI-PMH harvested article content. Prior to this addition the only schema available was Dublin Core, which is woefully inadequate as a metadata schema for describing article content. (Dublin Core, of course, was never designed to handle the complexity of describing an average article.) The new schema (depicted in the doajArticles schema diagram that accompanies this post) includes elements for ISSN/eISSN, volume/issue, start/end page numbers, and author affiliation. There is also a <fullTextUrl> element that links to the article content itself (not the splash page of the article on the publisher's site).
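
To give a feel for what a harvester might do with one of these records, here is a minimal Python sketch. Only <fullTextUrl> is an element name taken from the schema description above; the other element names, the sample values, and the helper summarize_record are hypothetical placeholders, and a real record would also carry namespaces that this sketch ignores.

```python
import xml.etree.ElementTree as ET

# Hypothetical record. Only <fullTextUrl> is named in the post; the other
# element names are assumed stand-ins for the fields the post lists.
SAMPLE_RECORD = """
<record>
  <issn>1234-5678</issn>
  <volume>12</volume>
  <issue>3</issue>
  <startPage>45</startPage>
  <endPage>67</endPage>
  <fullTextUrl>http://example.org/articles/12/3/45.pdf</fullTextUrl>
</record>
"""


def summarize_record(xml_text):
    """Return a dict mapping each child element name to its text."""
    root = ET.fromstring(xml_text)
    return {child.tag: (child.text or "").strip() for child in root}


record = summarize_record(SAMPLE_RECORD)
print(record["fullTextUrl"])
```

The point is simply that the link to the article itself arrives as its own field, rather than having to be guessed out of a generic Dublin Core identifier.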

Article content using this schema is harvestable through the DOAJ OAI-PMH provider site (for instance, using a ListRecords verb with a doajArticle metadata prefix against the PMH URL). This is, in fact, the same schema journal publishers use to submit article content to the DOAJ article database. With these pieces in place, it is now conceivable to harvest open access journal article content through the DOAJ and add it to a local journal article repository (such as the Electronic Journal Center in the case of OhioLINK).
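
As a rough sketch of what such a harvest request could look like, the Python below builds a ListRecords URL and pulls the resumption token out of a response so the next page can be requested. The base URL is a placeholder (substitute the real DOAJ PMH URL), the function names are my own, and the doajArticle prefix is simply the one mentioned above; the verb, the argument names, and the response namespace come from the OAI-PMH 2.0 protocol.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace of OAI-PMH 2.0 responses, in ElementTree's {uri}tag notation.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"

# Placeholder only: replace with the actual DOAJ OAI-PMH base URL.
BASE_URL = "http://example.org/oai"


def list_records_url(base_url, metadata_prefix="doajArticle", resumption_token=None):
    """Build a ListRecords request URL for the first or a follow-up page."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument: it is sent
        # with the verb alone, without metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def next_token(response_xml):
    """Return the resumption token in a response, or None on the last page."""
    root = ET.fromstring(response_xml)
    token = root.find(f"{OAI_NS}ListRecords/{OAI_NS}resumptionToken")
    if token is None or not (token.text or "").strip():
        return None
    return token.text.strip()


print(list_records_url(BASE_URL))
```

A harvester would fetch the first URL, store the records, and keep calling list_records_url with the value returned by next_token until it comes back as None.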

Thanks go out to the DOAJ folks for making this available!

5 Comments

  1. Eric Lease Morgan | July 12, 2007 at 9:16 am | Permalink

    Yep, kudos to DOAJ.

    I saw this a week or two ago, and while I did not take advantage of their article-specific metadata scheme, I did use the Dublin Core metadata scheme to harvest about 54,000 of the articles and save them into a MyLibrary instance. I then used an indexer called KinoSearch to make them searchable. Finally, I created a rudimentary searchable/browsable interface to the whole thing. See:

    http://dewey.library.nd.edu/mylibrary/demos/article-index/

    Ah, the possibilities are almost endless!


    Eric Lease Morgan
    University Libraries of Notre Dame

  2. the jester | July 13, 2007 at 9:26 pm | Permalink

    Neat, Eric! Thanks for posting the demo link. Another great idea for the feed…

  3. K.G. Schneider | July 14, 2007 at 11:36 am | Permalink

    Is the relationship between two articles in a journal defined by the start and end pages of each article?

  4. the jester | July 16, 2007 at 9:02 am | Permalink

    K.G. Schneider wrote: "Is the relationship between two articles in a journal defined by the start and end pages of each article?"

    I suppose it would be; I haven’t stopped to think about it much. The order of elements matters in XML, so it could be an accurate representation of the way a journal issue is put together. The database in which the citation data is stored would need to preserve that order, of course.

  5. Md. Malek Hossain | April 2, 2008 at 9:30 am | Permalink

    This website is essential for me and makes a helpful guideline.

3 Trackbacks

  1. eFoundations | July 12, 2007 at 9:09 am | Permalink

    Journal articles, metadata formats and woes…

    In a post on his Digital Library Technology Jester weblog, Peter Murray of OhioLINK points to an XML format developed by the Directory of Open Access Journals (DOAJ) for representing descriptions of journal articles. First, I think I’d qualify Peter’…

  2. Peter Suber, Open Access News | July 12, 2007 at 7:21 pm | Permalink

    [...] articles for local repositories through the DOAJ Article-Level OAI-PMH Harvest Available from DOAJ, Disruptive Technology Library Jester, July 11, 2007. [...]

  3. DigitalKoans | July 13, 2007 at 10:07 am | Permalink

    What was new and interesting during the week of 7/9/07? (Brief quotes follow article/Web page titles.)

From the Disruptive Library Technology Jester (http://dltj.org/), printed on Tuesday the 9th of February 2010 at 9:08:34 AM EST (-0500). The URL to this page is http://dltj.org/article/doaj-articles/

[Creative Commons Logo] This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/us/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.