Jester's Cap

Disruptive Library Technology Jester

We're Disrupted, We're Librarians, and We're Not Going to Take It Anymore

Main menu

Skip to primary content
Skip to secondary content
  • About the Blog
  • About the Author
  • About the Tagline
  • Comment Policy
  • Contact

Post navigation

← Previous Next →

Representing Collections In FEDORA

Posted on July 25, 2006 by Peter Murray
This entry was posted in DRC, Fedora, Raw Technology and tagged digital libraries, DRC, Fedora, library service-oriented architecture, metadata, open source, RDF, standards by Peter Murray. Bookmark the permalink.

One of the DRC developers had a question recently that sparked a discussion about what to do with collections of objects. In order to answer the question of how to represent the notion of a collection within the repository, we’re going to have to get pretty heavy into RDF: the Resource Description Framework. RDF is a language created by the Worldwide Web Consortium “for representing information about resources in the World Wide Web.” If you already know about RDF — or just want to see what a proposed solution is — you can skip down to the “RDF for Collections in FEDORA” heading.

At the preface, I have to say that I’m increasingly uncomfortable with the word “collection” because it has become so overloaded in library usage, and like Carl Lagoze prefer the term “aggregation” to describe in a general sense what we think a collection is and what it could become. I probably bounce back and forth between the terms here, but am aiming to use “aggregation” and “aggregation object” more often.

I’m going to be pulling a lot of examples and language from the “RDF Primer“, which I would recommend reading at some point. It is a very long, dense document, but if you can get through it you’ll have a very good understanding of what RDF is and what is does for us.

The Primer describes RDF this way: “It is particularly intended for representing metadata about Web resources, such as the title, author, and modification date of a Web page, copyright and licensing information about a Web document, or the availability schedule for some shared resource…. RDF is based on the idea that the things being described have properties which have values, and that resources can be described by making statements … that specify those properties and values.”

There are three parts to an RDF statement about an object. “[T]he part that identifies the thing the statement is about (the Web page in this example) is called the subject. The part that identifies the property or characteristic of the subject that the statement specifies (creator, creation-date, or language in these examples) is called the predicate, and the part that identifies the value of that property is called the object.”

These component make up what is called an “RDF triple.” When written in tabular form an RDF triple is conventionally written in the order subject, predicate, object. To represent RDF statements in a machine-processable way, RDF uses the Extensible Markup Language [XML]. RDF defines a specific XML markup language, referred to as RDF/XML, for use in representing RDF information, and for exchanging it between machines.

For instance, imagine trying to state the (nominally, Dublin Core) descriptive metadata about a web page called http://www.example.org/index.html. In natural language, the descriptive elements could be:

http://www.example.org/index.html has a creator whose value is John Smith
http://www.example.org/index.html has a creation-date whose value is Aug 16, 1999
http://www.example.org/index.html has a language whose value is English

In tabular form, this could look like:

SubjectPredicateObjecthttp://www.example.org/index.htmlcreatorJohn Smithhttp://www.example.org/index.htmlcreation-dateAug 16, 1999http://www.example.org/index.htmllanguageEnglish

In XML, this could look like:

 <?xml version="1.0"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"      xmlns:dc="http://purl.org/dc/elements/1.1/">   <rdf:Description rdf:about="http://ex.org/i.html">     <dc:creator>John Smith</dc:creator>     <dc:creation-date>Aug 16, 1999</dc:creation-date>     <dc:language>English</dc:language>   </rdf:Description> </rdf:RDF>

Keep in mind, though, that we expressed the predicate here as Dublin Core; the predicate can be anything — even something you make up!

RDF for Collections in FEDORA

RDF is used throughout FEDORA — in fact, the Dublin Core properties can (and in our FEDORA configuration, are) expressed as RDF triples in an internal database and can be searched as such. But the RDF triples can be used to express more than just attributes about an object — it can be used to express /relationships/ between objects. There is a whole section of the FEDORA docs called “Fedora Digital Object Relationships” that goes into more detail. Quotations and examples in this section are drawn from that document.

“Fedora digital objects can be related to other Fedora objects in many ways. For example there may be a Fedora object that represents a collection and other objects that are members of that collection. Also, it may be the case that one object is considered a part of another object, a derivation of another object, a description of another object, or even equivalent to another object.”

FEDORA comes with a list of common relationships between objects, and other community or user-defined relationships may also be asserted. These relationships can be expressed in RDF notation:

SubjectPredicateObject<subjectFedoraObject><relationshipProperty><targetFedoraObject>MyCatVideois a member of collectionGreatCatVideosdrc:101isMemberOfdrc:100drc:101isFromInstitutionmu3ug
 <?xml version="1.0"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"   xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"   xmlns:fedora="info:fedora/fedora-system:def/relations-external#"   xmlns:drcns="http://drc.ohiolink.edu/ontologies/relationships#">   <rdf:Description rdf:about="info:fedora/drc:101">      <fedora:isMemberOf rdf:resource="info:fedora/drc:100" />      <drcns:isFromInstitution>mu3ug</drcns:isFromInstitution>   </rdf:Description> </rdf:RDF>

“drc:100″ is an aggregation object (otherwise known as a “collection object”, but I’ve learned from others in the FEDORA community that “collection” is too loaded of a word) of which “drc:101″ is a member. To put it in terms that we may be familiar with:

  • drc:100 is the aggregation object for the “Charles E. Frohman Collection”
  • drc:101 is a digital image of a photograph with the title “Work Crew” that is part of the Charles E. Frohman collection
  • drc:101 is a digital image contributed by member institution “mu3ug”

Conclusions


So the issue becomes, I believe, to examine the pre-loaded set of relationships to match those against the existing relationships in the DMC and then do define any kind of unique relationships (such as “isFromInstitution”) that we would want to express about our objects.

Link to this post!

Share this:

Links in "Representing Collections In FEDORA"

Series Table of Contents for Why FEDORA?

  1. On the Need for a General Purpose Digital Object Repository
  2. Why Fedora? Because You Don’t Need Fedora
  3. Thinking about Our Fedora Disseminators
  4. Processing Raw Fedora Objects
  5. Fedora Disseminators to Enable Accessible Repository Content
  6. Representing Collections In FEDORA
  7. Best Practice Proposal for a DESCRIPTION Datastream
  8. Why FEDORA? Answers to the FEDORA Users Interview Survey

Tags for "Representing Collections In FEDORA"

Find Related Content: within DLTJ Technorati del.icio.us Wikipedia
digital libraries Find posts tagged 'digital libraries' in DLTJ Find posts tagged 'digital libraries' in Technorati Find posts tagged 'digital libraries' in del.icio.us Find posts tagged 'digital libraries' in Wikipedia (English)
DRC Find posts tagged 'DRC' in DLTJ Find posts tagged 'DRC' in Technorati Find posts tagged 'DRC' in del.icio.us Find posts tagged 'DRC' in Wikipedia (English)
Fedora Find posts tagged 'Fedora' in DLTJ Find posts tagged 'Fedora' in Technorati Find posts tagged 'Fedora' in del.icio.us Find posts tagged 'Fedora' in Wikipedia (English)
library service-oriented architecture Find posts tagged 'library service-oriented architecture' in DLTJ Find posts tagged 'library service-oriented architecture' in Technorati Find posts tagged 'library service-oriented architecture' in del.icio.us Find posts tagged 'library service-oriented architecture' in Wikipedia (English)
metadata Find posts tagged 'metadata' in DLTJ Find posts tagged 'metadata' in Technorati Find posts tagged 'metadata' in del.icio.us Find posts tagged 'metadata' in Wikipedia (English)
open source Find posts tagged 'open source' in DLTJ Find posts tagged 'open source' in Technorati Find posts tagged 'open source' in del.icio.us Find posts tagged 'open source' in Wikipedia (English)
RDF Find posts tagged 'RDF' in DLTJ Find posts tagged 'RDF' in Technorati Find posts tagged 'RDF' in del.icio.us Find posts tagged 'RDF' in Wikipedia (English)
standards Find posts tagged 'standards' in DLTJ Find posts tagged 'standards' in Technorati Find posts tagged 'standards' in del.icio.us Find posts tagged 'standards' in Wikipedia (English)

Related Posts on Disruptive Library Technology Jester

  • Why FEDORA? Answers to the FEDORA Users Interview Survey
  • Why Fedora? Because You Don’t Need Fedora
  • Looking Forward to Version 2.2 of FEDORA
  • Fedora plus Sakai — a marriage made in heaven?
  • Analysis of CDL’s XTF textIndexer to Replace the Local Files with FEDORA Objects

Track and Share With Others

• Technorati iconTechnorati Cosmos

• TrackBack URI


Logging In...

Profile cancel

Sign in with Twitter Sign in with Facebook
or

Not published

Home

Search

Recent Posts

  • Code4Lib Journal Issue #20 Published; My Editorial: “It is Volunteers All the Way Down…”
  • Notes on the Code4Lib Virtual Lightning Talks
  • Interlibrary Loan Standards Undergoing Revision at the ISO Level
  • Vote for an ALA2013 Ignite Session on Open Source Communities
  • A Great iPad Keyboard/Case Combination: New Trent Airbender
  • ResourceSync Specification Draft Published for Comment

Archives

  • 2013: J F M A M J J A S O N D
  • 2012: J F M A M J J A S O N D
  • 2011: J F M A M J J A S O N D
  • 2010: J F M A M J J A S O N D
  • 2009: J F M A M J J A S O N D
  • 2008: J F M A M J J A S O N D
  • 2007: J F M A M J J A S O N D
  • 2006: J F M A M J J A S O N D
  • 2005: J F M A M J J A S O N D

Feeds and Such

  • Link to Podcast (RSS feed) for this blog
    Add Podcast to iTunes subscription
    Receive DLTJ by e-mail:


    Delivered by FeedBurner
  • View Peter Murray's profile on LinkedIn

Copyright

This work by Peter Murray is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States.

Creative Commons License
© 2013 | Theme based on Twenty Eleven by Wordpress.org | DLTJ strives for Standards Compliant XHTML & CSS | RSS Posts & Comments
From the Disruptive Library Technology Jester (http://dltj.org/), printed on Friday the 24th of May 2013 at 4:33:02 AM UTC (+0000). The URL to this page is

[Creative Commons Logo] This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/us/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
This work by Peter Murray is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States.