XML Tower of Structural Metadata

Posted on 3 minute read

× This article was imported from this blog's previous content management system (WordPress), and may have errors in formatting and functionality. If you find these errors are a significant barrier to understanding the article, please let me know.

Jerome McDonough of the Graduate School of Library & Information Science at the University of Illinois at Urbana-Champaign presented a paper this summer at the Balisage conference with the title Structural Metadata and the Social Limitation of Interoperability: A Sociotechnical View of XML and Digital Library Standards Development. ((McDonough, J. (2008). Structural Metadata and the Social Limitation of Interoperability: A Sociotechnical View of XML and Digital Library Standards Development. InBalisage: The Markup Conference Proceedings 2008. Montréal, Canada. Retrieved October 2, 2008, from http://balisage.net/Proceedings/html/2008/McDonough01/Balisage2008-McDonough01.html.)) The title is very hard to penetrate, but the contents of the paper lay bare a theory for why we don't have large, swirling pools of shared digital objects that cross institutional silo boundaries.

Jerome lays the issue out right away. Paraphrasing from several papers, he says:

Despite its success, however, XML has not lived up to many librarians' expectations within one area, that of interoperability.... Digital library developers have expected that shared use of an XML standard for structuring of content and metadata (what is commonly called "structural metadata" within the digital library community) would ensure content interoperability and provide a clean division between content and higher level tools and services designed to work with standardized encodings of that content. In practice, this goal has proved extraordinarily elusive. Experiments conducted by participants in the Library of Congress National Digital Infrastructure for Preservation Program (NDIIPP) to test the exchange of digital objects between repositories failed even when participants were using the same XML-based encoding format and producing valid XML instances to exchange.

What Jerome is referring to is the ability to readily move objects from one repository to another. This would seem inherently doable on the surface -- the offering repository and the receiving repository are both using XML and perhaps even the same "structural metadata standard" (METS, MPEG-21, etc.). These standards provide "a structural grammar for the encoding of complex digital objects" -- the kind of thing needed to move these complex digital objects around various repositories. Jerome lays out two reasons why this doesn't occur. First, there is a tremendous amount of flexibility in the metadata standards as a result of efforts to make each of the standards abstract enough to encode every conceivable structure. Document authors have choices in the depth of levels of description, labeling of object components, and arrangement of the object structure relative to creation of one or more interrelated descriptive metadata files. The second issue he identifies is the problem of "standards independence", or the desire by the document standard author to have his/her metadata schema stand alone. Relying on other schemas may decrease the usefulness of the new standard to other organizations and environments.

The paper is a rich history of structural markup standards that have lead the profession to where it is today. He concludes with suggestions for the digital library community to move past this problem. One solution is to declare that our community is more concerned with using the flexibility inherent in the metadata standards for local needs than we are with sharing digital objects across silos. Counter to this is to refine standards to reduce the flexibility so as to increase the chances that the standards will promote interoperability. Jerome also offers the idea of promoting the activity of converting between various metadata formats to a more recognized and valued level. (He notes, for instance that the XSL stylesheets created by the Library of Congress that convert MODS into MARC/XML and back are omitted from the 'Standards of the Library of Congress' web page.)

Via Lorcan Dempsey.

The text was modified to update a link from http://balisage.net/Proceedings/html/2008/McDonough01/Balisage2008-McDonough01.html to http://www.balisage.net/Proceedings/vol1/html/McDonough01/BalisageVol1-McDonough01.html on November 13th, 2012.

The text was modified to update a link from http://balisage.net/Proceedings/html/2008/McDonough01/Balisage2008-McDonough01.html to http://www.balisage.net/Proceedings/vol1/html/McDonough01/BalisageVol1-McDonough01.html on November 13th, 2012.