<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"><channel><title>Disruptive Library Technology Jester &#187; trac</title> <atom:link href="http://dltj.org/tag/trac/feed/" rel="self" type="application/rss+xml" /><link>http://dltj.org</link> <description>We&#039;re Disrupted, We&#039;re Librarians, and We&#039;re Not Going to Take It Anymore</description> <lastBuildDate>Mon, 06 Feb 2012 20:04:22 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <cloud domain='dltj.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' /> <creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/us/</creativeCommons:license> <item><title>Options in Storage for Digital Preservation</title><link>http://dltj.org/article/preservation-storage-options/</link> <comments>http://dltj.org/article/preservation-storage-options/#comments</comments> <pubDate>Sun, 09 Jan 2011 22:45:10 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[Amazon S3]]></category> <category><![CDATA[Association for Library Collections and Technical Services]]></category> <category><![CDATA[Chronopolis]]></category> <category><![CDATA[DAITS]]></category> <category><![CDATA[Duracloud]]></category> <category><![CDATA[Iron Mountain]]></category> <category><![CDATA[LOCKSS]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[preservation]]></category> <category><![CDATA[storage]]></category> <category><![CDATA[trac]]></category><guid isPermaLink="false">http://dltj.org/?p=2101</guid> <description><![CDATA[A last-minute change to my plans for ALA Midwinter came on Tuesday when I was sought out to fill in for a speaker than canceled at the ALCTS Digital Preservation Interest Group meeting. Options for outsourcing storage and services for &#8230; <a href="http://dltj.org/article/preservation-storage-options/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2101"></abbr><p>A last-minute change to <a href="http://dltj.org/article/alamw11-schedule/">my plans for ALA Midwinter</a> came on Tuesday when I was sought out to fill in for a speaker than canceled at the <a href="http://connect.ala.org/node/119686" title="Digital Preservation Interest Group (ALCTS) | ALA Connect">ALCTS Digital Preservation Interest Group meeting</a>.  Options for outsourcing storage and services for preserving digital content has been a recent interest, so I volunteered to combine two earlier <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> blog posts with some new information and present it to the group for feedback.  The reaction was great, and here is the promised slide deck, links to further information, and some thoughts from the audience response.</p><p><h2>Slide Deck and References</h2><br /><div id="slideshare-options-in-storage" class="wp-caption alignright" style="width: 435px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><div style="width:425px" id="__ss_6499127"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/DataGazetteer/options-in-storage-for-digital-preservation" title="Options in Storage for Digital Preservation">Options in Storage for Digital Preservation</a></strong><object id="__sse6499127" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=201101digitalpreservationinterestgroupalcts-110109162838-phpapp02&#038;stripped_title=options-in-storage-for-digital-preservation&#038;userName=DataGazetteer" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse6499127" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=201101digitalpreservationinterestgroupalcts-110109162838-phpapp02&#038;stripped_title=options-in-storage-for-digital-preservation&#038;userName=DataGazetteer" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div><p><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Slides for &#039;Options in Storage for Digital Preservation&#039;</p></div><br />In the presentation there is a <a href="http://www.slideshare.net/DataGazetteer/options-in-storage-for-digital-preservation/14" title="Options in Storage for Digital Preservation">Table About Costs</a> that uses a scenario from an <a href="http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/">earlier <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> blog post</a>.  The text of the scenario is:<br /><blockquote>To examine the similarities and differences in costs, let’s use the OhioLINK Satellite Image collection as a prototypical example. It consists of about 2 terabytes (2TB) of high-quality images in TIFF format, with about 7.5GB of data going into the repository each month. In the interest of exploring everything that S3 can do, there is an assumption that approximately 4GB of data will be transferred out of the archive each month; OCLC’s Digital Archive does not have a end-user dissemination component.</p></blockquote><p>The point of showing this scenario is to show the widest range of costs &#8212; from a storage-only solution like Amazon S3 to a soup-to-nuts service like OCLC Digital Archive.  A word about the redacted costs.  Some of the numbers for OCLC&#8217;s Digital Archive response (from 2008) came from a confidential quote, so the numbers were removed from the public table.  For the numbers that are publicly listed, the values come from <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018" title="OCLC Introduces High-Priced Digital Archive Service">Barbara Quint&#8217;s article</a>.</p><p>The articles and blog posts I referenced in the course of the presentation were:</p><p>Iglesias, Edward and Wittawat Meesangnil (2010). Using Amazon S3 in Digital Preservation in a mid sized academic library: A case study of CCSU ERIS digital archive system. <i>The Code4Lib Journal</i>, issue 12, retrieved 5-Jan-2011 from <a href="http://journal.code4lib.org/articles/4468" title="Using Amazon S3 in Digital Preservation in a mid sized academic library: A case study of CCSU ERIS digital archive system | The Code4Lib Journal">http://journal.code4lib.org/articles/4468</a></p><p>Murray, Peter (2008). Long-term Preservation Storage: OCLC Digital Archive versus Amazon S3. <i>Disruptive Library Technology Jester.</i> Retrieved 5-Jan-2011 from <a href="http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/">http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/</a></p><p>Murray, Peter (2009). Can We Outsource the Preservation of Digital Bits?. <i>Disruptive Library Technology Jester.</i> Retrieved 5-Jan-2011 from <a href="http://dltj.org/article/outsource-digital-bits/">http://dltj.org/article/outsource-digital-bits/</a></p><p>Quint, Barbara (2008). OCLC Introduces High-Priced Digital Archive Service. <i>Information Today.</i> Retrieved 5-Jan-2011 from <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018" title="OCLC Introduces High-Priced Digital Archive Service | Information Today">http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018</a></p><ul><li>Amazon S3. Retrieved 5-Jan-2011 from <a href="http://aws.amazon.com/s3/" title="Amazon Simple Storage Service (Amazon S3)">http://aws.amazon.com/s3/</a></li><li>Chronopolis. Retrieved 5-Jan-2011 from <a href="http://chronopolis.sdsc.edu/" title="Chronopolis -- Digital Preservation Program -- Long-Term Mass-Scale Federated Digital Preservation"> http://chronopolis.sdsc.edu/</a></li><li>DAITSS. Retrieved 5-Jan-2011 from <a href="http://daitss.fcla.edu/" title="DAITSS - Trac">http://daitss.fcla.edu/</a></li><li>DuraCloud. Retrieved 5-Jan-2011 from <a href="https://wiki.duraspace.org/display/duracloud/DuraCloud">https://wiki.duraspace.org/display/duracloud/DuraCloud</a></li><li>Iron Mountain. Retreived 5-Jan-2011 from <a href="http://www.ironmountain.com/news/2009/impr02232009.asp" title="Iron Mountain Digital Introduces the Industry" class="broken_link" rel="nofollow">http://www.ironmountain.com/news/2009/impr02232009.asp</a></li><li>OCLC Digital Archive. Retrieve 5-Jan-2011 from <a href="http://www.oclc.org/us/en/digitalarchive/" title="Digital Archive [OCLC - Digital Collection Services]">http://www.oclc.org/us/en/digitalarchive/</a></li><li>Private LOCKSS Networks. Retrieved 5-Jan-2011 from <a href="http://lockss.stanford.edu/lockss/Private_LOCKSS_Networks" title="Private LOCKSS Networks - LOCKSS">http://lockss.stanford.edu/lockss/Private_LOCKSS_Networks</a></li></ul><p><h2>Some Thoughts</h2><br />There was a great deal of discussion after the presentation about how good of a guarantee is good enough.  Amazon S3, offers two levels of availability:  &#8220;Designed to provide 99.999999999% durability and 99.99% availability of objects over a given year.&#8221;  The question was whether that slight risk of loss is &#8220;good enough&#8221; for our purposes.  Coming to grips with the digital storage, can we (as the librarian profession) get someone from Amazon to talk about what they do to assure that data is available?  Can the terms that they use be translated into terms that we use and understand?  Can we get a level of familiarity and comfort with their storage about what they do to trust them as a long-term data warehouse?  Can we pull out the appropriate questions of the <a href="http://www.dcc.ac.uk/resources/tools-and-applications/trustworthy-repositories" title="Trustworthy Repositories | Digital Curation Centre">Trusted Repositories Audit &amp; Certification: Criteria and Checklist</a> to see how Amazon S3 measures up?</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/preservation-storage-options/feed/</wfw:commentRss> <slash:comments>10</slash:comments> </item> <item><title>HTML Template of a TRAC:CC Report</title><link>http://dltj.org/article/trac-cc-report-template/</link> <comments>http://dltj.org/article/trac-cc-report-template/#comments</comments> <pubDate>Mon, 27 Aug 2007 17:44:27 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Unified Content Repository]]></category> <category><![CDATA[preservation]]></category> <category><![CDATA[trac]]></category><guid isPermaLink="false">http://dltj.org/2007/08/trac-cc-report-template/</guid> <description><![CDATA[Just in case this might be useful to others, I&#8217;ve created a report template based on the &#8220;Trustworthy Repositories Audit &#038; Certification: Criteria Checklist&#8221; report. It has the section, subsection, criteria and evidence from the original report marked up in &#8230; <a href="http://dltj.org/article/trac-cc-report-template/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/08/trac-cc-report-template/"></abbr><p>Just in case this might be useful to others, I&#8217;ve created a <a href="http://cdn.dltj.org/wp-content/uploads/2007/08/trac-cc-criteria-checklist.html.gzip">report template</a> based on the &#8220;Trustworthy Repositories Audit &#038; Certification: Criteria Checklist&#8221; report.  It has the section, subsection, criteria and evidence from the <a href="http://www.crl.edu/content.asp?l1=13&#038;l2=58&#038;l3=162&#038;l4=91" title="CRL - Preservation">original report</a> marked up in an HTML document &#8212; ready for you to add the narrative on how your repository meets the criteria.  The original report included a table-style layout as an appendix; personally, I like this more free-form narrative approach.</p><p>I need to set work on this aside for a moment &#8212; if anyone gets to filling out appropriate portions based on the <a href="http://www.fedora-commons.org/" title="Fedora Commons homepage">Fedora digital object repository</a>, please let me know.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/trac-cc-report-template/feed/</wfw:commentRss> <slash:comments>1</slash:comments> </item> <item><title>Planning a digital preservation assessment using TRAC:CC and DRAMBORA</title><link>http://dltj.org/article/trac-and-drambora/</link> <comments>http://dltj.org/article/trac-and-drambora/#comments</comments> <pubDate>Wed, 23 May 2007 20:06:51 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[DRC]]></category> <category><![CDATA[policy]]></category> <category><![CDATA[digital libraries]]></category> <category><![CDATA[drambora]]></category> <category><![CDATA[jisc]]></category> <category><![CDATA[oais]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[preservation]]></category> <category><![CDATA[standards]]></category> <category><![CDATA[trac]]></category><guid isPermaLink="false">http://dltj.org/2007/05/trac-and-drambora/</guid> <description><![CDATA[OhioLINK is engaged in building a &#8220;trusted digital repository&#8221; on behalf of its membership. As we build it, we want to have an understanding of what &#8220;trusted&#8221; means, and so we are engaging in an audit process to assess whether &#8230; <a href="http://dltj.org/article/trac-and-drambora/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/2007/05/trac-and-drambora/"></abbr><p>OhioLINK is engaged in building a &#8220;trusted digital repository&#8221; on behalf of its membership.  As we build it, we want to have an understanding of what &#8220;trusted&#8221; means, and so we are engaging in an audit process to assess whether we can claim to be trustworthy.  This process is panning out to have four major phases:</p><ol><li>Research common and best practices for preservation.</li><li>Evaluate the OhioLINK policies and processes against common and best practices.</li><li>Perform a gap analysis between where we are now and where common and best practices suggest we should be.</li><li>Propose and adopt policies and processes that get us closer to the ideal common and best practices.</li></ol><p>This is a report at the end of phase 1.  Earlier this year, two major reports were released that address how one measures a &#8220;trustworthy repository.&#8221;  The two reports are summarized below, followed by a recommendation.<br /><span id="more-241"></span><br /><h2>Trustworthy Repositories Audit &amp; Certification: Criteria and Checklist</h2><br />The first is the OCLC/CRL/NARA <i><a href="http://bibpurl.oclc.org/web/16712" title="Trustworthy Repositories Audit &amp; Certification: Criteria and Checklist homepage">Trustworthy Repositories Audit &amp; Certification: Criteria and Checklist</a></i> (TRAC:CC).  This document &#8220;represents best current practice and thought about the organization and technical infrastructure required to be considered trustworthy and capable of certification.&#8221;  Quoting again:</p><blockquote><p>The nestor working group says a trusted, “long-term digital repository is a complex and interrelated system” (nestor 2006). However, more than just the “digital preservation system” drives the management of the digital materials. In determining trustworthiness, one must look at the entire system in which the digital information is managed, including the organization running the repository: its governance; organizational structure and staffing; policies and procedures; financial fitness and sustainability; the contracts, licenses, and liabilities under which it must operate; and trusted inheritors of data, as applicable. Additionally, the digital object management practices, technological infrastructure, and data security in place must be reasonable and adequate to fulfill the mission and commitments of the repository.</p><p>A trusted digital repository will understand threats to and risks within its systems. As articulated by Rosenthal et al. (2005), these potential threats include media failure, hardware failure, software failure, communication errors, failure of network services, media and hardware obsolescence, software obsolescence, operator error, natural disaster, external attack, internal attack, economic failure, and organizational failure. Constant monitoring, planning, and maintenance, as well as conscious actions and strategy implementation will be required of repositories to carry out their mission of digital preservation. All of these present an expensive, complex undertaking that depositors, stakeholders, funders, the designated community(ies), and other digital repositories will need to rely on in the greater collaborative digital preservation environment that is required to preserve the vast amounts of digital information generated now and into the future.</p></blockquote><p>The <i>TRAC:CC</i> contains 84 criteria broken out into three main sections:  Organizational infrastructure; Digital object management; and Technologies, technical infrastructure, and security. Within each of these sections are various subsections and under the subsections are the criteria themselves.</p><ol type="A" start="1"><li>Organizational infrastructure<ol><li>Governance &amp; organizational viability</li><li>Organizational structure &amp; staffing</li><li>Procedural accountability &amp; policy framework</li><li>Financial sustainability</li><li>Contracts, licenses, &amp; liabilities</li></ol></li><li>Digital object management<ol><li>Ingest: acquisition of content</li><li>Ingest: creation of the archivable package</li><li>Preservation planning</li><li>Archival storage &amp; preservation/maintenance of AIPs</li><li>Information management</li><li>Access management</li></ol></li><li>Technologies, technical infrastructure, and security<ol><li>System infrastructure</li><li>Appropriate technologies</li><li>Security</li></ol></li></ol><p>Some sample criteria are:</p><ul><li>A1.1 Repository has a mission statement that reflects a commitment to the long-term retention of, management of, and access to digital information.</li><li>A2.2 Repository has the appropriate number of staff to support all functions and services.</li><li>A3.5 Repository has policies and procedures to ensure that feedback from producers and users is sought and addressed over time.</li><li>A5.4 Repository tracks and manages intellectual property rights and restrictions on use of repository content as required by deposit agreement, contract, or license.</li><li>B2.5 Repository has and uses a naming convention that generates visible, persistent, unique identifiers for all archived objects (i.e., AIPs).</li><li>B2.9 Repository acquires preservation metadata (i.e., PDI) for its associated Content Information.</li><li>B3.4 Repository can provide evidence of the effectiveness of its preservation planning.</li><li>B4.4 Repository actively monitors integrity of archival objects (i.e., AIPs).</li><li>B5.3 Repository can demonstrate that referential integrity is created between all archived objects (i.e., AIPs) and associated descriptive information.</li><li>C1.1 Repository functions on well-supported operating systems and other core</li><li>infrastructural software.</li><li>C1.5 Repository has effective mechanisms to detect bit corruption or loss.</li><li>C1.7 Repository has defined processes for storage media and/or hardware change (e.g., refreshing, migration).</li><li>C1.9 Repository has a process for testing the effect of critical changes to the system.</li><li>C3.3 Repository staff have delineated roles, responsibilities, and authorizations related to implementing changes within the system.</li></ul><p><i>TRAC:CC</i> states that the &#8220;criteria are written to be applicable to any kind of digital repository or archives.&#8221;  As such, criteria should be placed within the context of vision and goals of the Ohio DRC.  One demonstrates compliance with the criteria through documentation (evidence), transparency (open examination of the evidence), adequacy (degree to which the evidence meets the vision/goals), and measurability.</p><p><h2>Digital Repository Audit Method Based On Risk Assessment</h2><br />The second major report was <i><a href="http://www.repositoryaudit.eu/" title="DCC/DPE Digital Repository Audit Method Based on Risk Assessment homepage">Digital Repository Audit Method Based On Risk Assessment</a></i> (DRAMBORA).  Written by the Digital Curation Centre (U.K. JISC-funded effort researching best practice for storage management and preservation of digital information) and Digital Preservation Europe (European Commission-funded project to improve coordination and cooperation among member states for digital preservation), <i>DRAMBORA</i> is a more methodical approach to assessing the trustworthiness of a repository.  A systematic process guides the auditor to identify risks to long-term preservation of repository content, and then scores each risk as a product between the likelihood of the risk occurring with the impact associated with that event.  Mitigation of the risks could then be prioritized based on a descending order of the score.</p><p>The process has six stages, some with multiple tasks:</p><ol><li>Identify organizational context<ul><li>Specify mandate of your repository or the organization in which it is embedded</li><li>List goals and objectives of your repository</li></ul></li><li>Document policy and regulatory framework<ul><li>List your repository&#8217;s strategic planning documents</li><li>List the legal, regulatory, and contractual frameworks or agreements to which your repository is subject</li><li>List the voluntary codes to which your repository has agreed to adhere</li><li>List any other documents and principles with which your repository complies</li></ul></li><li>Identify activities, assets and their owners<ul><li>Identify your repository&#8217;s activities, assets and their owners</li></ul></li><li>Identify risks<ul><li>Identify risks associated with activities and assets of your repository</li></ul></li><li>Assess risks<ul><li>Assess the identified risks</li></ul></li><li>Manage risks<ul><li>Manage the risks</li></ul></li></ol><p>The report includes a catalog of risks taken from other checklists and repository audits that can be used to spur the thinking of the auditor.</p><p><h2>Recommendation</h2><br />As other reviewers of these documents have noted, <i>DRAMBORA</i> takes a more quantified approach to assessing repositories.  As such, I think it would work best for an established repository self-review. <i>TRAC:CC</i> is more open-ended and exploratory, taking into account vision and goals and plans for a repository.  The authors of <i>DRAMBORA</i> estimate that it would take 28 to 40 hours to complete the audit; <i>TRAC:CC</i> does not provide an estimate, but I think its more general nature means that it would take less time.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/trac-and-drambora/feed/</wfw:commentRss> <slash:comments>5</slash:comments> </item> </channel> </rss>
<!-- Served from: dltj.org @ 2012-02-11 09:04:14 by W3 Total Cache -->
