<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"><channel><title>Disruptive Library Technology Jester &#187; Meeting</title> <atom:link href="http://dltj.org/category/meeting/feed/" rel="self" type="application/rss+xml" /><link>http://dltj.org</link> <description>We&#039;re Disrupted, We&#039;re Librarians, and We&#039;re Not Going to Take It Anymore</description> <lastBuildDate>Fri, 18 May 2012 15:43:10 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <cloud domain='dltj.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' /> <creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/us/</creativeCommons:license> <item><title>My ALA Midwinter 2012 Schedule</title><link>http://dltj.org/article/alamw12-schedule/</link> <comments>http://dltj.org/article/alamw12-schedule/#comments</comments> <pubDate>Thu, 19 Jan 2012 22:08:26 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2012]]></category> <category><![CDATA[foss4lib]]></category> <category><![CDATA[travel]]></category><guid isPermaLink="false">http://dltj.org/?p=3619</guid> <description><![CDATA[The snow is falling here in central Ohio, so I&#8217;m eager to leave here and head to warm Dallas for ALA Midwinter 2012. I&#8217;m looking forward to catching up with colleagues; making new acquaintances; learning the latest thinking on RDA, &#8230; <a href="http://dltj.org/article/alamw12-schedule/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=3619"></abbr><p>The snow is falling here in central Ohio, so I&#8217;m eager to leave here and head to <a href="http://www.wunderground.com/q/zmw:75201.1.99999" title="Weather Forecast Dallas, TX | Dallas Weather | Wunderground">warm Dallas</a> for <a href="http://www.alamidwinter.org/" title="Homepage | ALA Midwinter 2012">ALA Midwinter 2012</a>.  I&#8217;m looking forward to catching up with colleagues; making new acquaintances; learning the latest thinking on RDA, linked data, and standards activity; and talking about free/open source software in libraries.  On the latter point, I encourage you to come see me give an introduction to the <a href="http://foss4lib.org/article/2012/jan/lyrasis-launches-foss4lib-provide-guidance-libraries-about-open-source-software" title="LYRASIS Launches FOSS4LIB to Provide Guidance to Libraries about Open Source Software | Library Open-Source Software Registry">newly announced FOSS4LIB site</a>, answer questions, and take feedback on <a href="http://foss4lib.org/content/learn-about-foss4lib-ala-midwinter" title="Learn about FOSS4LIB at ALA Midwinter | Library Open-Source Software Registry">Saturday morning (10:30 to 11:30) or Sunday morning (10:30 to 11:30)</a>.  (Or, if you are not coming to Midwinter, sign up for one of the free webinar sessions later in January and February.)</p><p>ALA is using a new iteration of its scheduler this year, and it keeps getting better and better.  This one even allows you to embed your selected schedule as an &lt;iframe&gt; on an arbitrary page.  So here is my schedule:</p><p><iframe src="http://alamw12.scheduler.ala.org/user/26508/schedule-embed" width="600" height="600"></iframe></p><p>You can <a target="_blank" href="https://twitter.com/intent/user?screen_name=DataG">follow me on <span style="background-image: url(&quot;//si0.twimg.com/images/dev/cms/intents/bird/bird_blue/bird_16_blue.png&quot;); background-repeat: no-repeat; padding-left: 18px;">Twitter</span></a> where I&#8217;ll be tweeting about <a href="https://twitter.com/#!/search/%23alamw12">#alamw12</a>.  A Twitter mention or direct message is also the best way to get ahold of me while in Dallas.</p><p>Safe travels if you are headed to Midwinter, and I hope to run into you there.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/alamw12-schedule/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>Open Repositories 2011 Report: Day 3 &#8211; Clifford Lynch Keynote on Open Questions for Repositories, Description of DSpace 1.8 Release Plans, and Overview of DSpace Curation Services</title><link>http://dltj.org/article/or11-report-4/</link> <comments>http://dltj.org/article/or11-report-4/#comments</comments> <pubDate>Sat, 11 Jun 2011 04:16:28 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[Clifford Lynch]]></category> <category><![CDATA[DSpace]]></category> <category><![CDATA[higher education]]></category> <category><![CDATA[Open Repositories 2011]]></category><guid isPermaLink="false">http://dltj.org/?p=3014</guid> <description><![CDATA[The main Open Repositories conference concluded this morning with a keynote by Clifford Lynch and the separate user group meetings began. I tried to transcribe Cliff&#8217;s great address as best I could from my notes; hopefully I&#8217;m not misrepresenting what &#8230; <a href="http://dltj.org/article/or11-report-4/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=3014"></abbr><p>The main <a href="https://conferences.tdl.org/or/index.php/OR2011/OR2011main">Open Repositories conference</a> concluded this morning with a keynote by <a href="http://www.cni.org/staff/clifford_index.html" title="CNI Staff: Clifford Lynch<br />1000">Clifford Lynch</a> and the separate user group meetings began.  I tried to transcribe Cliff&#8217;s great address as best I could from my notes; hopefully I&#8217;m not misrepresenting what he said in any significant ways.  He has some thought-provoking comments about the positioning of repositories in institutions and the policy questions that come from that.  For an even more abbreviated summary, check out this <a href="http://storify.com/datag/clifford-lynch-keynote-at-open-repositories-2011/" title="Clifford Lynch Keynote at Open Repositories 2011 - storify.com">Storify archive of tweets</a> during his keynote.  Then I attended the DSpace track of user group programming, and below there are summaries of plans for DSpace version 1.8 and the new DSpace Curation Services.</p><p><h2>Repositories: Major Progress and Open Questions</h2><br />Mindful that we are roughly a decade into building institutional repositories, Cliff said it was an appropriate time to look at what has been accomplished along with some of the open issues and new questions have emerged.  We still don&#8217;t have a good way to measure content in repositories.  People had radically different ideas from institution to institution on what is an &#8220;object&#8221; so those metrics don&#8217;t mean much; counting terabytes is equally fruitless because some repositories have video and others only have textual material.</p><p>Instead, the growth of repositories has highlighted critical policy discussions of what the missions of institutions of higher education are supposed to be.  Questions such as the responsibility to curate knowledge they create, curate the evidence on which inquiry is based, to disseminate knowledge.  These weren&#8217;t on the table 10-15 years ago. Now they are central issues for discussion in the leadership of universities.  Tied to institutional repositories is the question of open access.  The scope of issues goes beyond just open access, though.  It reaches into the kind of questions that are now getting traction like getting access to research data and an institution&#8217;s role of stewardship and disseminate of research data. Also the creation of open educational resources and institution&#8217;s responsibility to disseminate such resources.  These questions wouldn&#8217;t have emerged without the effort to build out institutional repositories.  As soon as you start talking about these questions they demand investments in infrastructure of institutional repositories.  So we should take satisfaction in the role that the efforts of deploying institutional repositories have played in advancing these critical policy discussions.  These questions have gone unanswered far too long.</p><p>That said, there is a danger of confusion of mechanism with policy.  We went through a bad period around 2004 when institutional repositories were deployed without knowing what would be put in them.  Institutional repositories are services to support policies, not ends to themselves.  We have mostly gotten past this and can have the discussion of whether institutionally based repositories are the appropriate tool and when should we build discipline-specific repositories or other kinds of platforms that are not institutionally focused.</p><p>He also noted that the question of institutional assets and the balance of faculty control with institutional responsibility is being talked about, if only quietly.  The piece of video that Clifford referred to &#8212; a talk by Derick Law on how universities have failed in their stewardship responsibilities of research &#8212; may be this video from the The Blue Ribbon Task Force on Sustainable Digital Preservation and Access&#8217;s <a href="http://brtf.vidizmo.com/MashupPlayBack.aspx?type=d&amp;id=HI3pHUmaP7o%3d" title="Conversations about Research Data and Scholarly Discourse video archive, National Conversation on the Economic Sustainability of Digital Information">National Conversation on the Economic Sustainability of Digital Information</a> (skip to &#8220;chapter 2&#8243; of the video) held April 1, 2010 in Washington DC.</p><p>Not only have institutional repositories acted as focal point for policy, they have also been a focal point for collaborations.  Library and IT collaborations were happening long before institutional repositories surfaced.  Institutional repositories, though, have been a great place to bring other people into that conversation, including faculty leaders to start engaging them in questions about dissemination of their work.  Also chief research officers; in 1995 if you were a university librarian doing leadership work constructing digital resources to change scholarly communication, would have talked to CIO but may not know who your chief research officer was at that point.  That set of conversations, which are now critical when talking about data curation, got their start with institutional repositories and related policies.</p><p>Another place for conversation has been those in the university administrations concerned with building public support for the institution.  By giving the public a deeper understanding of what the institution contributes to culture, industry, health and science, and connecting faculty to this effort.  This goes beyond the press release by opening a public window into the work of the institutions.  This is particularly important today with questions of public support for institutions.</p><p>That said, there are a number of open questions and places where we are dealing with works-in-progress.  Cliff then went into an incomplete and, from his perspective, perhaps idiosyncratic, list of these issues.</p><p>Repositories are one of the threads that are leading us nationally and internationally into a complete rethinking of the practice of name authority.  While it is a librarian, old fashion concept, but it is converging with &#8220;identity management&#8221; from IT.  He offered an abbreviated and exaggerated example:  librarians did name authority for authors of stuff in general in 19th century. In 20th century there was too much stuff, particularly stuff in journals and magazines became overwhelming.  So libraries backed off and focused only on books and stuff that went into catalogs; the rest they turned over to indexing and abstracting services.  We made a few weird choices like an authority file should be as simple as possible to disambiguate authors rather than be as full as possible, so we had the development of things along side name authority files like the national dictionaries of literary biographies.</p><p>For scientific journal literature, publishers followed practices about how obscure author names could be (e.g. just last name and first initial). Huge amounts of ambiguity of &#8220;telegraphic author names&#8221; results in a horribly dirty corpus of data.  A variety of folks are realizing that we need to disambiguate authorship by assigning author identifiers and somehow go back and cleanup the mess in the existing bibliographic data of scholarly literature, especially journal literature.  Institutions taking more responsibility for the work of their community, and having to do local name authority all over again. We have the challenge of how to reconnect this activity to national and international files.  We also have a set of challenges on whether we want to connect this to biographical resources.  It brings up issues of privacy, when do people do things of record, and how much else should come along with building a public biography resource.  We also see a vast parallel investment of institutional identity management.  Institutions haven&#8217;t quite figured out that people don&#8217;t necessarily publish with the same name that is recorded in the enrollment or employment systems that the institution manages, and that it would be a good idea to tie those literary names to identity files that the institution manages.</p><p>We&#8217;re not confident of the kind of ecological positioning institutional repositories among a pretty complicated array of information systems found at a typical large university.  Those systems include digital library platforms, course management systems, lecture capture systems, facilities for archiving the digital records of the institution, and platforms intended to directly support active research by faculty.  All are evolving at their own rate.  It is unclear where the institutional repositories fit, and what are the boundaries around them.</p><p>Here is one example.  What is the difference between an institutional repository and a digital library/collection?  You&#8217;d get very different answers from different people.  One might be who does the curation, how it is sourced, and how it is scoped.  The decisions are largely intellectual.  Making this confusing is that you&#8217;ll see the same platform for institutional repositories and digital library platforms.  We are seeing a convergence of the underpinning platforms.</p><p>Another one: learning management systems (LMS).  These are virtually universal among institution in the same timeframe that institutional repositories have been deployed.  We&#8217;ve done a terrible job at thinking about what happens to the stuff in them when the course is over.  We can&#8217;t decide if it is scholarly material, institutional records, or something else.  They are tangled up between learning materials and all of the stuff that populates a specific performance of a course such as quizzes and answers, discussion lists, and student course projects.  We don&#8217;t have taxonomies and policies here and a working distinction between institutional repositories and learning management systems.  It is an unusual institution that has as systematic export from the LMS to an IR.</p><p>Lecture capture systems becoming quite commonplace; students are demanding them in much the same way that the LMS was demanded.  A lecture capture system may be more universally helpful than an LMS.  Lectures being captured for a wide range of reasons, but not knowing why means it is difficult to know whether to keep them and how to integrate them into the institution&#8217;s resources.</p><p>Another example: the extent to which institutional repositories should sit in the stream of active work.  As faculty are building datasets and doing computation with them, when is it time for something to go into an institutional repository.  How volatile can content be in the repository?  How should repositories be connected or considered as robust working storage?  He suspects that many institutional repositories are not provisioned with high-performance storage and network connections, and would become a bottleneck in the research process.  The answers would be different for big data sets and small data sets, and we are starting to see datasets that are too big to backup or two big to replicate.</p><p>Another issue is that of virtual organizations, the kind of collaborative efforts that span institutions and nations.  They often allow relatively low overhead to mobilize researchers to work on a problem, and are becoming commonplace in sciences and social sciences and starting to pop up in the humanities.  We have a problem for the rules-of-the-road between virtual organizations and institution-based repositories.  It is easy to spin up an institutional repository for a virtual organization, but what happens to it when the virtual organization shuts down.  Some of these organizations are intentionally transient; how do we assign responsibility for a world of virtual organizations and map them into institutional organizations for long-term stewardship.</p><p>Software is starting to concern people.  So much scholarship is tied up now in complicated software systems that we are starting to see a number of phenomena.  One is data that is difficult to reuse or understand without the software.  Another is the is difficulty surrounding reproducibility &#8212; taking results and realizing they are dependent on an enormous stack of software and we don&#8217;t have a clear way to talk about the provenance of a result that is based on the stack of software versions that would allow for high-confidence in reproduction of results. We&#8217;re doing to have to deal with software.  We are also entering an era of deliberate obsolescence of software; for instance, any Apple product that is older than a few years is going to the dustbin and it hasn&#8217;t been fully announced or realized so that people can deal with it.</p><p>Another place that has been under-exploited is the question of retiring faculty and repositories. Taking inventory of someone&#8217;s scholarly collections and migrating it to an institutional framework in an orderly fashion.</p><p>How we reinterpret institutional repositories going beyond universities. For example there is something that looks a bit like an institutional repository but has some different things about it that belongs in public libraries or historic societies or similar.  This dimension bears exploration.</p><p>To conclude his comments he talked about a last open issue.  When we talk about good stewardship and preservation of digital materials, there are a couple of ideas that have emerged as we tried to learn from our past stewardship of print scholarly literature.  One of these principles is that geographic replication is a good thing; we&#8217;re starting to see this in a sense that most repositories are based on some geographically redundant storage system or we&#8217;ll see a steady migration towards this in the next few years.  A second one is organizational redundancy.  If you look at the print work, it wasn&#8217;t just that the scholarly record wasn&#8217;t in a number of independent locations but also that control was replicated among institutions that were making independent decisions about adding materials to their library collection. Clearly they coordinated to a point, but they also have institutional independence. We don&#8217;t know how to do this with institutional repositories. This is also emerging in special collections as they become digital.  Because they didn&#8217;t start life as published materials in many replicated versions, we need other mechanisms to have curatorial responsibility distributed.  This is linked to the notion that it is usually not helpful to talk about preservation in terms like &#8220;eternity&#8221; or  &#8220;perpetuity&#8221; or life-of-the-republic.  It is probably better in most cases to think about preservation in one chunk at a time; an institution making a 20-year or 50-year commitment with a well-structured process at the end.  That process includes whether an institution should renew the commitment and if not other interested parties could come in and take responsibility with a well-ordered hand-off.  This ties into policies and strategies for curatorial replication across institutions and ways that institutional repositories will need to work together.  It may be less critical today, but will become increasingly critical.</p><p>In conclusion, Cliff said that he hoped left the attendees with a sense that repositories are not things that stand on their own.  That they in fact are mechanism that advance policy in a very complex ecology of systems.  In fact, we don&#8217;t have our policy act together on many systems adjacent to the repository that leads to issues of appropriate scope and interfaces with those systems.  Where repositories will evolve to in the future as we understand the role of big data is also of interest.</p><p><h2>DSpace 1.8</h2><br />Robin Taylor, the DSpace version 1.8 release manager, gave an overview of what was planned (not promised!) for the next major release.  The release schedule was to have a beta last week, but that didn&#8217;t happen.  The remainder of the schedule is to have a beta on July 8th, feature freeze on August 19th, release candidate 1 published on September 2nd in time for the test-a-thon from the 5th to the 16th, followed by a second release candidate on September 30th, final testing October 3rd through the 12th, and a final release on October 14th.  He then went into some of the planned highlights of this release.</p><p>SWORD is a lightweight protocol for depositing items between repositories; it is a profile of the Atom Publishing Protocol.  At the current release, DSpace has be able to accept items; the planned work for 1.8 will make it possible to send items.  Some possible use cases: publishing from a closed repository to an open repository, sending from the repository to a publisher, from the repository to a subject-specific service (such as arXiv), or vice versa.  The functionality was copied from the Swordapp demo.  It supports SWORD v1 and only the DSpace XMLUI.  A question was ask about whether the SWORD copy process is restricted to just the repository manager? The answer was that it should be configurable.  On the one hand it can be open because it is up to the receiving end to determine whether or not to accept it.  On the other hand, a repository administrator might want to prevent items being exported out of a collection.</p><p>MIT has rewritten the Creative Commons licensing selection steps. It uses the Creative Commons web services (as XML) rather than HTML iframes, which allows better integration with DSpace.  As an aside, the Creative Commons and license steps have been split into two discrete steps allowing different headings in the progress bar.</p><p>The DSpace Community Advisory Team prioritized issues to be addressed by the developers, and for this release they include JIRA issue <a href="https://jira.duraspace.org/browse/DS-638">DS-638</a> for virus checking during submission.  The solution invokes the existing Curation Task and  requires Clam AV antivirus software to be installed.  It is switched off by default and is configured in submission-curation.cfg.  Two other issues that were addressed are <a href="https://jira.duraspace.org/browse/DS-587">DS-587</a> (Add the capability to indicate a withdrawn reason to an Item) and <a href="https://jira.duraspace.org/browse/DS-164">DS-164</a> (Deposit interface), which was completed as the Google Summer of Code Submission Enhancement project.</p><p>Thanks to Bojan Suzic in his Google Summer of Code project, DSpace has had a REST API.  The code has been publicly available and repositories have been making use of it, so the committers group want to get it into a finished state and include it in 1.8.  There is also work on an alternative approach to a REST API.</p><p>DSpace and DuraCloud was also covered; it was much the same that <a href="http://dltj.org/article/or11-report-1/">I reported on earlier this week</a>, so I&#8217;m not repeating it here.</p><p>From the geek perspective, the new release will see increasing modularization of the codebase and more use of Spring and the DSpace Services Framework.  The monolithic dspace.cfg will be split up into separate pieces; some pieces would move into Spring config while other pieces could go into the database.  It will have a simplified installation process, and several components that were talked about elsewhere at the meeting: WebMVC UI, configurable workflow, and more curation tasks.</p><p><h2>Introduction to DSpace Curation Services</h2><br />Bill Hays talked about curation tasks in DSpace.  Curation tasks are Java objects managed by the Curation System.  Functionally, they are an operation run on a DSpace Object and (optionally) its contained objects (e.g., community, subcommunity, collection, and items).  They do not work site-wide and not on bundles or bitstreams. The tasks can be run in multiple ways by different types of administrative users, and they are configured separately from dspace.cfg.</p><p>Some built-in tasks are to validate metadata against input forms (halts on task failure), count bitstreams by format type, virus scan (uses external virus detection service), on ingest (the desired use case), and the replication suite of tasks for DuraCloud.  Other tasks: link checker and 11 others (from Stuart Lewis and Kim Shepherd), format id with DROID (in development), validate/add/replace metadata, status report on workflow items, filter media in workflow (proposed), and checksum validation (proposed).</p><p>What does this mean for different users?  As a repository or collection manager, it means new functionality &#8212; GUI access without GUI development: curation, preservation, validation, reporting. As a developer: rapid development, and deployment of functionality without rebuilding or redeploying the DSpace instance.</p><p>The recommended Java development environment for tasks is with a package outside of <code>dspace-api</code>.  Make a POM with dependency on <code>dspace-api</code>, especially <code>/curate</code>.  Required features of the task are a constructor with no arguments to support loading as a plugin and that it implements the CurationTask interface or extends the AbstractCurationTask class.  Deploy it as a JAR and configure (similar to a DSpace plugin)</p><p>There are some Java annotations for Curation Task code that are important to know about.  Setting <code>@Distributive</code> means that the task is responsible for handling any contained DSpace objects as appropriate.  Otherwise the default is to have the task executed across all contained objects (subcommunities, collections, or items). Setting <code>@Suspendable</code> means the task interrupts processing when first FAIL status is returned.  Setting <code>@Mutative</code> means the task makes changes to target objects.</p><p>Invoking tasks can be done several ways: from the web application (XMLUI), the command line, from workflow, from other code, or from a queue (deferred operation).  In the case of the workflow, one can target the action of the task at anywhere in the workflow steps (e.g. before step 1, step 2, step 3 or at item installation).  Actions (reject or approve) are based on tasks results, and notifications are sent by e-mail.</p><p>A mechanism for discovering and sharing tasks doesn&#8217;t exist yet.  What is needed is a community repository of tasks.  For each task what is needed is: a descriptive listing, documentation, reviews/ratings, link to source code management system, and link to binaries applicable to specific versions.</p><p>With dynamic loading with scripting languages in <a href="http://java.sun.com/developer/technicalArticles/J2SE/Desktop/scripting/" title="Scripting for the Java Platform">JSR-223</a>, it is theoretically possible to create Curation Tasks in Groovy, JRuby, Jython, although the only one Bill has been able to get to work so far has been Groovy.  Scripting code needs a high level of interoperability with Java, and must implement the CurationTask interface.  Configuration is a little bit different: one needs a taskcatalog with descriptors for language, name of script, and how the constructor is called.  Bill demonstrated some sample scripts.</p><p>In his conclusion, Bill said that the new Curation Services: increases functionality for content in a managed framework; has multiple ways of running tasks for different types of users and scenarios; makes it possible to add new code without a rebuild; simplifies extending DSpace functionality; and with scripting lowers the bar even more.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/or11-report-4/feed/</wfw:commentRss> <slash:comments>12</slash:comments> </item> <item><title>Open Repositories 2011 Report: Day 2 with DSpace plus Fedora and Lots of Lightning Talks</title><link>http://dltj.org/article/or11-report-3/</link> <comments>http://dltj.org/article/or11-report-3/#comments</comments> <pubDate>Fri, 10 Jun 2011 03:58:20 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[DSpace]]></category> <category><![CDATA[fascinator]]></category> <category><![CDATA[Fedora]]></category> <category><![CDATA[microservice]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[Open Repositories 2011]]></category> <category><![CDATA[premis]]></category> <category><![CDATA[simile]]></category><guid isPermaLink="false">http://dltj.org/?p=3004</guid> <description><![CDATA[Today was the second day of the Open Repositories conference, and the big highlight of the day for me was the panel discussion on using Fedora as a storage and service layer for DSpace. This seems like such a natural &#8230; <a href="http://dltj.org/article/or11-report-3/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=3004"></abbr><p>Today was the second day of the <a href="https://conferences.tdl.org/or/index.php/OR2011/OR2011main">Open Repositories conference</a>, and the big highlight of the day for me was the panel discussion on using Fedora as a storage and service layer for DSpace.  This seems like such a natural fit, but with two pieces of complex software the devil is in the details.  Below that summary is some brief paragraphs about some of the 24&#215;7 lightning talks.<br /><span id="more-3004"></span><br /><h2>Fedora inside DSpace</h2><br /><a href="https://profiles.google.com/mdiggory/about">Mark Diggory of @MIRE</a> moderated a panel of <a href="http://loomware.typepad.com/about.html" title="Mark Leggott">Mark Leggott</a> (Islandora, DiscoveryGarden and UPEI), <a href="http://bradmc.users.sourceforge.net/" title="SourceForge.net: Bradley McLean - Developer Web Hosting - Open Source Software">Bradley McLean</a> (CTO for DuraSpace), <a href="http://www.linkedin.com/pub/richard-rodgers/4/147/b2a" title="Richard Rodgers  | LinkedIn">Richard Rogers</a> (Head of Software at MIT Libraries), <a href="http://ryan.scherle.org/" title="Ryan's Home">Ryan Scherle</a> (Technical Lead, Dryad Digital Repository), and <a href="http://www.blogger.com/profile/13668812856976810177" title="Matt Zumwalt | Blogger">Matt Zumwalt</a> (MediaShelf; Technical Lead, Hydra) on the topic of &#8220;DSpace with Fedora Inside.&#8221;  At last year&#8217;s Open Repositories conference there was a call for the DSpace and Fedora communities to explore this idea.  All of the content and metadata would be stored in Fedora with DSpace continuing to provide the user interface for workflow, discovery and administration.  Or, viewed another way, retain the out-of-the-box experience of DSpace while exposing the versioning, object relationship, and flexible architecture features provided by Fedora.  Work on this has been going on for a number of years, starting in 2007 with Scott Yeadon demonstrating object portability between Fez/Fedora and DSpace.  In 2008, 2009 and 2010 there were three Google Summer of Code projects by Andrius Blažinskas that laid the groundwork for some of this integration by abstracting the DSpace storage layer options.</p><p>Here are some of the questions and responses from the panel.  I hope I&#8217;m representing everyone&#8217;s views as intended; comments and clarifications are welcome.</p><p>Will adding DSpace on Fedora make DSpace even more complex? <i>Matt:</i> It is an opportunity to revisit the design assumptions &#8220;and clean up your work.&#8221; This is an example of where the transition will create the opportunity to tidy up the complexity and make DSpace simpler while gaining the flexibility of Fedora. <i>Mark:</i> The complexity of DSpace isn&#8217;t in its content model, it is in trying to use the existing content model to do things DSpace wasn&#8217;t designed to do. For instance, Islandora has more complex atomistic content models, particularly with science data, than DSpace&#8217;s content model. <i>Bradley:</i> Acknowledged that there is a risk here, but &#8220;if it becomes more complex we are doing it wrong.&#8221;</p><p>There is concern from the DSpace because it may have to change to accommodate Fedora, but Fedora will not need to be changed. <i>Bradley:</i> New ideas are sources of concern.  It is difficult to categorize DSpace developers as a whole.  Any time you move a major application on top of another component you may find the underlying APIs need to change.</p><p>At what level do we align DSpace and Fedora? Do we really need an intermediary format (AIP)? <i>Bradley:</i> If all we do is find a way to graft the existing DSpace workflows onto a Fedora that is very specific to DSpace and can&#8217;t be used with other Fedora tools, then we haven&#8217;t moved very far.  The end goal is to find those places where DSpace is not formally specified enough and get it formally specified. <i>Mark:</i> Islandora hooks Drupal into Fedora. 80%-90% of the time we work with the Drupal-Fedora API &#8212; a simple PHP wrapper around Fedora API.  It transforms calls into appropriate actions into Drupal. Another option comes from an Italy project that created a synchronization of Fedora objects and Drupal Nodes; it copies information from Fedora into the Drupal RDBMS.  Other applications like Omeka have a Fedora plugin.  DiscoveryGarden has also looked at things like WordPress with Fedora underneath.  As repository services become more intelligent about microservices it would take even less time to make these integrations.</p><p>What benefit does Fedora receive? <i>Richard:</i> For the Fedora community, DSpace alignment would provide a rich IR content model for Fedora. <i>Ryan:</i> DSpace was designed as an IR and nothing else; it has that at its core.  The problems that people have with DSpace are when people try to make it do something outside that vision.  Having a Fedora repository and have a DSpace interface for those IR use cases and something like Hydra or Islandora for people using those use cases. <i>Matt:</i> There has never been a large cohesion of IR workflow in Fedora; having this workflow satisfy the IR use case. <i>Bradley:</i> DSpace with Fedora inside is a repackaging with a slightly different set of existing components.  DSpace across its lifetime has tried to become more modular; integration with Fedora will make this clearer. <i>Mark:</i> Would agree that one of the main things DSpace brings to Fedora is the workflow tool and also the back-end data transformation workflows.  But he has also never been a fan of the DSpace workflow because the staunch requirement to fill out a lot of metadata is a mistake.  Working with science data, researchers want to ingest 100K microscope images without metadata then go back and add metadata with time. <i>Ryan:</i> (Agreeing) Some of the requirements of the native installation of DSpace is difficult to work with in other use cases.  Started with configuring the workflow as much as could be done with config file, but then created a new workflow process that still used many of the underlying tools.</p><p>The way I think of the motivation is that DSpace on Fedora will have the same easy setup with access to the underlying APIs for customization. <i>Bradley:</i> Yes &#8212; that is an aspirational goal. The practical realities mean that we will have to take steps there one at a time.  And given the time scale the question comes whether we will get there before we decide to do something else.  One of them &#8212; sort of unsaid &#8212; is to take a look at EPeople and see how that would migrate. <i>Ryan:</i> Unlocking the data is one thing, but unlocking the underlying datastreams as a API.  DSpace storage API is opaque.  You can rebuild everything from the underlying storage. [Ryan also calls out an old DLTJ post: A key advantage of DSpace with Fedora inside. "<a href="http://dltj.org/p38" class="broken_link" rel="nofollow">Why Fedora? Because You Don’t Need Fedora</a>".]</p><p><h2>Lightning Talks</h2><br />The other sessions that I went to today were of the quick 24&#215;7 type.  Here are some highlights.</p><p>Mark Phillips talked about a <strong>PREMIS Event Service</strong>.  He needed a way to log events that occur during the life of digital objects (virus checks, ingest, fixity check, replication) &#8212; when they occur and their outcomes. So he built a microservice based on AtomPub.  Each repository component sends outcomes of an event to this service, a central event collector.  Uses the PREMIS Event and Agent Modules.  Events metadata include: event_identifier, event_type, event_timestamp, event_outcome, outcome_detail, agent_identifier, object_identifier, and event_detail.  Agent metadata includes details about the software, human or organization triggering the event.  An AtomPub feed for each object returns all events for that object.  The system includes a basic search interface to see all events of a particular type and enables a feed to be set up on those searches.  There is the ability to harvest all events via OAI-PMH and Atom.  It is built with Django and Python and will be release on the MetaArchive Google Code page.</p><p>Peter Sefton talked about <strong>The Fascinator &amp; Fedora Commons: A Toolkit Tour</strong>. <a href="http://sites.google.com/site/fascinatorhome/" title="The Fascinator">Fascinator</a> is a java-based platform targeting repository solutions. It is open source (GPL), a plugin-based platform, and highly customizable.  It was first used as an aggregator of data from various sources into a discovery service.  They tried doing the same thing on the desktop computer (something a researchers could put on a personal computer, index data, group/describe it).  He thinks of the process as a conveyor belt: harvest digital objects, transform them, store them, index them and find them.  Harvest: draw into your digital ecosystem files, databases, online resources.  Transform: be ready to present and share.  Real web stuff &#8212; not PDFs.  Video and image previews. Multiple renditions for a multi-platform world.  Storage: Store originals and their friends; basic filesystem storage or use Fedora Commons.  Index: Apache SOLR index.  Find: Faceted search interface.  Web previews (turn Word into HTML and PDF for preview, same for video transcoding). Easily customizable UI (Jython and Velocity).  It is used by REDDBOX, Mint (described at a session yesterday), and a library of university policies (policies sent from Microsoft Sharepoint and transformed into HTML and PDF).</p><p>Rich Rogers talked about <strong>Publishing Large, Data-Rich Collections on the Web with Exhibit3</strong>. We collect and we like to share what we collect. Nowadays we live on the web, so how can we share out collections there? <a href="http://www.simile-widgets.org/exhibit/" title="Exhibit | SIMILE Widgets">Exhibit</a> is all I need to publish my collection to the web: no backend database, no server application; it will even convert a spreadsheet into usable data on the web.  Originally created by the SIMILE Project at MIT, Exhibit is an entire data publication platform with &#8216;list&#8217; and &#8216;tabular. views.  There is also a rich library of additional views. If temporal data, scrollable interactive timeline.  If geospatial data, interactive mapping displays.  Numerical data, scatter or time plots.  It is installable by HTML configuration and uses a browser-resident RDF database in JSON.</p><p>Geri Ingram and Carol Godby talked about <strong>Sustaining Collaboration Among Open Access Repositories</strong> focused on the <a href="http://www.oclc.org/gateway/" title="WorldCat Digital Collection Gateway  | OCLC">WorldCat Digital Collection Gateway</a>.  WorldCat broadens exposure to digital object collections; end users click through to the hosting repository server from WorldCat.org.  Metadata from OAI-PMH-compliant repositories are regularly harvested to WorldCat through the Gateway; digital objects remain on local repository server.  The gateway includes a translation service that allows repository managers to create mappings from Dublin Core (or selected other metadata schemas) to MARC.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/or11-report-3/feed/</wfw:commentRss> <slash:comments>7</slash:comments> </item> <item><title>Open Repositories 2011 Report: Day 1 with Apache, Technology Trends, and Bolded Labels</title><link>http://dltj.org/article/or11-report-2/</link> <comments>http://dltj.org/article/or11-report-2/#comments</comments> <pubDate>Thu, 09 Jun 2011 04:16:30 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[Apache Foundation]]></category> <category><![CDATA[emerging technology]]></category> <category><![CDATA[Open Repositories 2011]]></category> <category><![CDATA[web design]]></category><guid isPermaLink="false">http://dltj.org/?p=2959</guid> <description><![CDATA[Today was the first main conference day of the Open Repositories conference in Austin, Texas. There are 300 developers here from 20 countries and 30 states. I have lots of notes from the sessions, and I&#8217;ve tried to make sense &#8230; <a href="http://dltj.org/article/or11-report-2/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2959"></abbr><p>Today was the first main conference day of the <a href="https://conferences.tdl.org/or/index.php/OR2011/OR2011main">Open Repositories conference</a> in Austin, Texas.  There are 300 developers here from 20 countries and 30 states.  I have lots of notes from the sessions, and I&#8217;ve tried to make sense of some of them below before I lose track of the entire context.</p><p>The meeting opened with the a keynote by <a href="http://www.jimjag.com/" title="Excuse me, do I know you?">Jim Jagielski</a>, president of the <a href="http://www.apache.org/" title="Welcome to The Apache Software Foundation!">Apache Software Foundation</a>.  He gave a <a href="http://people.apache.org/~jim/presos/OR2011/Open_Source_NotJust.pdf" title="Presentation Slides from Open Source: It's Not Just for IT Anymore">presentation</a> on what it means to be open source project with a focus on how Apache creates a community of developers and users around its projects.</p><p><div id="attachment_2961" class="wp-caption alignright" style="width: 310px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><img src="http://cdn.dltj.org/wp-content/uploads/2011/06/Open_Source_NotJust-slide50-300x224.jpg" alt="Slide 50 of Open Source: It&#039;s Not Just for IT Anymore" title="Slide 50 of Open Source: It&#039;s Not Just for IT Anymore" width="300" height="224" class="size-medium wp-image-2961" /> <img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://cdn.dltj.org/wp-content/uploads/2011/06/Open_Source_NotJust-slide51-300x224.jpg" alt="Slide 50 of Open Source: It&#039;s Not Just for IT Anymore" title="Slide 51 of Open Source: It&#039;s Not Just for IT Anymore" width="300" height="224" class="alignright size-medium wp-image-2962" /><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Slides 50 and 51 of &quot;Open Source: It&#039;s Not Just for IT Anymore&quot;</p></div>One of the take-a-ways was a characterization of Open Source Licenses from <a href="http://rollerweblogger.org/roller/entry/gimme_credit_gimme_fixes_gimme" title="Blogging Roller: Gimme credit, gimme fixes, gimme it ALL!">Dave Johnson</a>.  Although it is a basic shorthand, it is useful in understanding the broad classes of licenses:</p><ul type="disc"><li>Give Me Credit (&#8220;You can use, modify and redistribute my code in your product but give me credit&#8221;): <a href="http://www.opensource.org/licenses/apachepl.php" title="Apache Public Licenses | Open Source Initiative">Apache License</a>, <a href="http://www.opensource.org/licenses/bsd-license.php" title="Open Source Initiative OSI - The BSD License:Licensing | Open Source Initiative">Berkeley License</a>, and <a href="http://www.opensource.org/licenses/mit-license.php" title="Open Source Initiative OSI - The MIT License (MIT):Licensing | Open Source Initiative">MIT License</a></li><li>Give Me Fixes (&#8220;You can use, modify and redistribute my code in your product but give me the source for any fixes you make to it.&#8221;): <a href="http://www.opensource.org/licenses/mozilla1.1.php" title="Open Source Initiative OSI - Mozilla Public License 1.1 (MPL-1.1) :Licensing | Open Source Initiative">Mozilla Public License</a>, <a href="http://www.opensource.org/licenses/EPL-1.0" title="Open Source Initiative OSI - Eclipse Public License 1.0 (EPL-1.0):Licensing | Open Source Initiative">Eclipse Public License</a>, and <a href="http://www.opensource.org/licenses/lgpl-license.php" title="LGPL Licenses | Open Source Initiative">GNU Library or &#8220;Lesser&#8221; General Public License</a></li><li>Give Me Everything (&#8220;You can use, modify and redistribute my code in your product but give me your entire product&#8217;s source code.&#8221;): <a href="http://www.opensource.org/licenses/gpl-license.php" title="GNU General Public License Versions | Open Source Initiative">GPL</a></li></ul><p>He also explained why community and code are at peer levels in Apache.  One is not possible without the other; it takes an engaged community to create great code and great code can only be created by a healthy community.  He also described how the primary communications tool for projects is not new-fangled technologies like wikis and conference calls and IRC.  The official record of a project is its e-mail lists.  This enables the broadest possible inclusion of participants across many time zones and the list archives enable people to look into the history of decisions.  If discussions take place in other forums or tools, the summary is always brought back to the e-mai list.</p><p>Jim&#8217;s concluding thoughts were a great summary of the presentation, and I&#8217;ve inserted them in on the right.</p><p>I missed the first concurrent session of the day due to work conflict, so the first session I went to was the after lunch 24&#215;7 presentations.  That is no more than 24 slides in no more than seven minutes.  I like this format because it forces the presenters to be concise, and if the topic is not one that interests you it isn&#8217;t long until the next topic comes up.  The short presentations are also great for generating discussion points with the speakers during breaks and the reception.  Two of these in particular struck a cord with me.</p><p>The first was &#8220;Technology Trends Influencing Repository Design&#8221; by Brad McLean of DuraSpace.  His list of four trends were:</p><ol type="1" start="1"><li><strong>Design for mobile, not just PCs.</strong> The model of a mobile app &#8212; local computation and page rendering backed by web services for retrieving data &#8212; is having several impacts on design:  a reinforcement of the need for lightweight web services and UIs; accounting for how screen size has shrunk again; and having a  strategy for multi-platform apps will become critical.</li><li><strong>More programming language(s) than you need/want.</strong> Java, Python, Ruby, Scala, LISP, Groovy, JavaScript and the list goes on.  This proliferation of languages has forced looser coupling between components (e.g. a JavaScript based script can consume data from and write data to a Java-based servlet engine).  The implications he listed for this are that it is even clearer that true integration challenges are in the data modeling and policy domains; harder to draw neat boxes around required skill sets; and that you might lose control of your user experience (and it might be a good thing).</li><li><strong>Servers and clusters.</strong> Clusters are not just for high-performance computing and search engines anymore.  Techniques like map/reduce are available to all.  He said that Ebay was the last major internet company to deploy its infrastructure on &#8220;big iron&#8221; but he didn&#8217;t attribute that statement to a source.  (Seems kind of hard to believe&#8230;)  The implications are that we should look to replicated and distributed SOLR indexing (hopefully stealing a page from &#8220;noSQL&#8221; handbook); keep an eye on Map/Reduce-based triple stores (interesting idea!); and repository storage will be spanning multple systems.</li><li><strong>What is a filesystem.</strong> Brad noted that with filesystems what was once hidden from the end user (think the large systems of the 1960s, 1970s and 1980s) became visible (the familiar desktop file folder structure) and is now becoming hidden again (as with mobile device apps).  Applications are now storing opaque objects again; how do we effectively ingest them into our repositories?</li></ol><p><div id="tweet_78539910046425089" class="wp-caption alignright" style="width: 310px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><br /><style type='text/css'>.bbpBox78539910046425089{background:url(http://a1.twimg.com/images/themes/theme8/bg.gif) #8B542B;padding:20px;margin:0px}p.bbpTweet{background:#fff;padding:10px
12px 10px 12px;margin:0;min-height:48px;color:#000;font-size:18px !important;line-height:22px;-moz-border-radius:5px;-webkit-border-radius:5px}p.bbpTweet
span.metadata{display:block;width:100%;clear:both;margin-top:8px;padding-top:12px;height:40px;border-top:1px solid #fff;border-top:1px solid #e6e6e6}p.bbpTweet span.metadata
span.author{line-height:19px}p.bbpTweet span.metadata span.author
img{float:left;margin:0
7px 0 0px;width:38px;height:38px}p.bbpTweet a:hover{text-decoration:underline}p.bbpTweet
span.timestamp{font-size:12px;display:block}</style><div class='bbpBox78539910046425089'><p class='bbpTweet'>Takeaways from Simeon: think about what to present sans label; find cues you can use instead of labels; use labels for 2ndary info. <a href="http://twitter.com/search?q=%23or11" title="#or11" class="tweet-url hashtag" rel="nofollow">#or11</a><span class='timestamp'><a title='Wed Jun 08 19:12:28 +0000 2011' href='https://twitter.com/reporat/statuses/78539910046425089'>less than a minute ago</a> via <a href="http://www.tweetdeck.com" rel="nofollow" title="TweetDeck - Your social world">TweetDeck</a> <a href='http://twitter.com/intent/favorite?tweet_id=78539910046425089' title="http://twitter.com/intent/favorite?tweet_id=78539910046425089"><img src="http://cdn.dltj.org/wp-content/uploads/2011/06/favorite.png" /> Favorite</a> <a href='http://twitter.com/intent/retweet?tweet_id=78539910046425089' title="http://twitter.com/intent/retweet?tweet_id=78539910046425089"><img src="http://cdn.dltj.org/wp-content/uploads/2011/06/retweet.png" /> Retweet</a> <a href='http://twitter.com/intent/tweet?in_reply_to=78539910046425089' title="http://twitter.com/intent/tweet?in_reply_to=78539910046425089"><img src="http://cdn.dltj.org/wp-content/uploads/2011/06/reply.png" /> Reply</a></span><span class='metadata'><span class='author'><a href="http://twitter.com/RepoRat" title="http://twitter.com/RepoRat" class="broken_link" rel="nofollow"><img src="http://cdn.dltj.org/wp-content/uploads/2011/06/rat_normal.jpg" /></a><strong><a href="http://twitter.com/RepoRat" title="http://twitter.com/RepoRat" class="broken_link" rel="nofollow">Rattus repositor</a></strong><br />RepoRat</span></span></p></div><p><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Tweet from Dorothea Salo</p></div><br />The second 24&#215;7 talk that struck a chord was &#8220;Don’t Bold the Field Name&#8221; by Simeon Warner.  And by that he literally meant &#8220;Don&#8217;t Bold the Field Name&#8221;.  He walked through a series of library interfaces and noted how we have a tendancy to display bolded field labels.  He then pointed out how this draws the eye&#8217;s attention to the labels and not the record content beside the labels.  Amazon doesn&#8217;t do this (at least with the metadata at the top of the page), Ebay doesn&#8217;t do this, and the search engines don&#8217;t do this.  He did note &#8212; pointing to the case of the &#8220;Product Details&#8221; section of an Amazon item page &#8212; that &#8220;the task of finding the piece of information is more important than consuming it.&#8221;  (Again, in the Amazon case, the purpose of bolding the label is to draw the eye to the location of data like publisher and shipping weight on the page.)  I think <a href="https://twitter.com/reporat/statuses/78539910046425089">Dorothea Salo&#8217;s tweet</a> summed it up best:  &#8220;Takeaways from Simeon: think about what to present sans label; find cues you can use instead of labels; use labels for 2ndary info. #or11&#8243;</p><p>I also attended the two sessions on identifiers in the afternoon (Peter Sefton&#8217;s &#8220;A Hint of Mint: Linked Authority Control Service&#8221; and Richard Rodgers&#8217;s &#8220;ORCID: Open Research and Contributor ID &#8212; An Open Registry of Scholarly IDs&#8221;), but the time is late and tomorrow&#8217;s events will come soon enough.  Given eough time and energy, I&#8217;ll try to summarize those sessions later.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/or11-report-2/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> <item><title>Open Repositories 2011 Report: DSpace on Spring and DuraSpace</title><link>http://dltj.org/article/or11-report-1/</link> <comments>http://dltj.org/article/or11-report-1/#comments</comments> <pubDate>Wed, 08 Jun 2011 13:50:36 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[DSpace]]></category> <category><![CDATA[Duracloud]]></category> <category><![CDATA[Fedora]]></category> <category><![CDATA[Open Repositories 2011]]></category> <category><![CDATA[spring framework]]></category><guid isPermaLink="false">http://dltj.org/?p=2943</guid> <description><![CDATA[This week I am attending the Open Repositories conference in Austin, Texas, and yesterday was the second preconference day (and the first day I was in Austin). Coming in as I did I only had time to attend two preconference &#8230; <a href="http://dltj.org/article/or11-report-1/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2943"></abbr><p>This week I am attending the <a href="https://conferences.tdl.org/or/index.php/OR2011/OR2011main">Open Repositories conference</a> in Austin, Texas, and yesterday was the second preconference day (and the first day I was in Austin).  Coming in as I did I only had time to attend two preconference sessions: one on the integration &#8212; or maybe &#8220;invasion&#8221; of the <a href="http://www.springsource.org/" title="SpringSource.org |">Spring Framework</a> &#8212; into <a href="http://www.dspace.org/" title="http://www.dspace.org/">DSpace</a> and one on the introduction of the <a href="http://www.duraspace.org/duracloud.php" title="NOW AVAILABLE: DuraCloud Open Source 0.7 | Duraspace">DuraCloud</a> service and code.<br /><span id="more-2943"></span><br /><h2>DSpace and the Spring Framework</h2><br />The Spring-Framework-in-DSpace was presented by <a href="https://profiles.google.com/mdiggory/about">Mark Diggory of @MIRE</a>.  He spoke from a DuraSpace wiki page set up as a <a href="https://wiki.duraspace.org/display/DSPACE/DSpace+Spring+Services+Tutorial">tutorial</a> on the topic.  In the first part of his presentation he introduced the inversion-of-control pattern, explaining why it is useful and showing how it works with simple code examples.  He then showed how a Spring-based ServiceManager can be integrated into the DSpace main code and then how new services can be plugged into that manager.</p><p>I came into the session more familiar with the Spring Framework than with the DSpace code, so I found the session to be a good introduction to some of the DSpace concepts even though I wasn&#8217;t the target audience.  (I imagine the target audience was someone familiar with the DSpace code wanting to learn about the Spring Framework.)  Thanks, Mark, for putting up the web tutorial and walking through it during the preconference session.</p><p><h2>DuraCloud Introduction</h2><br />The second preconference I went to was on the introduction of DuraCloud services from DuraSpace.  I can honestly say that I didn&#8217;t get what DuraCloud was supposed to be before, but seeing the about-to-be-released web interface I can say I think I finally get it.  DuraCloud is going to be both open source software and a service from DuraSpace that can back up a repository with storage, media access services, and compute/transformation services.</p><p>The session showed the web-based administration interface and the supporting tools for integrating a DSpace repository and a Fedora repository into DuraCloud.  Attendees were also given access to a command-line Java application that could be used to upload content into a DuraCloud instance, although sadly it wasn&#8217;t demonstrated during the preconference session.  (Perhaps I&#8217;ll try it out on the sly later with the DuraCloud credentials they gave us at the session&#8230;)  In addition to the functinality being built into DSpace and Fedora there will be REST-based code libraries for Java, PHP and Python &#8212; meaning that any developer could write code to make use of DuraCloud with any repository platform.  The whole DuraCloud application is going to be released into open source under the Apache 2.0 license as part of the efforts to create a community using the code and encourage others to write new services for DuraCloud.  This is something I&#8217;m going to keep watching; I&#8217;ve already signed up for a preview account for when the beta is released later this month.</p><p>Thanks also to DuraSpace for sponsoring the evening reception after the preconfernce session at the University of Texas Club.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/or11-report-1/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>Code4Lib Virtual Lightning Talks &#8212; First round, April 4th 2011</title><link>http://dltj.org/article/code4lib-virtual-lightning-talks/</link> <comments>http://dltj.org/article/code4lib-virtual-lightning-talks/#comments</comments> <pubDate>Mon, 28 Mar 2011 14:14:58 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[code4lib]]></category><guid isPermaLink="false">http://dltj.org/?p=2766</guid> <description><![CDATA[One of the highlights of the Code4Lib annual meeting is the “lightning talk” rounds. A lightning talk is a fast-paced 5 minute talk on a topic of the presenter’s choosing. They are usually scheduled on an ad-hoc, first-come-first-served basis on &#8230; <a href="http://dltj.org/article/code4lib-virtual-lightning-talks/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2766"></abbr><p>One of the highlights of the <a href="http://www.code4lib.org/" title="code4lib | coders for libraries, libraries for coders">Code4Lib</a> <a href="http://www.code4lib.org/conference" title="Code4Lib conference | code4lib">annual meeting</a> is the “lightning talk” rounds. A lightning talk is a fast-paced 5 minute talk on a topic of the presenter’s choosing. They are usually scheduled on an ad-hoc, first-come-first-served basis on the day of the event. They are an opportunity to provide a platform for someone who is just getting started with public speaking, who wants to ask a question or invite people to help with a project, or for someone to boast about something he or she did or tell a short cautionary story. These things are all interesting and worth talking about, but there might not be enough to say about them to fill up a full session timeslot.</p><p>“Virtual Lightning Talks” replicates this conference activity online in a virtual meeting environment. Each one-hour block consists of 10 six-minute sessions (one minute for the presenter to take control of the virtual meeting environment and test audio followed by a five minute presentation). Presenters show their work by sharing their entire desktop; the presentation can consist of slides, web browser, command-line shell, or any other application that can be shown on the desktop.</p><p>The first round will be on <a href="http://timeanddate.com/worldclock/fixedtime.html?month=4&#038;day=4&#038;year=2011&#038;hour=13&#038;min=30&#038;sec=0&#038;p1=179" title="http://timeanddate.com/worldclock/fixedtime.html?month=4&#038;day=4&#038;year=2011&#038;hour=13&#038;min=30&#038;sec=0&#038;p1=179">April 4th at 1:30pm Eastern U.S. Daylight Time</a>.  The <a href="http://wiki.code4lib.org/index.php/Virtual_Lightning_Talks" title="Virtual Lightning Talks - Code4Lib">Virtual Lightning Talks page on the Code4Lib wiki</a> has more information and space to sign up to be a presenter or attendee.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/code4lib-virtual-lightning-talks/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> <item><title>OCLC Introduces &#8220;A Web Presence for Small Libraries&#8221;</title><link>http://dltj.org/article/a-web-presence-for-small-libraries/</link> <comments>http://dltj.org/article/a-web-presence-for-small-libraries/#comments</comments> <pubDate>Wed, 12 Jan 2011 01:35:41 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2011]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[public library]]></category><guid isPermaLink="false">http://dltj.org/?p=2114</guid> <description><![CDATA[On Sunday evening, the OCLC Innovation Lab held a public demonstration of a project with the working title, &#8220;A Web Presence for Small Libraries.&#8221; It is a templated website that could serve as a library&#8217;s barest bones presence on the &#8230; <a href="http://dltj.org/article/a-web-presence-for-small-libraries/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2114"></abbr><p><div id="attachment_2118" class="wp-caption alignright" style="width: 327px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><a href="http://experimental.worldcat.org/lib/n/us.tn.loremville-public-library" title="Loremville Public Library"><img src="http://cdn.dltj.org/wp-content/uploads/2011/01/Loremville-Public-Library_1294793741440.png" alt="Screenshot of Sample Library Website" title="Screenshot of Sample Library Website" width="317" height="297" class="size-full wp-image-2118" /></a><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Screenshot of Sample 'Web Presence for Small Libraries'</p></div> On Sunday evening, the <a href="http://experimental.worldcat.org/" title="OCLC Innovation Lab homepage">OCLC Innovation Lab</a> held a <a href="http://community.oclc.org/cooperative/2010/12/a-web-presence-for-every-library.html" title="A Web presence for every library - The OCLC Cooperative Blog">public demonstration</a> of a project with the working title, &#8220;<a href="http://experimental.worldcat.org/lib/" title="">A Web Presence for Small Libraries</a>.&#8221;  It is a templated website that could serve as a library&#8217;s barest bones presence on the web.  The target audience is small and/or rural libraries that may not have the technological infrastructure &#8212; human knowledge, equipment, and/or money &#8212; to host their own web presence.  If it comes to fruition, the basic service would give a library four pages on the web that can be customized by the library staff plus dynamic areas of content that would be generated by OCLC algorithms and optionally placed on each library&#8217;s site.  A more advanced version of the service could include a light-weight book inventory and circulation option.</p><p>They created a sample library called <a href="http://experimental.worldcat.org/lib/n/us.tn.loremville-public-library" rel="noindex, nofollow"  title="Loremville Public Library">Loremville, TN public library</a> to demonstrate key aspects of the service.  I did not ask them how long that particular example will be around, so you may follow that link at a later date and not find it.</p><p>This &#8220;Library Website in a Box&#8221; is a concept that has been around for many years, and the latest trigger to try something was a resolution from the last OCLC Members Council to make scaled-down versions of OCLC services for &#8220;small and rural libraries.&#8221;  The Innovation Lab group conducted some research about the existing state of public library web presences by sorting the <a href="http://www.imls.gov/news/2010/063010.shtm" title="http://www.imls.gov/news/2010/063010.shtm" class="broken_link" rel="nofollow">IMLS-reported data from 2008</a> by number of volumes held and number of library staff.  They looked at the lowest quartile (roughly 20,000 volumes or less and staff size in the low single digits) and found that there was generally no web presence for these libraries.  In the second quartile there were instances of library websites, but they did the library no credit (outdated, poorly constructed, incomplete information &#8212; as was said at the meeting, these libraries had a presence on the web but probably shouldn&#8217;t).  Some had automation systems supplied by large groups, but others didn&#8217;t have evidence of an automation system.  So the project charter was to find an easy and inexpensive way for a library in these quartiles to create a desktop and mobile device web presence.</p><p>One of the unique aspects of the project development was to first set the approximate lower ($5/month) and upper ($40/month) price boundaries and find a way to provide the highest level of service possible at those price points.  The Innovation Lab team tried techniques such as a farm of WordPress sites but found they couldn&#8217;t make the revenue-versus-cost equation to work.  In the end they constructed a custom database-driven content system in PHP.  Institutional data is initially pulled from sources such as the <a href="http://www.oclc.org/registry/default.htm" title="WorldCat Registry [OCLC - Web Services]">WorldCat Registry</a>, and there will be some process for a library to &#8220;claim&#8221; its site.  There might also be a way for a library to create a site for itself if no registry data yet exists.</p><p>There are four levels of site authorization:  public (unauthenticated) viewing, a registered patron, a staff member, and an administrator.  Content from the pages are edited by the administrator at the subscribing library with a WYSIWYG<sup><a href="http://dltj.org/article/a-web-presence-for-small-libraries/#footnote_0_2114" id="identifier_0_2114" class="footnote-link footnote-identifier-link" title="What-You-See-Is-What-You-Get, meaning as you edit the document with styles like bolding, underlining, and bulleted lists, what you see in the editing interface is exactly how it will appear in the viewing interface.">1</a></sup> editor.  There are content boxes for the library&#8217;s location, staff/volunteers listing, events calendar and news, hours and phone number, policies, and service information.  The library&#8217;s address is fed into a Google Maps service to display a map of the area surrounding the library.</p><p>The dynamic parts of each library&#8217;s website could have a list of books from various sources like the New York Times and Oprah&#8217;s book group.  The service also envisions offering a &#8220;default digital collection&#8221; using public domain works in text, PDF, and mobile reading device formats from sources such as Project Gutenberg and the Internet Archive.</p><p>The inventory and circulation module is simple and straight-forward.  Each item has only eight full-text fields with the intention that the description will likely be done by a volunteer without professional library training.  Cataloging can be done by typing in the information or scanning the ISBN with an app on a mobile device; item information is pulled from WorldCat if found.  The cataloging application does not attach holdings to WorldCat, but the OCLC number is kept and might be used to facilitate offloading MARC records in cases where a library outgrows this simple circulation module to a more functional integrated library system.  The circulation functions are check-in, check-out, renew, place hold and cancel hold.  There are no financial functions in the system.</p><p>At these price levels, the system needs to be highly automated and self-supporting.  The cost to OCLC of one call to a customer support phone number could easily run through all the revenue OCLC would receive from the subscribing library in a year.  A widely adopted implementation at the targeted price points means that OCLC could dedicate one or two technical staff to support and upgrade the system in addition to the hardware and support service amortization.</p><p>One unresolved issue is domain names &#8212; what is the URL that will be used for each library&#8217;s site.  OCLC is investigating options such as partnering with a domain registrar company (someone like GoDaddy), becoming a domain registrar themselves, or putting all the sites under one domain.  The economics of each of these options will be a factor. The sometimes cumbersome aspects of migrating domain names from one service to another may make that activity cost-prohibitive as well.</p><p>Mike Teets noted that this was at a &#8220;project&#8221; stage, not a &#8220;product&#8221; stage.  My paraphrasing of what this distinction means is that the technology to create a product is largely done, but the decisions on the final formative pieces of the technology and the surrounding support infrastructure &#8212; is not yet done (and might never be done).  There isn&#8217;t even a formal name for it yet; it is being called &#8220;A Web Presence for Small Libraries.&#8221;  One desired additional feature is to add e-mail boxes for library staff/functions to the site.  Those in the room, including me but more importantly others who are more closely aligned with the target &#8220;small and rural&#8221; public library population, were pretty excited about it and wanting to talk further about if what we saw could be made a reality.</p><p><div id="attachment_2118_video" class="wp-caption alignright" style="width: 327px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><iframe title="YouTube video player" class="youtube-player" type="text/html" width="316" height="208" src="http://www.youtube.com/embed/-qxiW7gkQTU" frameborder="0"></iframe><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">One minute video of Mike Teets and Willie Neumann introducing the concepts behind the project</p></div><h2>Personal Impressions</h2></p><p>Like others in the room, I came away impressed by the project demonstration.  It is definitely fits the bill as a basic library website and even a starter inventory and circulation management system.  I see that libraries could start with something like this for the cost of a couple of books added to the collection (roughly estimated at $60/year).  At this price level, OCLC thinks they could sustain the costs of operations plus have some left over for investment in incremental improvements.  I also think that because a library would pay a minimal fee for it, they would feel a tangible sense of ownership over the site and would keep them up-to-date.  From this a library could &#8220;graduate&#8221; to another service &#8212; their own Drupal or WordPress site, to a shared ILS or Webscale Management Services.</p><p>The content areas seemed the most appropriate for the target audience.  The web page design is modern, and I could see options for future enhancements as time and revenue permit such as providing limited options to personalize the template (change colors, adding picture &#8212; or adding links to pictures that might be stored on services such as Flickr).</p><p>One attendee at the session suggested that rather than OCLC prepopulating a digital content library of <em>all</em> public domain content that the set be <span class="removed_link" title="http://www.clicweb.org/e_discover/e_discovermarc.html">limited to those that are the most downloaded</span> so as not to overwhelm the user with a lot of unused (and/or unusable) digital content.  That sounds like a good suggestion to me.</p><p>OCLC Staff are looking for feedback on this project.  They say that the system is &#8220;production ready&#8221; with all the software controls and data recovery features of OCLC behind it.  What they think is missing is community support to have local engagement with the targeted libraries to show them what is possible.  That is an area where OCLC needs help.  (I can only imagine the shocked silence of a volunteer at a small library to get a call from OCLC &#8212; if they even knew what OCLC was &#8212; with an offer to create a website for the library.  &#8220;Only $5/month&#8230;sign up now and we&#8217;ll throw in a second one for free!&#8221;) There is an e-mail address &#8212; <a href="mailto:innovation@oclc.org">innovation@oclc.org</a> &#8212; that goes to all the OCLC Innovation Lab members, and a <a href="http://www.webjunction.org/923" title="WebJunction - Library Websites Group  " class="broken_link" rel="nofollow">WebJunction group</a> for public discussion.</p><p><h2>About the Innovation Lab</h2><br />The <a href="http://experimental.worldcat.org/" title="OCLC Innovation Lab homepage">OCLC Innovation Lab</a> is a new unit headed up by <a href="http://www.linkedin.com/in/michaelteets" title="Michael Teets - LinkedIn">Mike Teets</a>.  The group has four full-time people (Mike plus <a href="http://www.oclc.org/speakers/bios/house_tip.htm" title="Tip House [OCLC]">Tip House</a>, <a href="http://nl.linkedin.com/pub/rob-koopman/12/b9/303" title="rob koopman - LinkedIn">Rob Koopman</a> and <a href="http://www.linkedin.com/in/willieneumann" title="Willie Neumann - LinkedIn">Willie Neumann</a>) and leverage staff from other parts of OCLC to work on projects.  (Willie performed the demonstration at Midwinter.)  Their charge is to be a quick and nimble team to go after problems of a business unit of the cooperative or something from OCLC as a whole that wants done but hasn&#8217;t made progress.  They like to use this quickness as a positive attribute to intentionally limit the scope of projects.  Since its formation in April 2010 it has come up with roughly one new thing a quarter, including the WorldCat Mobile interface (build in 22 days, Mike noted) and the <a href="http://community.oclc.org/cooperative/2010/06/sometimes-the-internet-is-just-not-big-enough-for-me.html" title="Sometimes the Internet is just not big enough for me - The OCLC Cooperative Blog">Ask4Stuff Twitter service</a>.</p><p>For this project, the Innovation Lab sought out cooperation from OCLC staff to build prototypes and components of the service in their spare time.  Each Monday the self-selected group would get together to show what had been built and discuss ways to move the project forward in the following week.  In this way they rapidly iterated over ideas to come up with what was ultimately proposed.<p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://expreimental.worldcat.org/ to http://experimental.worldcat.org/ on January 12th, 2011.</p><p style="padding:0;margin:0;font-style:italic;">The text was modified to update a link from http://www.webjunction.org/923 to http://www.webjunction.org/923 on January 12th, 2011.</p><p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://www.clicweb.org/e_discover/e_discovermarc.html on June 9th, 2011.</p><h2>Footnotes</h2><ol class="footnotes"><li id="footnote_0_2114" class="footnote">What-You-See-Is-What-You-Get, meaning as you edit the document with styles like bolding, underlining, and bulleted lists, what you see in the editing interface is exactly how it will appear in the viewing interface.</li></ol>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/a-web-presence-for-small-libraries/feed/</wfw:commentRss> <slash:comments>27</slash:comments> </item> <item><title>Options in Storage for Digital Preservation</title><link>http://dltj.org/article/preservation-storage-options/</link> <comments>http://dltj.org/article/preservation-storage-options/#comments</comments> <pubDate>Sun, 09 Jan 2011 22:45:10 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[Amazon S3]]></category> <category><![CDATA[Association for Library Collections and Technical Services]]></category> <category><![CDATA[Chronopolis]]></category> <category><![CDATA[DAITS]]></category> <category><![CDATA[Duracloud]]></category> <category><![CDATA[Iron Mountain]]></category> <category><![CDATA[LOCKSS]]></category> <category><![CDATA[OCLC]]></category> <category><![CDATA[preservation]]></category> <category><![CDATA[storage]]></category> <category><![CDATA[trac]]></category><guid isPermaLink="false">http://dltj.org/?p=2101</guid> <description><![CDATA[A last-minute change to my plans for ALA Midwinter came on Tuesday when I was sought out to fill in for a speaker than canceled at the ALCTS Digital Preservation Interest Group meeting. Options for outsourcing storage and services for &#8230; <a href="http://dltj.org/article/preservation-storage-options/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2101"></abbr><p>A last-minute change to <a href="http://dltj.org/article/alamw11-schedule/">my plans for ALA Midwinter</a> came on Tuesday when I was sought out to fill in for a speaker than canceled at the <a href="http://connect.ala.org/node/119686" title="Digital Preservation Interest Group (ALCTS) | ALA Connect">ALCTS Digital Preservation Interest Group meeting</a>.  Options for outsourcing storage and services for preserving digital content has been a recent interest, so I volunteered to combine two earlier <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> blog posts with some new information and present it to the group for feedback.  The reaction was great, and here is the promised slide deck, links to further information, and some thoughts from the audience response.</p><p><h2>Slide Deck and References</h2><br /><div id="slideshare-options-in-storage" class="wp-caption alignright" style="width: 435px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; float: right;"><div style="width:425px" id="__ss_6499127"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/DataGazetteer/options-in-storage-for-digital-preservation" title="Options in Storage for Digital Preservation">Options in Storage for Digital Preservation</a></strong><object id="__sse6499127" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=201101digitalpreservationinterestgroupalcts-110109162838-phpapp02&#038;stripped_title=options-in-storage-for-digital-preservation&#038;userName=DataGazetteer" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse6499127" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=201101digitalpreservationinterestgroupalcts-110109162838-phpapp02&#038;stripped_title=options-in-storage-for-digital-preservation&#038;userName=DataGazetteer" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object></div><p><p style=' padding: 0 4px 5px; margin: 0;'  class="wp-caption-text">Slides for &#039;Options in Storage for Digital Preservation&#039;</p></div><br />In the presentation there is a <a href="http://www.slideshare.net/DataGazetteer/options-in-storage-for-digital-preservation/14" title="Options in Storage for Digital Preservation">Table About Costs</a> that uses a scenario from an <a href="http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/">earlier <i><acronym title="Disruptive Library Technology Jester">DLTJ</acronym></i> blog post</a>.  The text of the scenario is:<br /><blockquote>To examine the similarities and differences in costs, let’s use the OhioLINK Satellite Image collection as a prototypical example. It consists of about 2 terabytes (2TB) of high-quality images in TIFF format, with about 7.5GB of data going into the repository each month. In the interest of exploring everything that S3 can do, there is an assumption that approximately 4GB of data will be transferred out of the archive each month; OCLC’s Digital Archive does not have a end-user dissemination component.</p></blockquote><p>The point of showing this scenario is to show the widest range of costs &#8212; from a storage-only solution like Amazon S3 to a soup-to-nuts service like OCLC Digital Archive.  A word about the redacted costs.  Some of the numbers for OCLC&#8217;s Digital Archive response (from 2008) came from a confidential quote, so the numbers were removed from the public table.  For the numbers that are publicly listed, the values come from <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018" title="OCLC Introduces High-Priced Digital Archive Service">Barbara Quint&#8217;s article</a>.</p><p>The articles and blog posts I referenced in the course of the presentation were:</p><p>Iglesias, Edward and Wittawat Meesangnil (2010). Using Amazon S3 in Digital Preservation in a mid sized academic library: A case study of CCSU ERIS digital archive system. <i>The Code4Lib Journal</i>, issue 12, retrieved 5-Jan-2011 from <a href="http://journal.code4lib.org/articles/4468" title="Using Amazon S3 in Digital Preservation in a mid sized academic library: A case study of CCSU ERIS digital archive system | The Code4Lib Journal">http://journal.code4lib.org/articles/4468</a></p><p>Murray, Peter (2008). Long-term Preservation Storage: OCLC Digital Archive versus Amazon S3. <i>Disruptive Library Technology Jester.</i> Retrieved 5-Jan-2011 from <a href="http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/">http://dltj.org/article/oclc-digital-archive-vs-amazon-s3/</a></p><p>Murray, Peter (2009). Can We Outsource the Preservation of Digital Bits?. <i>Disruptive Library Technology Jester.</i> Retrieved 5-Jan-2011 from <a href="http://dltj.org/article/outsource-digital-bits/">http://dltj.org/article/outsource-digital-bits/</a></p><p>Quint, Barbara (2008). OCLC Introduces High-Priced Digital Archive Service. <i>Information Today.</i> Retrieved 5-Jan-2011 from <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018" title="OCLC Introduces High-Priced Digital Archive Service | Information Today">http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018</a></p><ul><li>Amazon S3. Retrieved 5-Jan-2011 from <a href="http://aws.amazon.com/s3/" title="Amazon Simple Storage Service (Amazon S3)">http://aws.amazon.com/s3/</a></li><li>Chronopolis. Retrieved 5-Jan-2011 from <a href="http://chronopolis.sdsc.edu/" title="Chronopolis -- Digital Preservation Program -- Long-Term Mass-Scale Federated Digital Preservation"> http://chronopolis.sdsc.edu/</a></li><li>DAITSS. Retrieved 5-Jan-2011 from <a href="http://daitss.fcla.edu/" title="DAITSS - Trac">http://daitss.fcla.edu/</a></li><li>DuraCloud. Retrieved 5-Jan-2011 from <a href="https://wiki.duraspace.org/display/duracloud/DuraCloud">https://wiki.duraspace.org/display/duracloud/DuraCloud</a></li><li>Iron Mountain. Retreived 5-Jan-2011 from <a href="http://www.ironmountain.com/news/2009/impr02232009.asp" title="Iron Mountain Digital Introduces the Industry" class="broken_link" rel="nofollow">http://www.ironmountain.com/news/2009/impr02232009.asp</a></li><li>OCLC Digital Archive. Retrieve 5-Jan-2011 from <a href="http://www.oclc.org/us/en/digitalarchive/" title="Digital Archive [OCLC - Digital Collection Services]">http://www.oclc.org/us/en/digitalarchive/</a></li><li>Private LOCKSS Networks. Retrieved 5-Jan-2011 from <a href="http://lockss.stanford.edu/lockss/Private_LOCKSS_Networks" title="Private LOCKSS Networks - LOCKSS">http://lockss.stanford.edu/lockss/Private_LOCKSS_Networks</a></li></ul><p><h2>Some Thoughts</h2><br />There was a great deal of discussion after the presentation about how good of a guarantee is good enough.  Amazon S3, offers two levels of availability:  &#8220;Designed to provide 99.999999999% durability and 99.99% availability of objects over a given year.&#8221;  The question was whether that slight risk of loss is &#8220;good enough&#8221; for our purposes.  Coming to grips with the digital storage, can we (as the librarian profession) get someone from Amazon to talk about what they do to assure that data is available?  Can the terms that they use be translated into terms that we use and understand?  Can we get a level of familiarity and comfort with their storage about what they do to trust them as a long-term data warehouse?  Can we pull out the appropriate questions of the <a href="http://www.dcc.ac.uk/resources/tools-and-applications/trustworthy-repositories" title="Trustworthy Repositories | Digital Curation Centre">Trusted Repositories Audit &amp; Certification: Criteria and Checklist</a> to see how Amazon S3 measures up?</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/preservation-storage-options/feed/</wfw:commentRss> <slash:comments>10</slash:comments> </item> <item><title>My ALA Midwinter 2011 Schedule</title><link>http://dltj.org/article/alamw11-schedule/</link> <comments>http://dltj.org/article/alamw11-schedule/#comments</comments> <pubDate>Tue, 04 Jan 2011 01:39:44 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Midwinter Conference 2011]]></category> <category><![CDATA[American Library Association]]></category> <category><![CDATA[conference]]></category><guid isPermaLink="false">http://dltj.org/?p=2089</guid> <description><![CDATA[The end-of-year holidays are behind us and (in the northern parts of the northern hemisphere) the cold days of winter in front of us. What better time to bag it all and head to the warm(er) temperatures of San Diego, &#8230; <a href="http://dltj.org/article/alamw11-schedule/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=2089"></abbr><p>The end-of-year holidays are behind us and (in the northern parts of the northern hemisphere) the cold days of winter in front of us.  What better time to bag it all and head to the <a href="http://gocalifornia.about.com/od/casdmenu/ss/san_diego_by_month.htm" title="San Diego in January - Events - Temperatures - Weather - What to Do">warm(er) temperatures of San Diego, California</a> for the <a href="http://www.ala.org/midwinter/" title="ALA Midwinter homepage"><abbr title="American Library Association">ALA</abbr> Midwinter meeting</a>.  I mean &#8212; come&#8217;on &#8212; do you really want to dive into all of that work that piled up over the past week or so?  (You say that even more work will pile up if you attend the meeting?  Bah, humbug!)  If you are going, I wholeheartedly endorse the new <a href="http://connect.ala.org/conference/118914/sessions" title="All ALA Midwinter 2011 Sessions | ALA Connect">ALA Connect-based meeting planner</a>.  It is at times frustratingly slow, but chock full of ways to slice-and-dice meeting events that were not possible in the earlier version.  (I&#8217;m going to put in a suggested enhancement that the iCal file export includes URLs to the meeting listing online; that would be immensely helpful.)</p><p>Here is my schedule of events, with links into meeting descriptions on ALA Connect where appropriate.  Also don&#8217;t forget to check out the <a href="http://connect.ala.org/files/2/whmw2011final_pdf_42464.pdf" title="ALA Midwinter 2011 &#038;039;What&#038;039;'s Happening&#038;039; Document">What&#8217;s Happening</a> (229KB, PDF) document for the inside scoop from the ALA staff to the ALA Council members.  If you are interested in getting together with me, let me know and we&#8217;ll find a time.</p><p><h2>Friday, January 7th</h2><br />I&#8217;m flying Continental through Houston, arriving in San Diego at about quarter after four local time.  It has been years since I&#8217;ve flown Continental, and I&#8217;m kinda looking forward to it.  I have many fond memories of Continental Airlines from my days of a commuting suitor and engagement to my now-wife, so I&#8217;m hoping to experience their fine service one more time before the <a href="http://www.unitedcontinentalholdings.com/" title="United Continental Holdings, Inc.">merger with United Airlines</a> takes hold.  (Who knows what service will be like after that?)</p><p>I&#8217;m arriving late, but normally I would try to get to the <a href="http://connect.ala.org/node/120574" title="RMG Consultants - 21st Annual Presidents&amp;#039; Seminar | ALA Connect">RMG Integrated Library System vendor panel</a>.  If the <a href="http://rscel.evergreen-ils.org/node/1541" title="Koha and Evergreen Shine in Breeding ILS Survey Results | RSCEL: Resource and Sharing Cooperative of Evergreen Libraries">past year is any guide</a>, I think it is going to be another wild year for ILS vendors.  I do expect to pop into the <a href="http://lists.ala.org/wws/arc/lita-l/2010-12/msg00071.html" title="Midwinter Happy Hour | lita-l">LITA Happy Hour</a> at the Uber Lounge of the <span class="removed_link" title="http://www.sesandiego.com/">Se San Diego Hotel</span>.</p><p><h2>Saturday, January 8th</h2><br />I&#8217;m starting the day with a private meeting, then I expect to head to the Hilton San Diego Bayfront for a presentation by OCLC from 10:30am to noon: <a href="http://connect.ala.org/node/120319" title="OCLC The Power of Data, Technology and Community: The OCLC Platform Strategy | ALA Connect">The Power of Data, Technology and Community: The OCLC Platform Strategy</a>.  OCLC&#8217;s Webscale Management &mdash; like it or not &mdash; is going to rock the world of library automation.  I generally like what I see, and I want to hear the first-hand experiences of those that have tried it to know if that opinion is on target.  (OCLC asks that you <span class="removed_link" title="https://www3.oclc.org/app/ala_registration/">register</span> for its sessions, but I don&#8217;t think it is required.)</p><p>After time for lunch, some exhibits and a private meeting I&#8217;ll have a choice to make.  On the one hand is the regular <a href="http://connect.ala.org/node/120002" title="ACRL/SPARC Forum (ACRL) | ALA Connect">SPARC/ACRL forum</a> on the topic of <a href="http://www.arl.org/sparc/media/10-1122.shtml" title="Next SPARC-ACRL forum to examine changing state of open-access journal publishing (SPARC)">Open Access and the changing state of scholarly publishing</a>.  The lineup is great: <a href="http://no.linkedin.com/pub/caroline-sutton/1/94/90a" title="Caroline Sutton - LinkedIn">Caroline Sutton</a>, President of the Open Access Scholarly Publishers Association (OASPA);<br /><a href="http://www.springer.com/about+springer/company+information/management?SGWID=0-175704-19-799035-0" title="Wim van der Stelt | Springer Management">Wim van der Stelt</a>, Executive Vice President of Corporate Strategy for Springer; and <a href="http://www.plos.org/about/people/biology.php#cmaccallum" title="Public Library of Science: Staff">Catriona McCallum</a>, Senior Editor for PLoS Biology and Consulting Editor for PLoS ONE.  That sounds like a lively discussion.</p><p>On the other hand is the <i>ad hoc</i> <a href="http://connect.ala.org/node/120044" title="Google Books Task Force (ALA) | ALA Connect">Google Books Task Force meeting</a>.  The topic is certainly of interest, and news of any movement by the court on the settlement has been too quiet.  But I don&#8217;t see an agenda for the meeting and I don&#8217;t know what will be covered.  Fortunately, these two meetings are near each other in the convention center.  Maybe I&#8217;ll start here and move to the SPARC/ACRL Forum.</p><p><h2>Sunday, January 9th</h2><br />Sunday starts early with the <a href="http://connect.ala.org/node/120389" title="OCLC Update Breakfast | ALA Connect">OCLC Update Breakfast</a> from 7am to 8am in the Sapphire Ballroom of the Hilton San Diego Bayfront Hotel.  From there I&#8217;d like to head to the <a href="http://connect.ala.org/node/120454" title="Top Technology Trends (LITA) | ALA Connect">LITA Top Technology Trends</a> discussion from 8am to 10am, but the room will probably be packed.  I doubt I&#8217;ll get out of the OCLC Update Breakfast on time and the room is always jammed, so I might have to attend vicariously through the tweets and posts of others.  In any case I&#8217;ll head back to the exhibits on Sunday morning and then go to the <a href="http://connect.ala.org/node/120404" title="OCLC Developer Network Luncheon | ALA Connect">OCLC Developer Network Luncheon</a>.  Always good stuff to learn at this <a href="http://www.oclc.org/developer/events/ala-midwinter-2011" title="ALA Midwinter 2011 | OCLC Developer Network">lunch</a> about the <a href="http://www.oclc.org/developer/webservices" title="Web Services | OCLC Developer Network">OCLC <abbr title="Application Programming Interfaces">APIs</abbr></a> and <a href="http://www.oclc.org/developer/applications" title="Applications | OCLC Developer Network">what people are doing with them</a>.</p><p>Then I have another tough choice.  I could go to the <a href="http://connect.ala.org/node/119843" title="Emerging Technologies Interest Group (LITA) | ALA Connect">LITA Emerging Technologies Interest Group meeting</a>.  There will be a follow-up discussion to the panel from ALA Annual (which unfortunately I missed due to a family illness).  Or I could go to the <a href="http://connect.ala.org/node/120627" title="Cloud Computing and Virtualization Interest Group (LITA) | ALA Connect">LITA Cloud Computing / Virtualization Interest Group meeting</a>.  I can&#8217;t find an agenda for this meeting either, but the topic is of interest at work.</p><p>Rounding out the day will be the public launch of <a href="http://community.oclc.org/cooperative/2010/12/a-web-presence-for-every-library.html" title="A Web presence for every library - The OCLC Cooperative Blog">an experimental service &#8220;providing a low cost and easy-to-use Web site service for small and rural public libraries&#8221;</a> by the <a href="http://www.oclc.org/nextspace/015/labs.htm" title="labs [OCLC]">OCLC Innovation Lab</a>.</p><p><h2>Monday, January 10th</h2><br />Monday morning will be NISO time, with back-to-back meetings of the <a href="http://connect.ala.org/node/120530" title="NISO Topic Committee Meeting | ALA Connect">Topic Committees as a whole</a> and the <a href="http://www.niso.org/topics/d2d/" title="Discovery to Delivery - National Information Standards Organization">Discovery to Delivery topic committee</a>.  Monday afternoon is open at this point, but probably will be filled either with other private meetings or some time writing up notes and summaries from all the weekend events.</p><p><h2>Tuesday, January 11th</h2><br />Flying out early, early Tuesday morning back through Houston (with fingers crossed that the choice of going through Houston will mean no weather delays).<p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to https://www3.oclc.org/app/ala_registration/ on January 28th, 2011.</p><p style="padding:0;margin:0;font-style:italic;" class="removed_link">The text was modified to remove a link to http://www.sesandiego.com/ on June 9th, 2011.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/alamw11-schedule/feed/</wfw:commentRss> <slash:comments>3</slash:comments> </item> <item><title>&#8220;What Is Your Library Doing about Emerging Technologies?&#8221;</title><link>http://dltj.org/article/ala2010-program/</link> <comments>http://dltj.org/article/ala2010-program/#comments</comments> <pubDate>Wed, 23 Jun 2010 01:38:06 +0000</pubDate> <dc:creator>Peter Murray</dc:creator> <category><![CDATA[Meeting]]></category> <category><![CDATA[ALA Annual Conference 2010]]></category> <category><![CDATA[conference]]></category> <category><![CDATA[emerging technology]]></category> <category><![CDATA[Library and Information Technology Association]]></category> <category><![CDATA[presentation]]></category><guid isPermaLink="false">http://dltj.org/?p=1638</guid> <description><![CDATA[At the American Library Association conference this weekend, I&#8217;ll be part of a panel presentation from the LITA Emerging Technologies Interest Group with the title &#8220;What Is Your Library Doing about Emerging Technologies?&#8221; The presentation will be on Saturday, June &#8230; <a href="http://dltj.org/article/ala2010-program/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<abbr class="unapi-id ignore noPrint" title="http://dltj.org/?p=1638"></abbr><p>At the American Library Association conference this weekend, I&#8217;ll be part of a panel presentation from the <acronym title="Library and Information Technology Association"><a href="http://www.lita.org/" title="Library and Information Technology Association homepage" rel="homepage">LITA</a></acronym> <a href="http://connect.ala.org/node/66489" title="Emerging Technologies Interest Group (LITA - Library &amp; Information Technology Association) | ALA Connect">Emerging Technologies Interest Group</a> with the title &#8220;<a href="http://connect.ala.org/node/104303" title="ALA 2010 Program: &quot;What Is Your Library Doing about Emerging Technologies?&quot; | ALA Connect">What Is Your Library Doing about Emerging Technologies?</a>&#8221;  The presentation will be on Saturday, June 26 from 1:30pm to 3:30pm in room 103B of the Washington Convention Center.  The publicity blurb:<br /><blockquote>A new job title of “Emerging Technology Librarian” seems to reflect an awareness among today’s libraries that there is a need for a librarians whose main role is to explore, evaluate, promote, and implement various emerging technologies.  19 librarians in different fields of librarianship at academic, school, and public libraries will discuss the topic of emerging technologies at libraries, their evaluation, implementation, adoption, and management challenges.</p></blockquote><p><br />My panel group met by conference call this afternoon to discuss the topic, and I came away feeling great about the synergy of this group.  The panel style is the panelists responding to a question from the moderator and reactions from each other with time for questions from the audience.  No canned presentations!</p><p>Thanks to <a href="http://connect.ala.org/user/68988" title="Bohyun Kim | ALA Connect">Bohyun Kim</a> from Florida International University for setting up the panel discussion.</p>]]></content:encoded> <wfw:commentRss>http://dltj.org/article/ala2010-program/feed/</wfw:commentRss> <slash:comments>4</slash:comments> </item> </channel> </rss>
<!-- Served from: dltj.org @ 2012-05-24 03:40:56 by W3 Total Cache -->
