Skip to content
Solely for the Purpose of Catching $PAMRZ

Open Source for Open Repositories — New Models for Software Development and Sustainability


This is a summary of a presentation by James L. Hilton, Vice President and CIO of University of Virginia, at the opening keynote session of Open Repositories 2007. I tried to capture the esessence of his presentation, and omissions, contradictions, and inaccuracies in this summary are likely mine and not that of the presenter.

Setting the stage

This is a moment in which institutions may be willing to invest in open source development in a systematic way (as opposed to what could currently be characterized as an ad hoc fashion) driven by these factors:

  • Fear. Prior to Oracle’s hostile take-over of PeopleSoft, the conventional wisdom of universities was that they needed to buy their core enterprise applications rather than build them. In doing so, they sought the comfort of buying the security of a leading platform. Oracle’s actions diminished that comfort level. Blackboard acquisition of WebCT and lawsuit against a competitor does not help either.
  • Disillusionment and ERP fatigue. What was largely thought to be an outsourced project was found to be an endless upgrade cycle. Organizations need to build entire support units to handle the upgrades for large ERP systems rather than supporting the needs of the users.
  • Incredulity — we’re supposed to do what? The application of technology typically has a disruptive impact (cannot predict the end), the stakes are incredibly high (higher education and/or research could be lost in a decade), it tends to be expensive, and the most common survival strategy is to seed many expensive experiments in the hopes that one will be in the right place at the time the transition needs to happen. The massive investment anticipated for technology to support academic computing (libraries, high-performance clusters, etc) will pale in comparison to the investment in administrative computing.
  • Rising tide of collaboration. This is a realization that the only way to succeed is through collaboration. To paraphrase Hilton, “In the new order it will be picking the right collaborative partners where the new competitive advantage will come from.”

Distinctions

Hilton offered these definitions and contrasts as a way to frame the rest of his discussion. First was Open or “free” software. Free as in beer, or free as in “adopt a puppy.” The software comes with the ability to do with as you want with the code, not just the ability to use the code. They he defined the term License as a contract — what ever you agree to you are bound to; you cannot use copyright law to protect you. The rules and conditions that are applied to the software do matter.

Lastly, he talked about Copyleft or “viral” licensing. There are different interpretations of “open” in open source. “Copyleft” has come to mean that code should be freely available to be used and modified, and it should never by locked up. GPL is an example. This is often called “viral” because if you include software with this license in any other work that is released, the additional software must be released under the same license. This is seen by some as valuable because it prevents open source from being encircled by proprietary code. Copyleft is contrasted with an “open/open” license — you can do whatever you want to do with a code under this license. An “open/open” license places no restrictions on what users do with code in derivative software packages.

Case Study — Michigan’s Sakai Sojourn

Hilton briefly described why UMich went down the Sakai path in 2001-2002:

  • Legacy system with no positive trajectory forward. It could never be released into open source; all of the development would have to be carried on UMich’s shoulders forever.
  • Saw market consolidation in CMS. This was mostly evident in the commercial sector with Blackboard and WebCT being the dominant choices. They had concerns about the cost of licenses in this environment down the road.
  • Saw the potential of tapping the institution’s core competencies and starting a virtuous cycle of development, teaching and research. Or, put another way, they didn’t want core competencies in teaching and research held hostage to a commercial development cycle.
  • Strategic desire to blur the distinction between the laboratory/classroom and between knowledge creation/digestion. They realized that the functions of a research support tool and a course support tool were pretty much the same under different skins, and they sought to blur that distinction even more.
  • NRC report and the need for collaboration. UMich was willing to fund the project two years internally but knew after that need to find collaborative partners by the fifth year in order to be declared a success.
  • A moment of time opportunity that synchronized the development process of several partners with funding provided by the Mellon Foundation.

There were also specific goals for the Sakai project. The new system had to replicate the functionality of existing course and research collaboration environments. They also wanted experience in finding partners willing to collaborate. Hilton said, “Sakai was/is at least as interest from a collaboration perspective as it is from the technology perspective.” Bringing together disparate organizations with different beliefs on how things should be done is a challenge. Additionally, they wanted to get better as an institution at discerning open source winners; it shouldn’t be like a lottery. Lastly, they wanted to implement software parts that were not built at UMich. Each partner institutions committed to implementing the same thing even if wasn’t built at that institution. This is tough to do, but they knew they needed to do it for their own good in the long run.

What happened? Not only did the original partners show up, but the community came, too. Even more interesting was that the community was formed with dues-paying members — even in a world where the software is free. It became a vibrant community, too, with a conference every six months. Sakai was released under an open-open license model, and corporate partners showed up as well (selling support services, or hosting services, or hardware for the software). The software did grow up and left its home; a separate foundation now holds the intellectual property of the code (originally partners assigned copyright to UMich). They also positioned Sakai to be a creditable threat to the commercial entities in order to force them to the standards table.

Takeaway lessons that generalize to open source development

First, the benefits of open source development.

  • destiny control (but only when you really need to drive). having the control is not always a good thing. Is it worth the effort? Is the project core to the institution’s mission? (Does it directly support scholarship and teaching?)
  • builds community and camaraderie (in the case of Sakai, both locally at UMich and internationally)
  • unbundles software ownership and its support. inspires more competition in the implementation and support space.
  • community source provides institutions an opportunity to leverage links between open source, open access and culture of the academy/wider world (a.k.a. put up or shut up)

Then, the challenges of open source development.

  • Guaranteeing clean code (IP) is hard (read as “impossible”). A certain amount of faith about the code they get and there needs to be consideration for mitigating risks.
  • Figuring out who is authorized to license institutionally-owned code is challenging and then you have to convince them to give it away. No one in the institution typically has been appointed or given the authority to release code. One of the things that the sakai licensing discussions highlighted was institutional differences in requirements and aesthetics.
  • Patent quagmire always looming. How do you know your software is not infringing? How do you make sure you don’t inadvertently give away all institution patents? Be careful when looking at licenses from an institutional perspective versus an individual perspective.
  • There is also the inevitable lawsuit risk. Or, as your counsel might say to you, “Let me get this straight, we can get sued but there’s no one we can sue.”

Then, some discoveries that they made along the way.

  • An open source project not a silver bullet. The commitment to build rather than buy must align with institutional priorities and competencies; it is not right for every project/application.
  • Licensing does matter; it is a contract: whatever you stick in its rules is what sticks. There are probably have too many open source license options and some sort of standardization is needed. Also keep in mind that if you release something under an open/open license, you can’t include any copyleft components.
  • Communities don’t just happen, they require: specific shared purpose (when visions vary, or when they change, collaborations struggle); and governance (e.g., separate board with dedicated developers sitting between institutions). Cooperation (”I won’t hurt you if you don’t hurt me”) is not collaboration.
  • Open (community) source requires real project discipline. “It is as spontaneous as a shuttle launch.” Along the way one needs to learn to balance pragmatics and ideals. One also needs to learn to trust your partners. “It really requires learning to let go.” Letting go, and having the community make the decisions, may be the quickest path to efficiency.

Reflection on open/community source for repositories

Repositories are at the center of everything at the institution. It connects with the library, with the presses/scholarly publishing operation, with classroom teaching, with the laboratory, and with the world. It is a core piece of of infrastructure for the university of the 21st century. As institutions, we need to make sustaining investments in our repositories.

Hilton sees three different approaches to “community” in the existing projects:

  • dspace: community of user/developers. The come together to talk about what they want to do, write code, and support each other. Clearly there are enthusiastic users as developers.
  • eprints: appears as like a vendor talking with customers wanting the community help shape the direction.
  • fedora: in transition from a combination of the previous two models moving towards a Sakia-like model. it will require institutions to make commitments to it.

In the end, Hilton asked some thought-provoking questions. Is now the time for institutional investment in open/community source? Will a coherent community (or communities) emerge in ways that are sustainable? — is there a shared vision?

5 Trackbacks

  1. Peter Suber, Open Access News | January 25, 2007 at 12:15 pm | Permalink

    Kramer auto Pingback[...] James Hilton on open source development and IRs Jester has blogged some notes on James Hilton’s presentation at Open Repositories 2007 (San Antonio, January 23-26, 2007).  Excerpt: [...]

  2. Anonymous | February 11, 2007 at 11:58 pm | Permalink

    Kramer auto Pingback的一次近期演讲演示一下大型开源应用系统的动力[translated to English]

  3. Kramer auto Pingback[...] I was, for various reasons, unable to attend the Open Repositories 2007 conference in San Antonio last week. Although the presentations themselves don’t appear to have made it online yet (I’ll post when they do), there has been plenty of blogging going on, especially over at Jim Downing’s blog:http://wwmm.ch.cam.ac.uk/blogs/downing/?p=65http://wwmm.ch.cam.ac.uk/blogs/downing/?p=67http://wwmm.ch.cam.ac.uk/blogs/downing/?p=68http://wwmm.ch.cam.ac.uk/blogs/downing/?p=69And to save the clicks, here’s some that he’s linked to:http://dltj.org/2007/01/open-source-for-open-repositories/http://blog.zsr.wfu.edu/pd/?p=38http://nx650.wordpress.com/2007/01/23/at-open-repositories-san-antonio/Plus some great summaries from Dorothea Salo, for which, as a non-attendee, I am very grateful:http://cavlec.yarinareth.net/archives/2007/01/28/simile/http://cavlec.yarinareth.net/archives/2007/01/28/manakin-and-geospatial-metadata/http://cavlec.yarinareth.net/archives/2007/01/28/themes-in-manakin/http://cavlec.yarinareth.net/archives/2007/01/27/custom-interfaces-with-content-packaging-plugins/http://cavlec.yarinareth.net/archives/2007/01/27/dspace-preservation-repository-design/http://cavlec.yarinareth.net/archives/2007/01/27/dspace-configurable-submission-system/http://cavlec.yarinareth.net/archives/2007/01/27/pf-dspace/http://cavlec.yarinareth.net/archives/2007/01/25/introducing-new-services-with-dspace/http://cavlec.yarinareth.net/archives/2007/01/25/dspace-at-ncsu/http://cavlec.yarinareth.net/archives/2007/01/23/dspace-for-managing-digitized-collections/http://cavlec.yarinareth.net/archives/2007/01/23/manakin-ui/http://cavlec.yarinareth.net/archives/2007/01/23/dspace-the-next-generation/Promise I’ll be there for Open Repositories 2008 in Southampton. It is only just up the road!EDIT: All the links were broken the first time I published this. All fixed now. [...]

  4. HitchHikr | February 26, 2007 at 5:21 pm | Permalink

    Kramer auto Pingback[...] RSS SubscribeOR2007, Retrospection 2: We need to talk about… Mon, 12 Feb 2007 05:45:21 GMTThere is already a collection of notes and links on the web about OR2007. Some blog posts summarizing OR2007 presentations can be found using the technorati tag “icor2007″. And links to many presentations, demos, and attendee photo collections can be found with the del.icio.us tag “or2007″. So I’m going to continue my own retrospective on the Posted in PRAXIS101< /strong> ( 39 links from 9 sites) by band … Read More Presentation Summary: ?Cross Repository S… Thu, 15 Feb 2007 08:43:56 GMTRichard Rodgers presented this talk based on the work of he and MacKenzie Smith in the Digital Library Research Group at MIT. The original abstract of the presentation was: Many questions are raised as previously unreachable digital content is found in and among new repositories?is each repository an island or a separately searchable resource? Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More Presentation Summary: ?MPTStore: Implementing a… Mon, 12 Feb 2007 02:49:29 GMTChris Wilper gave this presentation on behalf of the work that he and Aaron Birkland did to improve the performance of the Fedora Resource Index. Presentation slides via SlideShare Version 2.0 of the Fedora digital object repository software added a feature called the Resource Index (RI). Based on Resource Description Framework (RDF) Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More OR2007, Retrospection 1: “What is an open… Mon, 12 Feb 2007 05:45:21 GMTI was planning to liveblog the Open Repositories 2007 Conference: listen to presentations, make notes in Ecto, paste in URLs, add my own associations, proofread and post. But I discovered that it was not so easy, for me, to blog and listen and worry about my worn-out battery at the same time. So I listened and wrote notes on paper. I thought Posted in PRAXIS101< /strong> ( 39 links from 9 sites) by band … Read More Open Repositories 2007 Friday MorningTue, 20 Feb 2007 16:38:41 GMTOpen Repositories 2007 - Friday Morning Friday, January 26, 2007, 12:58 pm by mitcheet This morning sees the conclusion of icor2007 &amp; begins with presentations on application profiles, repository interoperability, and OSA models. The final Keynote session revisited the ideas introduced in the SPARC discussion touching on future directions in Posted in ZSR Professional Development ( 36 links from 3 sites) … Read More Open Repositories Day 2 Making Fedora Easier… Thu, 25 Jan 2007 20:41:55 GMTMaking Fedora Easier to Implement with Fez - Christiaan Kortekaas What is Fez? Manages all kinds of documents publications, images, learning objects, spreadsheets, databases and datasets, course materials Fez technologies PHP 5 MySQL 4.1+ Apache Java SDK ImageMagick Graphviz JHOVE Fedora Unique Benefits Dynamic GUI configurabale document Posted in Library Web Chic ( 85 links from 57 sites) … Read More Open Source for Open Repositories ? New Models f… Mon, 12 Feb 2007 02:49:30 GMTThis is a summary of a presentation by James L. Hilton, Vice President and CIO of University of Virginia, at the opening keynote session of Open Repositories 2007. I tried to capture the esessence of his presentation, and omissions, contradictions, and inaccuracies in this summary are likely mine and not that of the presenter. Setting the stage Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More A Vision for FEDORA?s Future, an I… Mon, 12 Feb 2007 02:49:28 GMTThis morning, Sandy Payette of Cornell University and FEDORA project co-director, gave an update on the FEDORA project including a statement of a vision for FEDORA?s future, information about the emerging FEDORA Commons non-profit, and a status report/roadmap for the software itself. Below is a summary based on my notes of Sandy?s comments and Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More Open Repositories Presentation: Building an IR… Mon, 12 Feb 2007 02:49:29 GMTBelow is the outline of the Ohio DRC presentation from today?s FEDORA session at Open Repositories conference. Comments welcome! Executive Overview of the Ohio Digital Resource Commons Facets of the Digital Resource Commons Vision DRC Vision (Multi-Institutional) DRC Vision (Cross-Institutional) DRC Vision (Access Control) DRC Vision Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More Blogging About Open Repositories 2007? Use the… Wed, 24 Jan 2007 16:35:36 GMTAre you blogging the Open Repositories conference in San Antonio this week? Are you posting pictures to Flickr? if so, may I suggest using the ?icor2007? tag when posting your content. When doing so, the HitchHikr service will aggregate content from Technorati and Flickr based on that tag. (Why not simply ?or2007?? ? it looks like that tag Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More Building an Institutional Repository Interface… Wed, 24 Jan 2007 16:35:35 GMTThis tour is designed to show the overall architecture of a FEDORA digital object repository application within the JBoss Seam framework while at the same time pointing out individual design decisions and extension points that are specific to the Ohio Digital Resource Commons application. Geared towards software developers, a familiarity with Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More Looking Forward to Version 2.2 of FEDORAWed, 24 Jan 2007 16:35:37 GMTSandy Payette, Co-Director of the Fedora Project and Researcher in the Cornell Information Science department, announced a tentative date for the release 2.2 of the FEDORA digital object repository. The Fedora development team would like to announce that Fedora 2.2 will be released on Friday, January 19, 2007. This new release will contain Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More a couple upcoming events repositories and… Sun, 09 Jul 2006 02:11:28 GMT*** 1 Open Repositories 2007 - 2nd International Conference on Open Repositories (ICOR2007) January 23-26, 2007 San Antonio, Texas CFP deadline: October 2, 2006 Extended abstract, less than 500 words, double spaced via Disruptive Library Technology Jester *** 2 code4lib 2007 February 28 - March 2, 2007 Athens, Georgia Posted in Science Library Pad ( 168 links from 71 sites) … Read More Heads up! International Conference on Open… Tue, 04 Jul 2006 01:29:57 GMTOpen Repositories 2007 is coming up next year, and it looks to be an interesting meeting. The first day is open user group meetings for DSpace, Fedora, and Eprints, followed by general conference sessions that cover issues that cut across all of the open repository systems. This year, the user groups will partition their programs into Plenary, Posted in Disruptive Library Technology Jester ( 129 links from 40 sites) by PeterAtOhioLINK … Read More [Conduct Search from Secondary Source] [...]

  5. Kramer auto Pingback[...] material into a chapter for a book.Not surprisingly, I’m a big fan of James Hilton. I recommend Peter Murray’s synopsis on his blog, as he;s already said everything that I could say.I hope that presentations are going [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *
Human Detection Scheme
(What's this?)
Comment Preview

Subscribe without commenting

From the Disruptive Library Technology Jester (http://dltj.org/), printed on Wednesday the 12th of November 2008 at 5:47:10 PM EST (-0500). The URL to this page is http://dltj.org/article/open-source-for-open-repositories/

[Creative Commons Logo] This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/us/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.