Solr-ized MARC Record Catalog

Posted on 2 minute read

× This article was imported from this blog's previous content management system (WordPress), and may have errors in formatting and functionality. If you find these errors are a significant barrier to understanding the article, please let me know.

Rob Casson of Miami University announced this weekend the beta availability of their video catalog. In a subsequent posting, Rob describes the user interface elements. Rob and the crew at Miami are seeking feedback on the interface, so if you have some be sure to offer it to them.

A couple of notes on the mechanisms Rob is using. Apache Solr is an open source enterprise search server based on the Lucene Java search library (also an Apache project). You can think of Lucene as the raw indexing and search engine with Solr layered on top to provide a non-Java interface to a rich feature set. What Miami has done is extract all of the bibliographic and related item records out of their Innovative Interface system, written programs to transform that data into XML, indexed it with Solr/Lucene and created a search interface.

Now what makes this really interesting is how much useful information is in the MARC record that doesn't currently find its way into the WebPAC interface. For instance, this snapshot shows the facets where the MARC record has fielded data that can be turned into browsable lists:

Browse Catalog by LC ClassExample from Miami University's video catalog showing the available fielded data.

The corresponding WebPAC pre-search limits (for keyword searching) only includes a subset of languages, media formats, locations and does not include topic, genre, LC/SuDoc classes, and coverage date. In other words, there is a whole lot of information in the MARC record that isn't being exposed in the normal WebPAC interface. Since Miami is in full control over the data in the Solr-based index, though, they are free to include as much or as little in the end-user interface.

Combined with faceted browsing, this makes for a very simple and quick interface to narrow down a large set of records. At the time of writing this entry, Miami's video library consisted of 10,538 records. In three clicks, one can narrow that down to the 12 French-language comedy films in DVD format:

Miami Video Catalog browse for French comedies in DVDExample from Miami University's video catalog facets after browsing for French-language Comedies on DVD.

From this screen, to see what is available in all media formats one need just click the red 'X' to the right of "DVD". Also note the "RSS Feed" symbol on the right side of this interface snapshot. The results of any search/browse are immediately available as an RSS feed -- a very convenient way to receive notifications of new titles that match this search!

Congratulations, Rob and everyone else at Miami that brought this interface into existence. It is a nice model and something we all can learn from through your experiences. Please keep us updated as the project continues.