building a large scale open source repository at ohiolink a cautionary tale in three acts 2007 lita...

26
Building a Large Scale Open Source Repository at OhioLINK A Cautionary Tale In Three Acts 2007 LITA National Forum Thomas Dowling [email protected]

Upload: neal-powers

Post on 16-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Building a Large Scale Open Source Repository at

OhioLINK

A Cautionary Tale In Three Acts

2007 LITA National ForumThomas Dowling

[email protected]

Act I“The Plan”

(ca. 2006)

OhioLINK Digital Media Center

dmc.ohiolink.eduArt and Architecture

1999: Art Collection (AMICO), Saskia

2000: Art and Architecture from Univ of Cincinnati

2001: Akron Art Museum Images

2002: WPA Prints By Cleveland Artists

Video Collections

2002: Foreign Language Instruction Videos

2002: Encyclopedia of Physics Demonstrations

2003: Educational Films And Documentaries (FFHS, Ambrose)

History and Archives

2001: Wright Brothers Collection

2001: E.W. Scripps Papers

2001: Mayan Archeology Photos

2001: Sanborn Historic Maps

2002: Greek and Latin Inscriptions (squeezes)

2003: Lake Erie’s Yesterdays

2003: Nat’l Underground RR Freedom Ctr Images

2004: Historic Maps of Akron

2005: Kent State Shootings Oral Histories

Science Collections

1999: Landsat Satellite Images of Ohio2003: Borror Laboratory of Bioacoustics Digital Animal Sounds(1 | 2)

2004: Geology Digital Photographs(1 | 2)

2004: Ohio Ag Experiment Station Forestry Images

2004: Reproductive Physiology Animations

2004: Dolphin Embryos

OhioLINK Digital Media Center

dmc.ohiolink.edu

Oh By The Way

Electronic Journal Center: 9 million locally loaded journal articles

Electronic Theses and Dissertations: 12,000 ETDs from 17 schools

Coupla thousand locally loaded e-books

~80 million records from A&I databases ~10 million bib records in union catalog

OhioLINK Services, June 2006

EJC

DMC

ETDC

IR

Catalog A/I DBs

OhioLINK Services, June 2006

EJC

DMC

ETDC

IR

Ingest, Store, Discover, Display

Ingest, Store, Discover, Display

Ingest, Store, Search, Display

Ingest, Store, Discover, Display

Catalog A/I DBs

OhioLINK Services, June 2006

EJC

DMC

ETDC

IR

Ingest, Store, Discover, Display

Ingest, Store, Discover, Display

Ingest, Store, Discover, Display

Ingest, Store, Discover, Display

Catalog A/I DBs

Com

mercia

l, Pro

pie

tary

OhioLINK Services, June 2008

EJC

DRC

ETDC

IR

Ingest

Store

Discover

Display

Catalog A/I DBs

Open S

ource

Open Source AdvocacyThe Early Days

Why Open Source?Beyond Evangelism

Best of breed software Server software: Apache, Tomcat Repository: Fedora Middleware: XTF, DLXS Search: Lucene, Solr XML Tools, programming languages

Freedom to play (Freedom to fail?)

Why Open Source?The Received Wisdom

Why Open Source?The Situation At OhioLINK

Which Adds Up To…

Ingest: XML + HTTP + programming language of choice

Repository: Fedora Discovery/Interface: XML +HTTP +

XTF? DLXS? Web programming language of choice?

Search: Lucene/Solr

…Or In Other Words…

Journal Content

Journal Content

Journal Content

E-Book Content

Image Database

Dissertation

Research Report

Ingester

Ingester

Ingester

Ingester

Ingester

Ingester

Ingester

Fedora Repository

User Interfaces Solr Index Metasearch

XML XSLT

PerlPHP

Ruby…

Act II“Reality Rears Its Ugly

Head”

The Thing About Conference Planning

“OhioLINK, the Ohio Library and Information Network, has embarked on a large-scale project to migrate several major digital library services from disparate, commercial/proprietary platforms to a unified repository architecture built with open source tools.”

Faithfully submitted,Thomas DowlingDecember 15, 2006

Deadlines

E-Journals – commercial software support officially ends December 2006.

Image databases – software license and Sun server maintenance renewals due January 2007.

E-Books – ~7500 books licensed in Fall, 2006.

Institutional repository – First collection due in September 2007.

Full Steam Sideways

Image Collections: University of Michigan DLXS

E-Books: CDL XTF Institutional Repository: DSpace E-Journals: ??? ETDs: ??? (DSpace?)

Joy. Silos.

Act III“Lessons [Eventually]

Learned”

What’s Wrong With This Plan?

“…migrate several major digital library services from disparate, commercial/proprietary platforms to a unified repository architecture built with open source tools.”

“The best is the enemy of the good”

— Voltaire1694–1778Writer, Philosopher,IT Manager

One Repository To Rule Them All

And In The Darkness Bind Them