applying taxonomic intelligence to digitization initiatives ..but first

64
MBL WHOI Library Marine Biological Laboratory Woods Hole Oceanographic Institution MBL WHOI Library Marine Biological Laboratory Woods Hole Oceanographic Institution © 2007 MBLWHOI Library www.mblwhoilibrary.org Applying Taxonomic Intelligence to Digitization Initiatives ..but First.. Avoiding Extinction: Translating the Value of the Research Library Memorial Sloan Kettering Center Center Library Elsevier: Library Connect Colloquium Cathy Norton MBLWHOI Library Director Deputy Director Biodiversity Heritage Library October 24, 2007

Post on 21-Oct-2014

2.390 views

Category:

Economy & Finance


1 download

DESCRIPTION

Applying Taxonomic Intelligence to Digitization Initiatives ..but First... by Cathy Norton, Marine Biological Laboratory / Woods Hole Oceanographic Institution Library. Avoiding Extinction: Translating the Value of the Research Library / Memorial Sloan Kettering Center Center Library. October 24, 2007. New York, NY.

TRANSCRIPT

Page 1: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Applying Taxonomic Intelligenceto Digitization Initiatives ..but

First..

Avoiding Extinction: Translating the Value of the Research LibraryMemorial Sloan Kettering Center Center LibraryElsevier: Library ConnectColloquium

Cathy NortonMBLWHOI Library Director

Deputy Director Biodiversity Heritage LibraryOctober 24, 2007

Page 2: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Are we road kill yet on Google’s march to

dominance?

Page 3: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Ten top trends in librarianshipIntrospection: who are we and

what value do we bringBudget pressure: accountability

for results, and rising user expectations

Growth of consortia an collaborative group.

High demand for new skills; espIT fluency

Restructuring work and jobs

OCLC Oct, 2007Karen Calhoun

• Find it get it - opac sucks debate

• Complex, rapidly changing e environment

• Aggregation ,data/metadata management. Where do libraries fit?

• Assessment: Evidence - based decision making

• Usability and user centered design: social software participation

Page 4: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Nielson/NetRatings : top 10 social sites and blog sites. “MySpace.com is still the top ranked US social-networking site with 58.6 million unique

visitors in September, according to a custom list of top US social networking sites.”

Here are the Oct. 2007 rankings:

1. MySpace2. Facebook3. Classmates Online4. Windows Live Spaces5. AOL Hometwon6. Reunion.com7. LinkedIn8. AOL People Connection9. Club Penguin10. Buzznet

Page 5: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Time’s Person of the year: You2006

Its a story about community and collaboration on a scale never

seen before. Lev Grossman

Page 6: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Networked World90% of the total general public have used the internet for >fouryears

In 3 Years:Search engine use has gone from 71% to 90%Email use has gone from 73% to 97%Blogs have gone from 16% fo 46%

�Library web sites have gone from 30% to 20%52% believe their information is kept private or more secure than 2 years ago 63% believe banking is private11% believe a library web site is private!

However 60% TRUST LibraryOclc,2007

Page 7: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Director’s paranoia!• Library directors have used the internet for more than

10 years• Are more likely to use an alias on line• 75 % use face book but only once a week and their

motivation is for browsing services and content• 25% think email is not secure

Page 8: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Future is now?• Google(Search, Ads & Apps)

Dominance?• Global Change ( China, India, USA)• U.S. Debt Charges ($1 trillion)• Oil shock ( $4.50?+/gallon)• Major Technology Shifts (PDA,

broadband)

Page 9: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Second Life

• Second Life Library• Travlin Librarian

– Audio books and other media discusssion

– Katt KongoMetaverse messager

Page 10: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Second Life Medical Library• Consumer Health• Educational, Clinical

or Research• Experiments and

development of new ways of interaction between users and libraries.

Page 11: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Librarian Avatars

• Networks are changing the behavior of users.. B at the point of need because: Sharing seems to trump privacy in the new age. We need to invite our patrons “in” to create content and a place to share ideas. Privacy, 2007

Page 12: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

• Biodiversity Heritage Libraries• Taxonomic Intelligent Literature

Page 13: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

NMFS - 1871

MBL - 1888

WHOI - 1930

USGS - 1960

SEA - 1971

WHRC - 1985

Woods Hole Scientific Community

This library serves the MBL, WHOI, USGS, NMFS, SEA, WHRC,

and other scientific groups in the area.

Facing a new dynamic phase

Page 14: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

The vision

Imagine an electronic page for each species of organism on Earth, available everywhere by single access on command. The page contains the scientific name of the species, a pictorial or genomic presentation of the primary type specimen on which its name is based, and a summary of its diagnostic traits. The page opens out directly or by linkage with other databases such as ARKive, Ecoport, and GenBank. It comprises a summary of everything known about the species’ genome, proteome, geographic distribution, phylogenetic position, habitat, ecological relationships, and, not least, its perceived practical importance for humanity.

E. O. Wilson, 2003.

Page 15: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 16: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

• Legacy Taxonomic Literature available in museums has limited access

• Much of it is rare

• Systematic literature depends on the historic literature

• The cited half-life of natural history is longer than that of any other scientific domain

• 90% of Biodiversity Information is in these libraries

• 90% of Biodiversity is in 3rd world countries like Africa and South America

Page 17: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

NAME of Consortium -

BioDiversity Heritage Library

Web Presence!

Page 18: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 19: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 20: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 21: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 22: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 23: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 24: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 25: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 26: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 27: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 28: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Page 29: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

In the end… simplicity…• http://bhl.si.edu/

• www.EOL.org

Page 30: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Principles of OCA• The OCA will encourage the greatest possible degree of access to and reuse of

collections in the archive, while respecting the rights of content owners and contributors.

• Contributors will determine the terms and conditions under which their collections are distributed and how attribution should be made.

• The OCA need not be obligated to accept all content that is offered to it and may give preference to that which can be made widely accessible.

• The OCA will offer collection and item-level metadata of its hosted collections in a variety of formats.

• The OCA welcomes efforts to create and offer tools (including finding aids, catalogs, and indexes) that will enhance the usability of the materials in the archive.

• Copies of the OCA collections will reside in multiple archives internationally to ensure their long-term preservation and accessibility to all.

Page 31: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

“All accumulated information of a species is tied to a scientific name, a name that serves as a link between what has been learned in the past and what we today add to the body of knowledge.”

~ Grimaldi & Engel, 2005, Evolution of the Insects

Page 32: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

• Information about named groups (taxa) of organisms (taxon-related information)

• Extends back at least 1000 years

• Books, journals, surveys• Museum specimens,

herbaria• In many languages and is

distributed

From T.E. Glover, The Fishes of Southwestern Japan, c.1870

Page 33: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

The challenge for contemporary DIGITAL libraries

But … names of organisms change over time

Goal:

Use one name to find the content for all names

Page 34: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Names are even misspelled, such as Loligopealei

Loligo pealeiiLoligo pealiiLoligo pealei

Page 35: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Homonyms and polysemes VirginiaPeoplePlacesAnimals

And of course Anorexia nervosaHabeas corpus, and Etcetera etcetera

Peranema– the fern

Peranema– the euglenid

Page 36: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Libraries

PublishersMuseums

Federal Agencies

Search engines

Federated databases

Students and researchers

106000515358003371215585018700Red spotted newt

COML

Page 37: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Serious challenges in federated environments

One organism

4 scientific names

4 maps

We want one map

Page 38: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

• Metadata – such as names – provided the power to index and search

• Classifications allowed us browse, navigate, and run hierarchical searches

Classifications

Page 39: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Reconciliation – linking alternative names for the same organism

A query initiated with any name, can be expanded to all names and will unify data associated with each

Page 40: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

And the other issueconnecting ALL data about all organisms together?

• Data stores – mostly was not happening (despite the success of Genbank)

• Search engines – not taxonomically intelligent and missed 90%• Hyperlinks – slow, tedious, and unstable• Dynamic links – using variables, databases, and code (e.g.

micro*scope)• Federation – cluster of partners playing by the same rules (e.g.

OBIS)• Data transfer standards – rules that anyone can use (e.g. DiGIR,

TAPIR, UBDB)• API’s – spigots from databases• Aggregation (mashups) – the chosen way

Page 41: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Taxonomic intelligence is the inclusion of taxonomic practices, skills and knowledge within informatics services to manage information about organisms

• All names & all Classifications ClassificationBank

• Alternative names reconciled

• Similar names disambiguated

• Exploit hierarchies to browse and search, build a comprehensive classification

• Improve performance with federated systems

• Read documents, web sites, databases and taxonomically indexing the content

• Create a unified portal to information about organisms on the internet

Page 42: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Taxonomically intelligent aggregation technology builds portals to distribute information about

organisms

• There are many resources out there, but no single comprehensive resource for species information

• Rather than building another big database, we can create a new way to link existing information using an aggregation portal

• This places little or no burden on data providers

• Protecting ownership and diversity of initiatives

Page 43: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

uBioRSS Taxonomically Intelligent RSS Feed Aggregator

Page 44: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

MBL WHOI Library –Woods Hole authors’publications

Page 45: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

MBL WHOI Library –Woods Hole species publications

Page 46: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Taxonomically intelligent scientific text parsing

Page 47: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Taxonomically intelligent scientific text parsing

Page 48: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Taxonomic intelligence works miracles

• It will benefit any initiative that uses distributed and heterogeneous information about biology

• Distributed content on the same species can be drawn together because different names will be standardized through reconciliation

• We can read documents, find names, catalog and taxonomically index documents

• Produce a framework around which we can organize and assembleremote and local content

Page 49: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

“Taxonomic intelligence”enhances

search

Page 50: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

• Documents go to Internet Archive for OCR and storage

• The documents are added to the BHL collection

• uBio checks the BHL collection for new documents

• The documents are scanned for names

• TaxonFinder adds new strings to Namebank

• Document markup with anchors

• TaxonFinder adds all namebankIDs to Taxonomic Index

• This index is called upon by various applications...

Page 51: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Biological Data Revolution

Biomedical Knowledge Biodiversity Knowledge

Page 52: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Scientific Names

No Complete List of Scientific Names

112,133 741,872�

49,382*

*Scientific Names ≠ Species

Published Variants

Escherichia coli

Objective SynonymsBacterium coliBacillus coli

Mis-spellings

Escheria coli

Page 53: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Taxonomic Knowledge

Page 54: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Data, Data, Everywhere

Page 55: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

The ‘biopipes’ concept

BIOPIPESBIOPIPES

NomenclatorZoologicus

Then, dragged the functions (pipes) you wanted onto your desktop

get data blast get tree get matching clade name

get ITIS preferred name

GoogleEarth

get all names

reconciled search

myEoL page

get subset ofEoL species site

Original publicationinformation

Get originaldescription

And, of course, saved the functionality to apply to the next data

Page 56: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Proceeding Boldly

Page 57: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Progress

MID 2007 --July• EoL funds started to flow

By - MID 2008• EOL Informatics Teams--Core bioinformatics infrastructure

(Taxonomic Intelligence and high priority marine modules of the Universal Biodiversity Data Bus) will be in place

• BioPipes for OBIS, BOLD, GENBANK, EoL, BHL

• List of most marine genera

• EoL with agreement show content from FishBase, SeaLifeBase, CephBase etc.

• RSS feeds and other alerts established to inform interested parties of new content

Page 58: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Progress

• BHL• 10 Scribes installed in Boston

– MBL/ Harvard/SI/ BNH/MOBOT/Field Museum all scanning

– AMNH/NYBG will use NY PUBLIC– Close to 2 million pages- AS OF NOV 07

Page 59: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Challenges• User/librarian experience• Technology• Partnership & alliances• Capacity for Leadership and Change• Change Leadership or change management• Enemy ourselves?

Page 60: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Back of the house • Opac…. bad• I want information….• Find it, get it, do it!• Find books and more….

• Concentrate on FRONT/ USERS.

Page 61: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Libraries are…

• Libraries are about learning and building communities.

• Make a virtual tour of your library• Get staff up to speed.. 12 week course!

Page 62: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

What role will librariesplay once the scanning is

done?

Will you be negotiators like you are now with serials?

Public domain publications restricted for EVER by contract or open?

Page 63: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

Road map

• Be a community• Be a working environment• Be creative• Become and Informatics Center with scientific

appointments.• Think translation not transactions!• Stay alive professionally

Page 64: Applying Taxonomic Intelligence to Digitization Initiatives ..but First

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

MBL WHOI LibraryMarine Biological Laboratory

Woods Hole Oceanographic Institution

© 2007 MBLWHOI Library www.mblwhoilibrary.org

AcknowledgmentsNeil SarkarDavid RemsenDavid PattersonDiane RielingerSteve AbramsOCLC

Martin KalfatovicTom GarnetGraham HigleyConnie Rinaldo

A.W. Mellon FoundationAlfred P Sloan Foundation

John D and Catherine T MacArthur Foundation