the biodiversity heritage library: corn-fed, missouri raised, going global
DESCRIPTION
The Biodiversity Heritage Library: Corn-fed, Missouri Raised, Going Global. Martin R. Kalfatovic. Missouri Botanical Garden Staff Meeting. August 19, 2009. Saint Louis, MO.TRANSCRIPT
The Biodiversity Heritage LibraryCorn-fed, Missouri Raised, Going Global
Martin R. KalfatovicSmithsonian Institution Libraries
Missouri Botanical GardenAugust 19, 2009
• 2003. Telluride. Encyclopedia of Life meeting
• February 2005. London. Library and Laboratory: the Marriage of Research, Data and Taxonomic Literature
• May 2005. Washington. Ground work for the Biodiversity Heritage Library
• June 2006. Washington. Organizational and Technical meeting
• August 2006. New York Botanical Garden. BHL Director’s Meeting.
• October 2006. St. Louis/San Francisco. Technical meetings
• February 2007. Museum of Comparative Zoology. Organizational meeting
• May 2007. Encyclopedia of Life and BHL Portal Launch. Washington DC.
American Museum of Natural History (New York)
Academy of Natural Sciences Philadelphia
California Academy of Sciences (San Francisco)
Field Museum (Chicago)
Natural History Museum (London)
Smithsonian Institution Libraries (Washington)
Missouri Botanical Garden (St. Louis)
New York Botanical Garden (New York)
Royal Botanic Garden, Kew
Botany Libraries, Harvard University
Ernst Mayr Library of the Museum of Comparative Zoology, Harvard University
Marine Biological Laboratory / Woods Hole Oceanographic Institution
Scanning PartnerInternet Archive
ContributorUniversity of Illinois, Urbana-Champaign
ContributorCalifornia Digital Library
ContributorLibrary of Congress
Initial grant from the MacArthur and Sloan Foundations (as part of the Encyclopedia of Life grant)
Additional support from parent institutions
Supplemental grants in place for specific development (e.g. Moore Foundation for Fedora)
Additional grants being actively pursued by BHL and individual members
TheEncyclopedia of Life
H
InformaticsMarine Biological Laboratory
Missouri Botanical Garden
Species Pages & SecretariatSmithsonian
Education and OutreachSmithsonian & Harvard
Synthesis CenterField Museum
How much is there:
Core literature pre-1923: 100 million pages (?)
All pre-1923: 120-150 million pages
All literature: 280-320 million pages
More than:
36,000 volumes
15 million pages
Only 290 million to go!
Avg. monthly growth rate
1,500 volumes
600,000 pages
See you in 2048!
Now Online
Marine Biological Laboratory/WHOI
– Marine monographs
– General Science
Museum of Comparative Zoology
– MCZ publications
– Herpetology monographs and serials
– Ichthyology monographs and serials
Rough Selection
University of Illinois
– Fieldiana
– Natural history of Illinois
American Museum of Natural History
– AMNH publications
– Ornithology
Natural History Museum
– NHM publications
– Major natural history general serials
Rough Selection
Smithsonian Institution Libraries
– Smithsonian publications
– Entomology collection
– Marine mammals
– Fishes
– Selected special collections materials
– Cooperate with other botanical collections
Rough Selection
Botany Collections
Missouri Botanical Garden,
New York Botanical Garden,
Harvard Botany Libraries, and
Royal Botanic Garden, Kew
Smithsonian Libraries Botany
– will cooperatively develop a methodology for botanical publications and botanical collections from other BHL members will fill in gaps
Rough Selection
Collections Coordinator on board in February 2009.
Bianca Lipscomb, based at the Smithsonian, will coordinate material selection across the BHL and contributing partners
Rough Selection
How to make THIS into 0’s and 1’s
Selection Tools:
Combined Serial list for selection of title to scan to avoid duplication of effort
Monographic “de-duping” algorithm
OCLC Collection Analysis
Mass Scanning Workflow• Bid Lists• Serials Management• Pick Lists• Packing Lists• Monographic Management• Local data flow• WonderFetch(tm)
• Return of data• Return of material• Billing
Select Book ~Pull from Shelf
Review Physically and Metadata
Establish viability and create Wonderfetch
Send to IA scanning center
Book is scanned & QA
Page images loaded
Derivatives created
Book returned to library
Files harvested from IA portal to BHL
Taxonomic Intelligence Added
Available through BHL
BHL Scanning
Internet Archive
• 501(c)(3) organization
• Dedicated to “Universal Access to Human Knowledge”
• Founder of the Open Content Alliance
• Provides:– Mass scanning– Archival storage of files– Image processing– Technology
development
Single Scribe Machine
Built by the Internet Archive
Human operated3,500 page per shift per day
Northeast Regional Scanning Center
– 10 Scribe machines
– MBL/WHOI
– Harvard
Jersey City Facility
– 10 Scribe machines
– AMNH
– NYBG
University of Illinois
– 2 Scribe machines
Natural History Museum, London
– 1 Scribe machine
Missouri Botanical Garden
– Non-Scribe operation
Washington, DC
– 1 Scribe machine at Smithsonian Libraries
– 11 Scribe facility at Library of Congress (FedScan)
Acquiring Other
Content
What about other scanning?
• Missouri Botanical Garden Library continuing in-house scanning process
• Other BHL members also have non-Internet Archive scanning operations
Ingest of other collections
12,000,000 pages+ from other
Internet Archive scanning partners
Biodiversity Heritage Library Permission ProcessWorking with non-profit publishers for sharing with the BHL
To digitize and mount works under copyright BHL must obtain permission from the copyright holders.
Many biodiversity journals and monographs are published by non-profit institutions or learned societies whose mission is to promote research and learning.
Some of these institutions have not sold their rights to commercial publishers and are open to sharing with the BHL.
Current Permission Agreement:The agreement is non-exclusive. The copyright holder can use the content for other purposes.
It does not involve any transfer of copyright to the BHL or its member institutions.
It “grants to the current and future member Participating Institutions of the BHL a world-wide, nonexclusive, perpetual, irrevocable, royalty-free, sublicenseable license to digitize and use the Titles (as identified above) in connection with the BHL, including the right to make reproductions in digital form, publicly display, and disseminate the Titles via the BHL and related websites, and create derivative works in digital form based on the Titles. The scope of this license is equivalent to an open source license, which permits others to use, reproduce, supplement, modify, create derivatives, and otherwise use the Titles, for any and all non-commercial purposes, with proper attribution to the Licensor as the source.”
Process:There is room for some modification of the wording of the draft permission document. When it is finalized, BHL Director sign for the BHL and the Editor-in-chief or Chairperson of the society signs.
The process is usually very smooth. >60 titles to date, many published in the US, some of which are published in Europe and Asia.
Permissions Database
But what’s this all mean
to me!?
BesidesCalamari?
Plant Names
Specimens
Plant Names
Plant NamesSpecimensDescriptions
Plant Names
Plant Names
Citations
Built from a variety of new and existing sources
Views available for varying levels of expertise from novice to expert
Legacy literature a key component of the EOL species pages
Encyclopedia of Life Species Pages
? ? ? ? ?
BHL Portalhttp://www.biodiversitylibrary.org
Citehttp://cite.biodiversitylibrary.org
Internet Archivehttp://www.archive.org
Ubiohttp://www.ubio.org
Links
Credits
• Chris Freeland
• Suzanne Pilsk
• Tom Garnett
Thanks for sticking around!
The Biodiversity Heritage LibraryCorn-fed, Missouri Raised, Going Global
Martin R. KalfatovicSmithsonian Institution Libraries
Missouri Botanical GardenAugust 19, 2009