bhl @ #tdwg09
DESCRIPTION
Recent developments and new technologies available through Biodiversity Heritage Library. TDWG. Montpellier, France. 12 November 2009.TRANSCRIPT
![Page 1: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/1.jpg)
LINKED LITERATURE BHL DEVELOPMENTSCITEBANKChris FreelandTechnical Director, BHL
![Page 2: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/2.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
BHL Members
![Page 3: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/3.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
BHL Members: US/UK
Academy of Natural Science (Philadelphia, PA) American Museum of Natural History (New York, NY) California Academy of Science (San Francisco, CA) The Field Museum (Chicago, IL) Harvard University Botany Libraries (Cambridge, MA) Harvard University, Ernst Mayr Library of the Museum
of Comparative Zoology (Cambridge, MA) Marine Biological Laboratory / Woods Hole
Oceanographic Institution (Woods Hole, MA) Missouri Botanical Garden (St. Louis, MO) Natural History Museum (London, UK) The New York Botanical Garden (New York, NY) Royal Botanic Gardens, Kew (Richmond, UK) Smithsonian Institution Libraries (Washington, DC)
![Page 4: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/4.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
BHL Members: BHL-Europe Museum für Naturkunde - Leibniz-
Institut für Evolutions- und Biodiversitätsforschung an der Humboldt-Universität zu Berlin
Natural History Museum, UK Narodni muzeum NMP CZ Angewandte Informationstechnik
Forschungsgesellschaft mbH Freie Universität Berlin FUBBGBM Georg-August-Universität
Göttingen Stiftung Öffentlichen Rechts
Naturhistorisches Museum Wien Hungarian Natural History
Museum Museum and Institute of Zoology,
Polish Academy of Sciences University of Copenhagen
Stichting Nationaal Natuurhistorisch Museum, Naturalis
National Botanic Garden of Belgium
Royal Museum for Central Africa, Royal Belgian Institute of
Natural Sciences Bibliothèque nationale de France Museum national d’histoire
naturelle Consejo Superior de
Investigaciones Cientificas Università degli Studi di Firenze Royal Botanic Garden,
Edinburgh Species 2000 John Wiley & Sons limited Helsingin yliopisto UH-Viikki
![Page 5: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/5.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Stats: Now Online
15,000 titles 40,000 volumes 16.4mil pages
Soon: 34,000 titles 65,000 volumes 24mil pages
Oldest book: Schöffer’s Herbarius, 1484.
![Page 6: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/6.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Stats: Usage
Jan – Sep 2009 266,000 visitors 436,000 visits 2.1million
pageviews
Daily average 970 visitors 1,600 visits / day 7,700 pageviews /
dayJan – Sep 2009
Launch to 30 Sep 2009
![Page 7: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/7.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Cloud storage & computing
![Page 8: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/8.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Global, coordinated development New functionality from BHL-Europe
Improved deduplication tools Semantic interface OAIS-compliant preservation infrastructure
Building a community of developers Funded & volunteer RubyBHL: http://github.com/mjy/rubyBHL
PyBHL: http://linux.softpedia.com/get/Programming/Libraries/pybhl-51612.shtml
New partners, new content
![Page 9: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/9.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Open Source Pageturning UI
http://github.com/openlibrary/bookreader
![Page 10: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/10.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Open Software & Development BHL Bits:
Portal code, utilities, services http://code.google.com/p/bhl-bits/
Taxonomic Literature Group Google Group for discussion of “taxonomic
literature & the services required to make literature interoperable within biodiversity research and biodiversity informatics.”
http://groups.google.com/group/taxonlit
![Page 11: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/11.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Open Data
Downloads Simple tab-delimited exports of core data http://www.biodiversitylibrary.org/data/BHLExportSchema.pdf
Data model DB schema as ERD
http://bhl-bits.googlecode.com/files/20090930_BHLDataModel.pdf
![Page 12: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/12.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Services
Names Service Return all occurrences of a name throughout BHL digitized
corpus Documentation: http://bit.ly/2e6sg9
Access to 51million name strings using TaxonFinder 1.4million unique names
Working out a strategy for obscure species Algorithm improvements to detect nomenclatural &
taxonomic acts
OpenURL Facilitate links to citations: protologues, articles,
references Documentation:
http://www.biodiversitylibrary.org/openurlhelp.aspx Useful to Nomenclators, Reference Systems
IPNI Tropicos
![Page 13: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/13.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Services: OpenURL
http://www.biodiversitylibrary.org/openurl?pid=title:3934&volume=14&issue=&spage=301&date=1879
http://www.biodiversitylibrary.org/openurl?pid=title:3934&volume=14&issue=&spage=301&date=1879
http://www.tropicos.org/Name/1200408
![Page 14: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/14.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Services: OpenURL Disambiguation Looking for:
BHL returns:
![Page 15: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/15.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Services: OpenURL Results
![Page 16: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/16.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
How?
Tropicos maintains internal authority list of publications:
Each protologue/reference tied to authority:
Matched Tropicos TitleIDs to BHL TitleIDs:
Throw citations at resolver at regular intervals & cache data in Tropicos
http://www.tropicos.org/Publication/775
http://www.biodiversitylibrary.org/title/3934http://www.tropicos.org/Publication/775 =
http://www.biodiversitylibrary.org/openurl?pid=title:3934&volume=14&issue=&spage=301&date=1879
http://www.biodiversitylibrary.org/openurl?pid=title:3934&volume=14&issue=&spage=301&date=1879
![Page 17: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/17.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Encyclopedia of Life
522,000 species pages linked to BHL #1 referring site
![Page 18: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/18.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Other Consumers
EarthCape Labs Sort/Search capabilities with harvested names YouTube demo:
http://www.youtube.com/watch?v=qw7qw87JTOs
BioGUID / iPhylo BHL Name Timeline & Comparison
http://bioguid.info/bhl/ http://bioguid.info/bhl/compare.php
New Viewer Tagging So much cool stuff we can’t keep up!
http://iphylo.blogspot.com/search/label/BHL
@rdmpage
![Page 19: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/19.jpg)
http://bioguid.info/bhl/compare.php?name1=Physeter+catodon&name2=Physeter+macrocephalus
![Page 20: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/20.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Crowdsourced Articles
http://www.biodiversitylibrary.org/pdfgen/17298
Demo: http://youtube.com/watch?v=oidf3b26jVs
![Page 21: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/21.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Crowdsourced Articles
12,000 PDFs generated through September 2009 4,900 submitted with article metadata Analysis: http://bit.ly/4Jqu9
![Page 22: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/22.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Great, but how to…
display / manage?
meet community demands for bibliography / citation management?
build from more open source tools?
![Page 23: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/23.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Development goals re: citations Create a repository for community-
vetted taxonomic bibliographies. Ability to ingest, display, download, and
index articles so that the BHL can operate as an article repository.
Build from existing community of work around Drupal / Biblio. In use by collaborators
![Page 24: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/24.jpg)
“something like GenBank or NameBank for citations…”
So, CitationBank…or CiteBank (saves chars)
Need…
![Page 26: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/26.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Crowdsourced Articles
PDFs from BHL pushed into Drupal/Biblio:
![Page 27: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/27.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
http://citebank.biodiversitylibrary.org/search
![Page 28: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/28.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
http://citebank.biodiversitylibrary.org/node/47423
![Page 29: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/29.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
http://www.biodiversitylibrary.org/pdf1/000295100017298.pdf
![Page 30: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/30.jpg)
![Page 31: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/31.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
CiteBank boundaries
Book
Citation
Pageturning UIPDFOCR
eBook/Kindle
Stored *somewhere* & retrievable via HTTP URI
CitationCitationCitation
Bibliography
CiteBank
![Page 32: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/32.jpg)
BHL Data Flow – Sep 2009
CiteBank
![Page 33: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/33.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Copyright
Bold statements that need some good legal counsel: Citations don’t have copyright
Unless you get them from OCLC, other services
Bibliographies have copyright They’re a scholarly work
Underlying content has copyright Except when it doesn’t
![Page 34: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/34.jpg)
Up for discussion…
![Page 35: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/35.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Who can upload & edit?
Trusted repositories? Approved specialists? BHL Librarians? People in this session? Citizen scientists? 6th graders? Rod Page?
![Page 36: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/36.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
What about duplicates?
3 Bibliographies had Syst. Nat. All 3 in different reference manager formats All 3 had variant forms of title:
Syst. Nat.
Systema Naturae
Systema naturae per regna tria naturae
Library catalogues:Caroli Linnaei...Systema naturae per regna tria
naturae :secundum classes, ordines, genera, species, cum characteribus, differentiis, synonymis, locis.
![Page 37: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/37.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Accuracy
How clean is clean? How dirty is dirty? What’s good enough?
How to Rank Gold/Platinum ranking? Dirty Bucket / Clean Bucket?
![Page 38: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/38.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Right technologies?
“But Drupal’s awful…just ask ___ for their bad experience.”
“Drupal’s great!”
“MySQL won’t scale” “MySQL’s great!”
![Page 39: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/39.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Next steps
Bring hardware online at MBL Have one point of redundancy By Q1 2010
Bring BHL-Europe & other nodes online In conjunction with DuraCloud & other solutions
Release CiteBank for beta & sandbox testing Beta at http://citebank.biodiversitylibrary.org Sandbox at http://sandcite.biodiversitylibrary.org Production release by Q2 2010
Integration of BHL-Europe tools & content
![Page 40: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/40.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Coming soon
Darwin’s Library AMNH, NHM, CUL, BHL (MOBOT) Funded by NEH/JISC Digitization of Darwin’s personal library,
with annotations New interfaces for recording, indexing,
displaying annotations Inhouse scanning from
partners/contributors
![Page 41: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/41.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Fun: BHL In Your Pocket!
Content now available in EPUB format Used by Stanza, transferable to Kindle
Blog post by John Mignault (NYBG): http://john.mignault.net/blog/2009/10/28/first-bhl-e-book-
experiments/
![Page 42: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/42.jpg)
![Page 43: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/43.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Links & such
Biodiversity Heritage Libraryhttp://biodiversitylibrary.org
CiteBank beta
http://citebank.biodiversitylibrary.orgCiteBank sandbox
http://sandcite.biodiversitylibrary.org
Go play!
http://twitter.com/BioDivLibrary
Follow BHL on
![Page 44: BHL @ #TDWG09](https://reader033.vdocument.in/reader033/viewer/2022052315/5559faacd8b42ad00a8b4ad1/html5/thumbnails/44.jpg)
Biodiversity Heritage Library: http://biodiversitylibrary.org
Thanks!
Chris FreelandTechnical Director, BHL
Director, Center for Biodiversity Informatics, Missouri Botanical Garden
[email protected]://twitter.com/chrisfreeland
Presentation online through TDWG & athttp://www.chrisfreeland.com