bhl, biostor, and beyond

32
BHL, BioStor, and beyond #BHLat10 @rdmpage http://iphylo.blogspot.com

Upload: roderic-page

Post on 20-Mar-2017

485 views

Category:

Science


0 download

TRANSCRIPT

PowerPoint Presentation

BHL, BioStor, and beyond

#BHLat10

@rdmpage

http://iphylo.blogspot.com

#iamataxonomist

3

Pinnotheres atrinicola Page, 1983

http://www.facebook.com/photo.php?pid=13530101&fbid=10150231079625521&op=1&o=global&view=global&subj=1112517192&id=6810205203

One species of peacrab had a parasitewhat is it?

Sur un type nouveau d'Epicarides Rhopalione uromyzon n. g. n. sp., parasite sous-abdominal d'un Pinnothere

Rhopalione in BHL

Why BHL is cool #1

Accessibility

First impressions, mehOMG its full of plants

Its all old stuff

Where the $#@! are the articles?

More hack, less yack

[to] be able to move some subset of the world from the leverage point of the command line.Steven E. Jones The Emergence of the Digital Humanities

Why BHL is cool #2

It is hackable

No articles? No problem!

Data is available for download

Also an API (and OAI-PMH, yuck!)

So, lets go find the articles

Find articles - simplesTitleVolumePageJournalVolumeStart page end page

Article

Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library doi:10.1186/1471-2105-12-187Mapping between BHL and articles

http://biostor.org/reference/102054

BioStor and Pintrest

BioStor and JournalMap

BioStor and BHL

articles

First impressions, mehOMG its full of plants

Its all old stuff

Where the $#@! are the articles?

Not so cool

Scanning currently dominated by USDA

BHL-Europe: unhackable zombie

Where next?

Findability: DOIs for articles

10.5962/bhl.part.14773

Mickey Mouse is evil

http://artlawjournal.com/mickey-mouse-keeps-changing-copyright-law/

100,000 articles from http://biostor.org (BHL)

1923today

http://biostor.org25

Synthetic documentsS. Michael Machines as readers: A solution to the copyright problemwe proposed to scan works digitally to extract their intellectual content, and then generate by machine synthetic works that capture this content and distribute them free of copyright

Cited, linkable specimens

NMNH Vertebrate Zoology Herpetology Collections11194

CAS Herpetology Collection CatalogMCZ Herpetology CollectionHerpetology Collection (University of Kansas Biodiversity Research Center)961967205818

http://iphylo.blogspot.co.uk/2012/02/gbif-specimens-in-biostor-who-are-top.html

The case for a PubMed Central for Biodiversity

Isnt that, um, PubMed Central?...

Europe PMC

PubMed Central for biodiversityTaxonomic names

Geographic localities

Specimen codes

Handle XML, PDF, OCR text

Store facts as well as documents

Google figured out how to manage abundance while every other media company in the world was trying to manufacture scarcity, and for that we should be grateful. Siva Vaidhyanathan The Googlization of everything (and why we should worry)