inotaxa markup and its relations to vibrant

5
Virtual Biodiversity ViBRANT INOTAXA markup and its relation to ViBRANT Chris Lyal Natural History Museum [email protected] ViBRANT Virtual Biodiversity Workpackage 7 Biodiversity literature access and data mining

Upload: vbrant

Post on 19-Jan-2015

345 views

Category:

Documents


3 download

DESCRIPTION

 

TRANSCRIPT

Page 1: INOTAXA markup and its relations to ViBRANT

Virtual BiodiversityViBRANT

INOTAXA markup and its relation to ViBRANT

Chris LyalNatural History Museum

[email protected]

ViBRANTVirtual Biodiversity

Workpackage 7Biodiversity literature access anddata mining

Page 2: INOTAXA markup and its relations to ViBRANT

Virtual BiodiversityViBRANT

2 of

taXMLit and INOTAXA

5

Joint project with Anna Weitzman, Smithsonian InstitutionPrior work with OU team in ABLE projecttaXMLit• XML schema for taxonomic literature - atomised markup• covers complete papers, not just treatments• markup route via TEI-Lite+ / taXMLit-simpleINOTAXA - http://www.inotaxa.org• developed using user needs feedback• simple and Boolean searches and browse facility • current contents: ca. 900pp legacy and recent literature• developing a further ca 50,000 treatments for simple search• being developed from pilot to production

Page 3: INOTAXA markup and its relations to ViBRANT

Virtual BiodiversityViBRANT

3 of

What we will do in ViBRANT

5

1. Provide assistance to OU (and other WP 7 members if required) on subject-specialist taxonomic issues for markup.

2. Provision of taXMLit & taXMLit-simple schemas + mark-ups + documentation. - next week

3. Mapping between other schemas / DTDs (taxonX, TaxPub) and taXMLit-simple (format-based, without deep atomisation)

- Month 3

4 Search and information retrieval from marked-up documents, using INOTAXA- Will be made available with documentation by month 12 for development

5 Upload tool to put content into back-end database- Will be made available with documentation by month 12 for development

6 Review of pilot mark-up processes- Month 20

Page 4: INOTAXA markup and its relations to ViBRANT

Virtual BiodiversityViBRANT

4 of

How we are doing it

5

• Close liaison, particularly with OU team, on search system.

• Liaison with PenSoft, KIT team, on taXMLit-simple / taXMLit schemas

• INOTAXA database and search/retrieval system currently being re-built using PHP and a MySQL database. Funded outside ViBRANT.

• Upload tool for texts in taXMLit and taXMLit-simple to be built. Funded outside ViBRANT.

• Integration of INOTAXA and upload tool with Scratchpad to be discussed once full documentation available.

• Integration of INOTAXA with other WP7 outputs (with other WP7 participants).

Page 5: INOTAXA markup and its relations to ViBRANT

Virtual BiodiversityViBRANT

5 of

Who are our users & how will they engage?

5

Taxonomists via the Scratchpads

Taxonomists directly to INOTAXA.org • currently sited on NHM server• will be sited also on Smithsonian server• metrics not currently available• INOTAXA built with user involvement

Wider users via EoL (currently 866 pages; pages viewed in December = 78; 152 page views)