digitised content in an api world

18
Digitised Content in an API World Alastair Dunning, JISC a.dunning AT jisc.ac.uk Resource Discovery Taskforce Meeting London, 20 th April 2011 Acronym Count: 8

Upload: alastair-dunning

Post on 05-Dec-2014

679 views

Category:

Documents


4 download

DESCRIPTION

Digitised content is often created behind tailored interfaces. How can the world of open data and APIs allow for different interfaces be built over the same content for different audiences

TRANSCRIPT

Page 1: Digitised Content in an API world

Digitised Content in an API World

Alastair Dunning, JISCa.dunning AT jisc.ac.uk

Resource Discovery Taskforce MeetingLondon, 20th April 2011

Acronym Count: 8

Page 2: Digitised Content in an API world

types of content I’m talking about: digitised text, manuscripts, images, film footage, audio archives, newspapers, documents, music (and their metadata). ie. the type of stuff at http://www.jisc-content.ac.uk/

Page 3: Digitised Content in an API world

such content tends to be locked down behind interfaces. usage is tied to technical infrastructure and interface

Page 4: Digitised Content in an API world

the trouble with current resources is that they demand certain ways of analysing and representing the resource – and they constitute the creators’ way of seeing the world, not the users’

Page 5: Digitised Content in an API world

SEARCH then LIST

Page 6: Digitised Content in an API world

but what would such content look like in an RDTF world, where data and service are separated? What do APIs allow to happen?

Page 7: Digitised Content in an API world

an API driven world would allow much greater flexibility over analysing a digitised dataset, i.e. different intellectual questions to be asked

Page 8: Digitised Content in an API world

and also different ways of visualising that digital content

Thanks David McCandless! http://www.informationisbeautiful.net/visualizations/

Page 9: Digitised Content in an API world

and also of different ways of tailoring content for different audiences – different interfaces for schools, undergrads and researchers – all over the same content

Page 10: Digitised Content in an API world

more importantly, it can help break down the notion of a collection, and the related silos

Page 11: Digitised Content in an API world

http://www.connectedhistories.org is a great example. on the surface, it appears like any other resource

Page 12: Digitised Content in an API world

but it is based on an API architecture which allows for aggregation and cross-search of 11 enriched metadata sets for the resources listed above(note: the aggregators created the enriched metadata and the APIs not the resource provider, and are testing the business model behind this)

• British History Online• British Museum Images

• Burney Newspaper Collection, 1600-1800• Charles Booth Archive

• Clergy of the Church of England Database, 1540-1835• House of Commons Parliamentary Papers

• John Johnson Collection of Printed Ephemera• John Strype’s Survey of London

• London Lives, 1690-1800: Crime, Poverty, and Social Policy in the Metropolis

• Origins.net• Proceedings of the Old Bailey Online, 1674-1913

Page 13: Digitised Content in an API world

so not only has an API architecture created a new resource for early modern British history, aggregating many disparate datasets

but others can come along and create their own interfaces – including or excluding elements of 11 resources (and adding others) as required

within these sources, there is rich metadata about places, areas, streets, names, crimes, genders, ages, occupations – these can be exploited in myriad ways

Page 14: Digitised Content in an API world

indeed, the team will be incorporating map data and archaeological data

from BL and Museum of London to allow for spatial

visualisation via geographical data (maps in this case) and

mashing of historical data (largely about events and

people) with archaeological data (largely about objects)

Map - First Series Ordnance Survey, c.1805 from British Library via

http://visionofbritain.org.uk/maps

Page 15: Digitised Content in an API world

and think how this could work when you start bringing datasets and content from different subject areas – economics, anthropology, fashion

Page 16: Digitised Content in an API world

on a practical note: don’t forget sustainability – the pressure of sustaining dataset and digitised content is relaxed for the collection holder; looking after the interface less important

Page 17: Digitised Content in an API world

short-term wins

content and enthusiasm is out there, although disparate – see The New History Lab article

visualisation can produce eye-catching success short bursts of funding can make things happen scholarly labs at KCL, UCL, Sheffield and elsewhere

(BL, BUFVC) enthusiasm of GLAM sector (good work at V+A

and Sci Museum) opportunties for enriching metadata via

crowdsourcing

Page 18: Digitised Content in an API world

long-term challenges

getting people to build and document and sustain APIs; explaing to collection curators how and why to do it

(some) publishers suspicious getting people to build interfaces on top of APIs;

technical knowledge required to do so quality of metadata; who owns enriched metadata? business models unclear; related licencing interoperability between APIs? citation academic scepticism + misunderstanding

{the web changes}