user-centered data science for digital humanities
TRANSCRIPT
Victor de Boer
User-centered Data Science for Digital HumanitiesDIVE, Dutch Ships and Sailors and ArchimediaL
Digital Humanities
Part of the effort of humanities researcher is moved from the physical archives to digital ones
New possibilities for humanities research
Img:www.doaks.org, www.dkrz.de
Integrating collections as Linked Data
Tools built on top of the data
Continuous
enrichmentEmbed in humanities methodology
Continuous collection enrichmentMultimedia analysis (image, text, video)
Human computation
Linked Data
Human-based computation
Nichesourcing CrowdsourcingProfessional annotation
Niche groups of amateur experts with shared characteristics
Dutch Ships and Sailors
(semi-) automatically establish links between datasets and to external sources
dss:Recordgzmvoc:Telling
gzmvoc:telling-1046-De_Berkel
__bnode_1
gzmvoc:aziatischeBemanning
dss:Shipgzmvoc:Schip
gzmvoc: schip-1046-De_Berkel
dss:has_shipgzmvoc:schip
"1046"
“Schip”
“De Berkel”
rdfs:labeldss:scheepsnaam
gzmvoc:scheepsnaam
dss:ShipTypegzmvoc:Scheepstype
gzmvoc: type-Shipdss:has_shiptype
gzmvoc:has_shiptype
gzmvoc:scheepstype
“21”
“Moorsemattroosen”
dss:azRegistratieKop
gzmvoc:azAantalMatrozen
gzmvoc:telling
gzmvoc:heeft DAS heenreis
dss:Recorddas:Voyage
das:voyage-1918_61
LocationsRanksShip typesVoyages…
Men
tion
ed in
Novel data analysis and visualisation
DIVE+INTO THE EVENT-ENRICHED
LINKED OPEN CULTURAL HERITAGE
Access to Integrated Online Multimedia collectionsusing Linked Open Data
Interactive Exploration & Discovery in Contextlinking objects to events and entitiesbuilding automatic storylines (narratives)
DIVE+
OPENIMAGES.EU
3,220 news broadcasts
Netherlands Institute for Sound & Vision
GTAA thesaurus
DELPHER.NL
197,199 Scans of Radio bulletins
1937 – 1984
AMSTERDAM MUSEUM
73,447 cultural heritage objects
AM Thesaurus
TROPENMUSEUM
78,270 cultural heritage objects
SVNC thesaurus
Collections and Vocabularies
Hybrid enrichment pipeline
ENTITY EXTRACTION
EVENTS CROWDSOURCING AND LINKING
TO CONCEPTS THROUGH
CROWDTRUTH.ORG
SEGMENTATION & KEYFRAMES
LINKING EVENTS AND
CONCEPTS TO
KEYFRAMES
DIVE:MediaObject
Nieuws uit Indonesië:
opheffing van het KNIL
dive:depictedBy
sem:hasTimestamp
sem:Event
ANP:1950-08-11:50
dive:isRelatedTodive:relatedPlacesem:hasPlace
dive:isRelatedTodive:relatedActorsem:hasActor
dive:isRelatedTodive:relatedPlacesem:hasPlace
sem:Time
25 Juli 1950
dive:depictedBy
sem:hasTimestamp
DIVE:MediaObject
Mannen bij het huis van Paul Spies
aan de Parapattan 42, Djakarta
dive:depictedBy
dive:depictedBy
dive:depictedBy
DIVE:MediaObject
ANP:1950-08-11:50DIVE:MediaObject
Schaal
sem:Time
11 Augustus 1950
sem:Event
ontbindingsceremonie
sem:Place
Djakarta
sem:Place
Indonesië
sem:Actor
Mohammad Hatta
Integration of Heterogeneous Collections
Innovative exploratory UI
diveplus.beeldengeluid.nl
ArchiMediaLDeveloping Post-colonial Interpretations of Built Form
through Heterogeneous Linked Digital Media
Computer Vision + crowdsourcing
How to identify (elements of) buildings across different representations
Flexible data model allows for multi-interpretation
Continuous enrichment and linking of heterogeneous collections brings new possibilities for access, analysis
Using automatic methodsAlways with human(s) in the loop
Victor de [email protected]