digital humanities project: visual analytic tool for human rights remembrance, education, and...
DESCRIPTION
This is a work-in-progress project in digital humanities. We have developed a visual analytic prototype to facilitate the analysis of curated video/audio interview testimonies about human rights. The curated data are stored in Stories Matter, an open source database developed by The Centre for Oral History and Digital Storytelling (COHDS) at Concordia University http://storytelling.concordia.ca/TRANSCRIPT
14-‐03-‐04 1
Stories Ma+er database -‐ an Example of Digital Cura3on Effort in Oral History Research Community
Cura0ng Tes0mony: a design-‐based partnership in crea0ng digital environment for human rights remembrance, educa0on, and research
Our Par-cipatory Design Approach …
Data
VisualizationInteractive Graphs and
Images
DiscoverPatterns & Knowledge:
CategoriesClusterModelRules..
Perception
End Users
New Knowledge
Adjust and Refine the visualization and data mining strategy
Exploration and Analysis
Data Mining/Natural Language Processing
Interac-ve Data/Informa-on Visualiza-on
Our Technical Solu-on
Par0cipatory Design Approach
14-‐03-‐04 3
• Ini0al design mee0ngs – Introduc0on of visual analy0cs (example, defini0on)
Par0cipatory Design Approach
14-‐03-‐04 4
• Ini0al design mee0ngs – Interpreta0on of Stories MaKer structure
Par0cipatory Design Approach
14-‐03-‐04 5
• Ini0al design mee0ngs – Three Design Ideas
U1
Project 1
U2
U3
U4
U5
Project 2
Project 6
Project 3
Project 4
Project 5
I S
C I
I S
C
C C C
I
S C
S
C
I
I
C
C
C
C
C C
C
I
I
S
I
CKM: Clock-based Keyphrases Map
Automa0c Keyphrases Extrac0on
• Keyphrases Extrac0on Algorithm (KEA) – TF & IDF and First Occurrence as two main features – Naïve Bayes as the discrimina0ve model
• Ra0onales for KEA: – Start point – Computa0onal effec0ve and efficient – A flexible framework for future extension and refinement
•
Characteristics
Text datapreprocessing
Stories Matter Database
Metadata extraction
Visual Entities
KEA Training
Trainingdata
Keyphrases
GMLs Visualization graphs
Model
TestingData
Data Entry
Data Entry
Extracting time, people, location, etc
Generating Training Data
Generate Testing Data
Generating Model
KEA predicting
Top rank keyphrases
Interactions and new knowledge from users
Data source management Keyphrases Map VisualizationAutomatical Keyphrases Extraction
YouTube Demo: search for “Clock-‐based Map” hKp://www.youtube.com/watch?v=5Yy1xCjC-‐Hk
Informa-on Retrieval Pa+ern Iden-fica-on
Informa-on Sharing Collabora-on Support
Cura-ng Tes-mony – a Digital Environment for
Human Rights Remembrance, Educa-on,
and research
Our Methodology: Design-‐based partnership; Par-cipatory Design
– Team: • Universi0es:
– Concordia University, Centre for Oral History and Digital Storytelling (Stories MaKer database)
– Western University, Human-‐Informa0on Interac0on Lab, Department of Computer Science
• Non-‐Profits: – Page Rwanda (www.pagerwanda.ca) – Hablacentro (www.hablacentro.com)
Languages of the Curated Tes0mony: English, French, Spanish
Cura0ng Tes0mony: a design-‐based partnership in crea0ng digital environment for human rights remembrance, educa0on, and research
Acknowledgement
• SSHRC “Digging into Data” grant (2011) • Research team: Yan Luo, Steven High