digital humanities project: visual analytic tool for human rights remembrance, education, and...

Post on 27-Jan-2015

108 Views

Category:

Education

6 Downloads

Preview:

Click to see full reader

DESCRIPTION

This is a work-in-progress project in digital humanities. We have developed a visual analytic prototype to facilitate the analysis of curated video/audio interview testimonies about human rights. The curated data are stored in Stories Matter, an open source database developed by The Centre for Oral History and Digital Storytelling (COHDS) at Concordia University http://storytelling.concordia.ca/

TRANSCRIPT

14-­‐03-­‐04   1  

Stories  Ma+er  database  -­‐  an  Example  of  Digital  Cura3on  Effort  in  Oral  History  Research  Community    

Cura0ng  Tes0mony:  a  design-­‐based  partnership  in  crea0ng  digital  environment  for  human  rights  remembrance,  educa0on,  and  research  

Our  Par-cipatory  Design  Approach  …  

Data

VisualizationInteractive Graphs and

Images

DiscoverPatterns & Knowledge:

CategoriesClusterModelRules..

Perception

End Users

New Knowledge

Adjust and Refine the visualization and data mining strategy

Exploration and Analysis

                       

Data  Mining/Natural  Language  Processing  

                       

Interac-ve  Data/Informa-on  Visualiza-on  

Our  Technical  Solu-on  

Par0cipatory  Design  Approach  

14-­‐03-­‐04   3  

•  Ini0al  design  mee0ngs  –  Introduc0on  of  visual  analy0cs  (example,  defini0on)  

Par0cipatory  Design  Approach  

14-­‐03-­‐04   4  

•  Ini0al  design  mee0ngs  –  Interpreta0on  of  Stories  MaKer  structure  

Par0cipatory  Design  Approach  

14-­‐03-­‐04   5  

•  Ini0al  design  mee0ngs  – Three  Design  Ideas  

U1

Project  1

U2

U3

U4

U5

Project  2

Project  6

Project  3

Project  4

Project  5

I S

C I

I S

C

C C C

I

S C

S

C

I

I

C

C

C

C

C C

C

I

I

S

I

CKM: Clock-based Keyphrases Map

Automa0c  Keyphrases  Extrac0on  

•  Keyphrases  Extrac0on  Algorithm  (KEA)  –  TF  &  IDF  and  First  Occurrence  as  two  main  features  – Naïve  Bayes  as  the  discrimina0ve  model    

•  Ra0onales  for  KEA:  –  Start  point  –  Computa0onal  effec0ve  and  efficient  – A  flexible  framework  for  future  extension  and  refinement  

•     

Characteristics

Text datapreprocessing

Stories Matter Database

Metadata extraction

Visual Entities

KEA Training

Trainingdata

Keyphrases

GMLs Visualization graphs

Model

TestingData

Data Entry

Data Entry

Extracting time, people, location, etc

Generating Training Data

Generate Testing Data

Generating Model

KEA predicting

Top rank keyphrases

Interactions and new knowledge from users

Data source management Keyphrases Map VisualizationAutomatical Keyphrases Extraction

YouTube  Demo:  search  for  “Clock-­‐based  Map”    hKp://www.youtube.com/watch?v=5Yy1xCjC-­‐Hk  

Informa-on  Retrieval   Pa+ern  Iden-fica-on  

Informa-on  Sharing   Collabora-on  Support  

Cura-ng  Tes-mony  –  a  Digital  Environment  for  

Human  Rights  Remembrance,  Educa-on,  

and  research  

Our  Methodology:  Design-­‐based  partnership;  Par-cipatory  Design  

– Team:    •  Universi0es:    

–  Concordia  University,  Centre  for  Oral  History  and  Digital  Storytelling    (Stories  MaKer  database)  

– Western  University,  Human-­‐Informa0on  Interac0on  Lab,  Department  of  Computer  Science  

•  Non-­‐Profits:    –  Page  Rwanda  (www.pagerwanda.ca)  –  Hablacentro  (www.hablacentro.com)  

Languages  of  the  Curated  Tes0mony:  English,  French,  Spanish

Cura0ng  Tes0mony:  a  design-­‐based  partnership  in  crea0ng  digital  environment  for  human  rights  remembrance,  educa0on,  and  research  

Acknowledgement

•  SSHRC  “Digging  into  Data”  grant  (2011)  •  Research  team:  Yan  Luo,  Steven  High

top related