big rdf data cleaning - da qcri

37
Big RDF Data Cleaning Nan Tang

Upload: others

Post on 27-Mar-2022

1 views

Category:

Documents


0 download

TRANSCRIPT

big.rdf.data.cleaning.key2
2
2
2
typos normalization
incomplete inconsistent
typos normalization
incomplete inconsistent
typos normalization
incomplete inconsistent
Knowledge Matching
Nan Tang QCRI Senior Scientist Doha
Nan Tang CWI Postdoc Netherlands
Stratos Idreos CWI PhD. Amsterdam
Knowledge Matching
Nan Tang QCRI Senior Scientist Doha
Nan Tang CWI Postdoc Netherlands
Stratos Idreos CWI PhD. Amsterdam
Knowledge Matching
Nan Tang QCRI Senior Scientist Doha
Nan Tang CWI Postdoc Netherlands
Stratos Idreos CWI PhD. Amsterdam
Knowledge Matching
Nan Tang QCRI Senior Scientist Doha
Nan Tang CWI Postdoc Netherlands
Stratos Idreos CWI PhD. Amsterdam
Knowledge Matching
Nan Tang QCRI Senior Scientist Doha
Nan Tang CWI Postdoc Netherlands
Stratos Idreos CWI PhD. Amsterdam
Nan Tang
Stratos Idreos
CWI QCRI
D (Pirates)
E (Afrikaans)
D (Pirates)
E (Afrikaans)
Detect
GenFix
Detect
GenFix
Detect
GenFix
La ye r
9
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
BIGDANSING in Action: Repair RDF Data
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
Paul William
Sally UCLA
Sally William
John William
John UCLA
S O1 O2 Paul William Yale John William UCLA Sally William UCLA
Scope (S, O,
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
Paul William
Sally UCLA
Sally William
John William
John UCLA
S O1 O2 Paul William Yale John William UCLA Sally William UCLA
O1 S O2 William Paul Yale William John UCLA William Sally UCLA
Scope (S, O,
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
Paul William
Sally UCLA
Sally William
John William
John UCLA
S O1 O2 Paul William Yale John William UCLA Sally William UCLA
O1 S O2 William Paul Yale William John UCLA William Sally UCLA
(Paul, Yale, John, UCLA) (Paul, Yale,
Sally, UCLA) (John, UCLA, Sally, UCLA)
Scope (S, O,
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
Paul William
Sally UCLA
Sally William
John William
John UCLA
S O1 O2 Paul William Yale John William UCLA Sally William UCLA
O1 S O2 William Paul Yale William John UCLA William Sally UCLA
(Paul, Yale, John, UCLA) (Paul, Yale,
Sally, UCLA) (John, UCLA, Sally, UCLA)
(Paul, Yale, John, UCLA) (Paul, Yale,
Sally, UCLA)
9
S P O Paul student_in Yale John student_in UCLA Sally student_in UCLA
William professor_in UCLA Paul advised_by William John advised_by William Sally advised_by William
There cannot exist two students in different universities with the same advisor: (Paul, John) and (Paul, Sally)
S O Paul Yale
Paul William
Sally UCLA
Sally William
John William
John UCLA
S O1 O2 Paul William Yale John William UCLA Sally William UCLA
O1 S O2 William Paul Yale William John UCLA William Sally UCLA
(Paul, Yale, John, UCLA) (Paul, Yale,
Sally, UCLA) (John, UCLA, Sally, UCLA)
(Paul, Yale, John, UCLA) (Paul, Yale,
Sally, UCLA)
•Interactive RDF data cleaning