linked data - metadataetc.orgmetadataetc.org/lod/5star.pdf · online coins of the roman empire...
TRANSCRIPT
4/23/18
1
An Introduction
Linked Data1 2 3 4 5
1--Onegoal2– Twotypesofquestions3– RDFtriples4– Fourprinciples5– FivestarLOD
LearnbyUnderstanding
LearnbyAnalyzing
MarciaZeng,2018DIS 1
SirTimBerners-Lee,theinventoroftheWWWandtheinitiatorofLinkedData,presentedaStarSchemefor
measuringtherankofadataset:
2
Five-StarLOD ★★★★★5
https://www.w3.org/DesignIssues/LinkedData.htmlMarciaZeng,2018DIS
4/23/18
2
Cases:UsingLODintheLAMs*1. SpecialCollections,Archives
a. LinkedJazzb. OnlineCoinsoftheRomanEmpire(OCRE)
2. Bibliographicdataa. WorldCatb. TheBritishNationalBibliography(BNB)
3. Knowledgeorganizationsystems(KOS)- thesauri,nameauthorities,andothersa. FASTb. GettyVocabs
4. DigitalScholarships– VIVObased- Scholars@Cornell
3
LearnbyAnalyzing
*LAMs=Libraries,archives,andmuseumsMarciaZeng,2018DIS
1. SpecialCollections,Archives
a.LinkedJazz
Theprojectfocusesondigitalizedarchivesofjazzhistorytoexposerelationshipsbetweenmusiciansandrevealtheircommunity’snetwork.
http://linkedjazz.org/
http://linkedjazz.org/network/4MarciaZeng,2018DIS
4/23/18
3
MethodologySummary:• Anaturalprocessingtoolpullsexcerptsfromtranscriptsofinterviewswithjazz
musicians thatmentionarelationship withanotherjazzmusician.• Aftertheprocessofcontrollingsynonymsandeliminatingambiguity,themusician
namesweremappedtotheDBpedia,anddata abouteachpersonwasobtained.• Therelationshipswerepresentedbasedonanontology.• Avisualizationtoolwasusedtopresentauniqueinteractiveinterface.
5MarciaZeng,2018DIS
6
•VisualizedresultinGephi.(DatamashupbetweenLinkedJazzmusiciansandCarnegieHallperformers.)
LinkedJazzandCarnegieHall (Indevelopment)About
http://pfch.nyc/linked_jazz_meets_carnegie_hall/CH-LJ_network/index.html#Mary%20Lou%20Williams
MarciaZeng,2018DIS
4/23/18
4
(cont.)1.SpecialCollections,Archivesb.OnlineCoinsoftheRomanEmpire(OCRE)
7
http://numismatics.org/ocre/
MarciaZeng,2018DIS
8
• Ontologicalclasses-- browse1.bOCRE
• Modelinginanontology(formedinclasses,properties,relationships);
• FollowingLinkedDataprinciples;• UsingRDFtriplesforentities;• QueryinginSPARQLlanguage.
MarciaZeng,2018DIS
4/23/18
5
9http://numismatics.org/ocre/id/ric.6.lon.66
Foranindividualobject,ausercanfindauto-generateddatarelatedtoit,themap(s),andquantitativeanalysis.
MarciaZeng,2018DIS
1.bOCRE
10
• Visualizeyourquerieson-the-flyHow?http://wiki.numismatics.org/numishare:visualize
• UsingSPARQLqueriestofind;• Auto-Visualizing;• Auserdoesnotneedtosee
oruseSPARQLlanguage.
MarciaZeng,2018DIS
1.bOCRE
4/23/18
6
11
• Interactwithamap
MarciaZeng,2018DIS
1.bOCRE
MarciaZeng,2018DIS 12http://numismatics.org/ocre/
1.bOCRE
4/23/18
7
Formoreinformation• Webinar:EthanGruber:From0to60onSPARQLqueriesin50minutes,May
13,2015WatchtheYouTube:https://www.youtube.com/watch?v=3YhG5QQmhvU
• EthanGruber’swebpagehttp://numismatics.org/ethangruber/– WhereyoucanconnecttohisGithub,https://github.com/ewg118– SPARQLquerieshttps://gist.github.com/ewg118
• AFinalreportsubmittedtothefunder,NEH,2017– http://www.dayofarchaeology.com/final-report-to-the-neh-for-online-coins-of-the-roman-empire/
13
OnlineCoinsoftheRomanEmpire(OCRE)
MarciaZeng,2018DIS
Cases:UsingLODintheLAMs*1. SpecialCollections,Archives
a. LinkedJazzb. OnlineCoinsoftheRomanEmpire(OCRE)
2. Bibliographicdataa. WorldCatb. TheBritishNationalBibliography(BNB)
3. Knowledgeorganizationsystems(KOS)- thesauri,nameauthorities,andothera. FASTb. GettyVocabs
4. DigitalScholarships– VIVObased- Scholars@Cornell
14
LearnbyAnalyzing
*LAM=Libraries,archives,andmuseumsMarciaZeng,2018DIS
4/23/18
8
http://www.worldcat.org/oclc/246662790
http://viaf.org/viaf/102333412
schema:creator
creatorPrideandPrejudice
MarciaZeng,2018DIS 15
Case:2.aWorldCat
16
Case2.bTheBritishNationalBibliography(BNB)-- usesitsownontology
About:• TheBritishLibraryisthenationallibraryoftheUKandisresponsiblefor
distributingmetadatadescribingitscollections andrecordingUKpublishingoutputintheBritishNationalBibliography(BNB) http://bnb.data.bl.uk.
• In2011,theBritishLibrarybeganpublishingaLODversionoftheBNBaspartofitsopenmetadatastrategy.ThemovetoLODBNBprovedinfluentialamongthelibrarycommunityinmovingtheLinkedData‘debate’fromtheorytopractice.
• TheLODBNBhascontinuedtoevolvewithregularmonthlyupdates,theinclusionofnewlinks(e.g.totheISNI)andcontent(e.g.serials).
Try:• GotoitsFlint Sparql Endpointat:http://bnb.data.bl.uk/flint-sparq• Usethesamplequeriestoseeexamples.• Trytoformyourownqueriesandgetdifferentdatasets.
Read:• Howtogetthebulkdownloadhttp://www.bl.uk/bibliographic/download.html
MarciaZeng,2018DIS
4/23/18
9
References: Lists of all classes, properties, and prefixes of the metadata vocabularies used by BNB.
2. Results in Plain text. Other output options are XML and JSON.
1. Query text for “Which titles by detective writer Ian Rankin appear in the BNB?”
SELECT*WHERE{?s?p?o}
1
2
17
Cases:UsingLODintheLAMs*1. SpecialCollections,Archives
a. LinkedJazzb. OnlineCoinsoftheRomanEmpire(OCRE)
2. Bibliographicdata– WorldCat
3. Knowledgeorganizationsystems(KOS)- thesauri,nameauthorities,andothera. FASTb. GettyVocabs
4. DigitalScholarships– VIVObased- Scholars@Cornell
18
LearnbyAnalyzing
*LAM=Libraries,archives,andmuseumsMarciaZeng,2018DIS
4/23/18
10
Source:extractedscreenshots(2017-07-12)Fromhttp://fast.oclc.org/searchfast/
19
3.KOSa.FAST
MarciaZeng,2018DIS
Source:extractedscreenshots(2017-07-12)athttp://experimental.worldcat.org/fast/35588/rdf.xml
JohnF.Kennedy’sentryinFASTisenrichedwithothersources.
• TheDBpedia identifiersallowFASTtermstoincludedetailedinformationthatisusuallyexcludedinauthorityrecords.
• TheVIAFURIallowsFASTtermstotakeadvantageofallofthevariousstringvaluesincludedinVIAFwithouthavingtomanuallyincludethevaluesintheRDFtriplesforthespecificterm.
20MarciaZeng,2018DIS
3.KOSa.FAST
4/23/18
11
Imagesource:CapturedAug.2017.http://experimental.worldcat.org/mapfast/
TheGeoNamesdataisusedtopowerMapFAST,whichisaGoogleMapsmash-up.
21MarciaZeng,2018DIS
3.KOSa.FAST
Cases:UsingLODintheLAMs*1. SpecialCollections,Archives
a. LinkedJazzb. OnlineCoinsoftheRomanEmpire(OCRE)
2. Bibliographicdata– WorldCat
3. Knowledgeorganizationsystems(KOS)- thesauri,nameauthorities,andothersa. FASTb. GettyVocabs
4. DigitalScholarships– VIVObased- Scholars@Cornell
22
LearnbyAnalyzing
*LAM=Libraries,archives,andmuseumsMarciaZeng,2018DIS
4/23/18
12
23
http://vocab.getty.edu/sparql
http://vocab.getty.edu
http://vocab.getty.edu/queriesMarciaZeng,2018DIS
3.KOSb.GettyVocabs
GettyVocabsLOD
AATTGNULAN[CONA]
We will try them at the Hands-on session.
Atthequerytemplatespage
FindthesectionforULAN.
Therearemanyinterestingqueryexamples.
24
http://vocab.getty.edu/queries
Nameauthoritiesofferfoundationalstructureddatafornetworkanalyses.
Whatkindsofqueryexamples?
MarciaZeng,2018DIS
3.b.GettyVocabs
ULAN
UnionListofArtistNames
4/23/18
13
25
Howtouseatemplate?(1)Chooseaquery(left),e.g.,#5.2;thetemplateboxwillshowup(lower-right).(2)Atthattemplatebox’supper-rightcorner,clickonthatSPARQLsign;thequerywillautomaticallyjumpuptotheQuerybox(top).(3)Submit.
1
2
3
MarciaZeng,2018DIS
ULAN
26
Theresult:“allassociativerelationshipsofulan:500115493Duerer,Albrecht”(showingaportionoftheresults).
MarciaZeng,2018DIS
ULAN
4/23/18
14
http://vocab.getty.edu/queries#Top-level_Subjects
Browse the examples of queries
You can obtaining special RDF graphs or datasets for very complicated questions, andrevealing unknown relationships.
KOSinLODbecomeknowledgebasesofresearch.
27
TGN
ThesaurusforGeographicNames(TGN)
MarciaZeng,2018DIS
28
1
2
3
Steps: (1) Choose #4.18 query, (2) click on that SPARQL sign in that 4.18 template box. After click on that SPARQL sign, the query should be automatically uploaded to the top box. (3) Submit. Note: Since this is a complicated query, it will run a few seconds. Be patient.
E.g.,LookforcastlesaroundTheNetherlands(withintheboundaryof50.7871853.38972253.5422657.169019)
TGN
MarciaZeng,2018DIS
4/23/18
15
E.g.,LookforcastlesaroundTheNetherlands
4
(4) Download the datasets in a selected format.The best way is to download the csvfile. (5) You should either keep the query in your CSV file or make a note what you searched for and in which boundary.
Finished.6
6
Optional:(6) Click on any castle’s ID, & open the single data record for this concept. (7) Click on the Website to see its normal html view.
7
29
TGN
MarciaZeng,2018DIS
E.g.,Queryaspecificplacetype(e.g.,WorldHeritageSites)inageographicboundary.Gottheresults&downloadabledatasets:
WorldHeritageSiteswithin(24.7508328.9577843.80722108.92861)aroundtheSilkroad.
30
TGN
4/23/18
16
Use a <Guide Term> to obtain all concept URIs
and preferred terms in the hierarchies (for a
microthesaurus or a pick list)
in <xyz>
31
Microthesaurus =designatedsubsetofathesaurus thatiscapableoffunctioningasacompletethesaurus.
-- ISO25964-2:2013
CreateMicrothesauri orpicklistsfromtheGettyLODVocabularies
AAT(cont.)3.KOSb.GettyVocabs
ArtandArchitectureThesaurus(AAT)
MarciaZeng,2018DIS
Cases:UsingLODintheLAMs*1. SpecialCollections,Archives
a. LinkedJazzb. OnlineCoinsoftheRomanEmpire(OCRE)
2. Bibliographicdata– WorldCat
3. Knowledgeorganizationsystems(KOS)- thesauri,nameauthorities,andothera. FASTb. GettyVocabs
4. DigitalScholarships– VIVObased- Scholars@Cornell
32
LearnbyAnalyzing
*LAM=Libraries,archives,andmuseumsMarciaZeng,2018DIS
4/23/18
17
https://scholars.cornell.edu/
Scholars@Cornell
Scholars@Cornelloffersintegratedandindividualizedprofilesabout:• faculty,• instituteunits,
researchdomains,
• collaboratingnetworks,and
• academicoutcomes.
MarciaZeng,2018DIS 33
E.g.,Chooseonefacultymemberorresearcher
34
Forexample,whenlookingforaninformationscienceresearcher,Ifoundaprofessor,SusanR.Fussell.• Whatdoesthisprofile
tellusaboutthisperson?
• Howarethingsconnected?
"Co-Authors”"Co-Investigators”
MarciaZeng,2018DIS
4/23/18
18
• https://scholars.cornell.edu/35
Allinteractive.Datavaluesinvariousontologicalclassesareconnected,integrated.
MarciaZeng,2018DIS
36
VIVO(notanacronym)isawell-knownontology-basedscholarlynetworkinganddiscoverytoolformanaginginformationandknowledgeinlargeinstitutionsandassociations,asdemonstratedbytheVIVO-poweredwebsitese.g.,• ScrippsResearchInstitute,• U.S.DepartmentofAgriculture,• UNAVCO,and• manyothers-seeregistry:
http://duraspace.org/registry/vivo
• InotherVIVO-basedsite,therearealsoMapofScience
MarciaZeng,2018DIS
4/23/18
19
The changing concepts•(seeing from the content)
– From "Web of Documents" to "Web of Data"– From linking strings to linking things– From digitization to datalization
•(seeing from the results)– From "On the Web" to "Of the Web”
37
Summary
MarciaZeng,2018DIS
38
What is Linked Data?
• -- is a term used to describe a method of exposing, sharing, and connecting data on the Web using URIs and RDF
• --is about: • using the Web to connect related data that
was not previously linked, • using the Web to lower the barriers to linking
data currently linked using other methods. [1]
[1] http://linkeddata.org/
MarciaZeng,2018DIS