sharing data on the web (2013)
DESCRIPTION
Toronto Semantic Web MeetupTRANSCRIPT
![Page 2: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/2.jpg)
★
Towards Linked Open Data
● On the Web with an open license
“It's great to have the data accessible on the Web under an open license, however, the data is locked-up in a document. Other than writing a custom scraper, it's hard to get the data out of the document.”
![Page 3: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/3.jpg)
★★
Towards Linked Open Data
● On the Web with an open license● Machine-readable data
“Splendid! The data is accessible on the Web in a structured way (that is, machine-readable), however, the data is still locked-up in a document. To get the data out of the document you depend on proprietary software.”
![Page 4: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/4.jpg)
★★★
Towards Linked Open Data
● On the Web with an open license● Machine-readable data● Non-proprietary format
“Excellent! The data is not only available via the Web but now everyone can use the data easily. On the other hand, it's still data on the Web and not data in the Web.”
![Page 5: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/5.jpg)
★★★★
Towards Linked Open Data
● On the Web with an open license● Machine-readable data● Non-proprietary format● RDF standards
“Wonderful! Now it's data in the Web. The (most important) data items have a URI and can be shared on the Web. A native way to represent the data is using RDF, however other formats such as Atom can be converted/mapped, if required.”
![Page 6: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/6.jpg)
★★★★★
Towards Linked Open Data
● On the Web with an open license● Machine-readable data● Non-proprietary format● RDF standards● Linked RDF
“Brilliant! Now it's data, in the Web linked to other data. Both the consumer and the publisher benefit from the network effect. ”
![Page 7: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/7.jpg)
Path to Success
● Start easy– Well curated– Small dataset
● Reach out to developers● Get others involved early● Ensure internal benefit● Maximize productivity● Integrate related dataset
![Page 8: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/8.jpg)
Choosing a License
● Allow modification?● Require attribution?● Permit commercial uses?● Must others share alike?● What jurisdiction?
![Page 9: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/9.jpg)
Address Data Quality
● Completeness● Conformity● Consistency● Accuracy● Integrity● Eliminate duplication
![Page 10: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/10.jpg)
★
Put It On The Web
● Upload it and share● Document what is available● Document how to use it
– Solve a consumer need● Encourage feedback
– Continues improvement
![Page 11: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/11.jpg)
★★
Make It Machine-readable
● Simplify data structure● Don't include unstable identifiers● Automate publishing
![Page 12: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/12.jpg)
★★★
Use a Non-proprietary Format
● Standardized formats– CSV instead of excel– XML instead of MDB
● Benefits:– Accessibility– Interoperability– Reduce risk of
● Confidential info● Software viruses
![Page 13: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/13.jpg)
★★★
API Service
● Data access layer● Atom Publishing Protocol● Google Data Protocol● Open Data Protocol● Facebook Graph API● Linked Data
![Page 14: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/14.jpg)
★★★
The Web
● Universal unidirectional links using URLs– Graph of nodes identified by URL
● Cooperation without coordination● It's simple nodes and Links
![Page 15: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/15.jpg)
★★★
The Semantic Web
● A universal scalable interconnection bus● Share and reuse data across
– Community boundaries– Application– Enterprise
![Page 16: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/16.jpg)
★★★
(re)Model the Data
● Identify existing model● Reuse common vocabularies● Model data relationships
– Put aside particular needs of any application● Diagram relationships● Test assumptions with subject matter experts
![Page 17: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/17.jpg)
★★★
Universal Identifiers
● Others can reference things● The Web uses URL for
downloadable things● Uses URI for real world things and
abstract concepts● Two references with the same URI
are the same thing● Quick, easy and scalable look-up
![Page 18: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/18.jpg)
★★★★
Publish RDF
● RDF connects better than any other model● Cooperation without coordination● Data can be combined safely● Mix of vocabularies
![Page 19: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/19.jpg)
★★★★
VoID Description
● De facto standard for describing Linked Data● Used in linked data catalogues● Include provenance and attributions● Associate consumers' license
![Page 20: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/20.jpg)
★★★★
RDF Schema
● Understand structure of the graph– Self-describing
● Define terms and relationships
● Classes of things
![Page 21: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/21.jpg)
★★★★
Simple Knowledge Organization
● Controlled vocabularies● Taxonomies● Thesauri
![Page 22: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/22.jpg)
★★★★
Controlled Vocabularies
● Strongly agree● Agree● Somewhat agree● Neither agree nor
disagree● Somewhat disagree● Disagree● Strongly disagree
● Mr.● Ms.● Mrs.● Miss.● Dr.
● AB● BC● MB● NB● NL● NS● ON● PE● QC● SK● NT● NU● YT
![Page 23: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/23.jpg)
★★★★
Taxonomies
![Page 24: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/24.jpg)
★★★★
Thesauri
![Page 25: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/25.jpg)
★★★★
Web Ontology Language
● Property restrictions● Equivalences across
ontologies● Constraints● Automatic reasoners● Annotations
![Page 26: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/26.jpg)
★★★★★
Linked RDF
● Linked your data to others● Adds more related data● Add value to the data
– By joining it to other data
![Page 27: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/27.jpg)
★★★★★
Generating Data Websites
● Give ordinary users direct access the data● Templates to generate HTML
– Generate static HTML offline– Produce HTML on-demand
![Page 28: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/28.jpg)
Social Responsibility
● Responsibility to maintain published data● Publish frequency of data updates● Have a persistence strategy● Ensure data is accurate as possible● Respond to reports of problematic data
![Page 29: Sharing data on the web (2013)](https://reader033.vdocument.in/reader033/viewer/2022042814/554b8456b4c90561588b4f60/html5/thumbnails/29.jpg)
Comments or Questions?