swt lecture session 1 - introduction
DESCRIPTION
Lecture 1 for the course Semantic Web Technologies (presented at Free University of Bozen Bolzano, 2013)TRANSCRIPT
![Page 1: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/1.jpg)
+
Semantic Web Technologies 2012-2013Part I
Mariano Rodriguez-Muro, Free University of Bozen-Bolzano
![Page 2: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/2.jpg)
+Disclaimer
License
This work is licensed under the Creative Commons Attribution-Share Alike 3.0 License http://creativecommons.org/licenses/by-sa/3.0/
![Page 3: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/3.jpg)
+Intro
Course organization
Intro to Semantic Web
Intro to Semantic Technologies
![Page 4: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/4.jpg)
+
Course organization
![Page 5: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/5.jpg)
+About me
Research interests:
Techniques for query answering optimization
SPARQL, Big RDFS, virtual RDF
Data integration with Semantic Tech and SemTech in the enterprise.
Mariano Rodríguez-MuroAssistant Professor at KRDBFaculty of computer Science (POS Building, 202)Tel. +390471016228rodriguez =at= inf.unibz.it
![Page 6: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/6.jpg)
+About you
Which program?
Which semester?
Why are you here? Topic is mandatory Topic relates to my area Looking for project/thesis? Just Interesting? Need some credits?
Special interests?
![Page 7: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/7.jpg)
+Course organization (Part I)
Website: http://rodriguez-muro.com/courses/index.php?title=SWT12
Moodle …
Schedule Lecture: Tuesday:10:30 am to 12:30 pm Lecture: Thursday 8:30 am to 10:30 am Lab: Tuesday 2:00 to 4:00 pm
Office Hours With appointment Please use forums as main means of comunication
![Page 8: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/8.jpg)
+Reference Material
Slides, Papers
Foundations of Semantic Web. Pascal HItzler, Markus Krotzsch and Sebastian Rudolph. Chapman & Hall/CRC, 2010. (Code FSW)
Semantic Web Programming. John Hebeler et. al. Wiley. 2009. (Code SWP)
Programming the Semantic Web. Toby Segaran, Colin Evans and Jamie Taylor. O’Reilly. 2009. (Code PTSW)
Available at the library. SWP and PTSW available as ebooks.
![Page 9: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/9.jpg)
+Grading
Part I 50%, Part II 50%
Grading Part I Lab exercises: 15% Mid-term: 35%
Exercises: Each week a new assignment. All assignments are graded. All assignments are mandatory. Delivery must be done by the next week. Java and SQL/JDBC is required. Projects must be packaged with Maven.
Midterm. Covers all material seen during the lectures. From slides, presentation and selected book chapters/readings (marked at the end of each slide)
![Page 10: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/10.jpg)
+
IntroductionSemantic Web
![Page 11: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/11.jpg)
+Web of Documents
Primary objects: documents Links between documents
Degree of structure in data: low
Semantics of content:Implicit
Designed for: human consumption
![Page 12: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/12.jpg)
+Web of documents: The problem
![Page 13: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/13.jpg)
+Example: Elvis
![Page 14: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/14.jpg)
![Page 15: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/15.jpg)
![Page 16: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/16.jpg)
+Web of data: The problem
How about this query: How many romantic comedy Hollywood movies are directed by
a person who is born in a city that has average temperature above 15 degrees!?
You need to: Find reliable sources containing facts about movies (genre &
director), birthplaces of famous artists/directors, average temperature of cities across the world, etc. The result: several lists of thousands of facts
Integrate all the data, join the facts that come from heterogeneous sources
Even if possible, it may take days to answer just a single query!
![Page 17: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/17.jpg)
+
The VisionI have a dream for the Web in which computers become capable of analyzing all the data on the Web - the content, links, and transactions between people and computers. A Semantic Web, which should make this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to ma- chines. The intelligent agents people have touted for ages will finally materialize.
Barners-Lee, 1999
![Page 18: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/18.jpg)
+The semantic web
Primary objects: things Links between: things
Degree of Structure: high
Explicit semantics of contents and links
Designed for both machines and humans
![Page 19: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/19.jpg)
+Web of data
![Page 20: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/20.jpg)
+
Semantic Technologies
![Page 21: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/21.jpg)
+Not only about the web
The semantic web vision has generated technologies that are applied outside the web context including: Enterprise intelligence Government Research (Bio, Geo, Cultural heritage, etc.) Software development …
Semantic technologies provide flexible and powerful tools to accomplish things that were not possible or not practical in the past.
![Page 22: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/22.jpg)
+Introduction to the Semantic Web approach
22
How does a Semantic Web approach help us merge data sets, infer new
relations, and integrate outside data sources?
![Page 23: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/23.jpg)
+The rough structure of data integration with SWT
1. Map the various data onto an abstract data representation
• Make the data independent of its internal representation…
2. Merge the resulting representations
3. Start making queries on the whole• Queries not possible on the individual data sets
23
![Page 24: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/24.jpg)
+Data set “A”: A simplified book store
24
ID Author Title Publisher Year
ISBN0-00-651409-X
id_xyz The Glass Palace id_qpr 2000
ID Name Home page
id_xyz Ghosh, Amitav http://www.amitavghosh.com
ID Publisher Name
City
id_qpr Harper Collins London
Books
Authors
Publishers
![Page 25: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/25.jpg)
+1st: Export your data as a set of relations
25
![Page 26: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/26.jpg)
+Some notes on the data export
Data export does not necessarily mean physical conversion of the data
Relations can be virtual, generated on-the-fly at query time
via SQL “bridges” scraping HTML pages extracting data from Excel sheets etc.
One can export part of the data
26
![Page 27: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/27.jpg)
+
A B D E
1 ID Titre Original
2
ISBN0 2020386682 A13 ISBN-0-00-651409-X
3
6 ID Auteur7 ISBN-0-00-651409-X A12
11
12
13
TraducteurLe Palais des miroirs
NomGhosh, AmitavBesse, Christianne
Data set “F”: Another book store’s data
27
![Page 28: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/28.jpg)
+2nd: Export your second set of data
28
![Page 29: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/29.jpg)
+3rd: start merging your data 29
![Page 30: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/30.jpg)
+3rd: start merging your data (cont’d) 30
![Page 31: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/31.jpg)
+4th: Merge identical resources 31
![Page 32: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/32.jpg)
+Start making queries…
User of data set “F” can now ask queries like: “What is the title of the original version of Le Palais des
miroirs?”
This information is not in the data set “F”...
…but can be retrieved after merging with data set “A”!
32
![Page 33: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/33.jpg)
+5th: Query the merged data set 33
![Page 34: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/34.jpg)
+However, more can be achieved…
We “know” that a:author and f:auteur are really the same
But our automatic merge does not know that!
Let us add some extra information to the merged data: a:author is equivalent to f:auteur Both identify a Person, a category (type) for certain
resources a:name and f:nom are equivalent to foaf:name
34
![Page 35: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/35.jpg)
+3rd revisited: Use the extra knowledge 35
![Page 36: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/36.jpg)
+Start making richer queries! User of data set “F” can now query:
“What is the home page of Le Palais des miroirs’s ‘auteur’?”
The information is not in data set “F” or “A”…
…but was made available by: Merging data sets “A” and “F” Adding three simple “glue” statements
36
![Page 37: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/37.jpg)
+6th: Richer queries 37
![Page 38: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/38.jpg)
+Bring in other data sources
We can integrate new information into our merged data set from other sources e.g. additional information about author Amitav Ghosh
Perhaps the largest public source of general knowledge is Wikipedia Structured data can be extracted from Wikipedia using
dedicated tools
May 12, 2009
38
![Page 39: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/39.jpg)
+7th: Merge with Wikipedia data 39
owl:sameAs
![Page 40: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/40.jpg)
+7th (cont’d): Merge with Wikipedia data 40
owl:sameAs
![Page 41: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/41.jpg)
+7th (cont’d): Merge with Wikipedia data41
owl:sameAs
![Page 42: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/42.jpg)
+Is that surprising?
It may look like it but, in fact, it should not be…
What happened via automatic means is done every day by Web users!
The difference: a bit of extra rigour so that machines could do this, too
42
![Page 43: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/43.jpg)
+What did we do?
We combined different data sets that ...may be internal or somewhere on the Web ...are of different formats (RDBMS, Excel spreadsheet, (X)HTML,
etc) ...have different names for the same relations
We could combine the data because some URIs were identical i.e. the ISBNs in this case
We could add some simple additional information (the “glue”) to help further merge data sets
The result? Answer queries that could not previously be asked
43
![Page 44: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/44.jpg)
+What did we do? (cont’d)
44
![Page 45: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/45.jpg)
+The abstraction pays off because…
…the graph representation is independent of the details of the native structures
…a change in local database schemas, HTML structures, etc. do not affect the whole “schema independence”
…new data, new connections can be added seamlessly & incrementally
… it doesn’t matter if you are at the enterprise level or at the web level
45
![Page 46: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/46.jpg)
+So where is the Semantic Web?
46
Semantic Web technologies make such integration possible
![Page 47: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/47.jpg)
+
Semantic TechnologiesToday: Applications, Use cases, Technologies, Systems
![Page 48: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/48.jpg)
+Web of data today
![Page 49: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/49.jpg)
+Semantics today
Linked-in
Schema.org
Good-relations
Oracle (Server)
IBM (DB2, Watson)
Apple (Siri)
SAP
Evri, Linked-in, many startups
Many deployed systems
![Page 50: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/50.jpg)
+Semantic Web Technologies
A set of technologies and frameworks that enable semantic data management, data integration and the web of data Resource Description Framework (RDF) A variety of data interchange formats (e.g., RDF/XML, N3,
Turtle, N-Triples) Semantic languages such as RDF Schema (RDFS) and the
Web Ontology Language (OWL) and Rules (SWRL) Query language (SPARQL) Software infrastructure (RDF/SPARQL frameworks, Triple
stores, Data integrators, Query engines, Reasoners) Publicly available connected dataset and open data
initiatives (LOD)
![Page 51: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/51.jpg)
+SWT Part I
The Data Model (RDF)
The query language (SPARQL)
Software Development (Architecture, Frameworks and Tools)
A little more semantics (RDFS, inference techniques, tools and data integration)
Interacting with the enterprise (Legacy sources, XML, DBMS, mappings)
More complex semantics (Rules, data integration and reasoning with rules)
![Page 52: SWT Lecture Session 1 - Introduction](https://reader033.vdocument.in/reader033/viewer/2022061218/54b7ab724a795993718b4973/html5/thumbnails/52.jpg)
+Reading material
PTSW Chapter 1
SWP Part I, Chapter 1
FTW Section 1.4