achieving time effective federated information from scalable rdf data using sparql queries
TRANSCRIPT
Achieving Time Effective Federated Information from scalable RDF data using
SPARQL Queries
ByD.Teja Santosh, Assistant ProfessorComputer Science and Engineering
GITAM University,Rudraram, Hyderabad
04/13/23 NCACPR-14 1
Aim: To retrieve federated information from scalable RDF data using SPARQL query as a “Global Service” in very less time.
Technologies Used:RDF, SPARQL,OWL.
Novelty of the concept: Integrating heterogeneous data from different databases is shown using
a single RDF data file. Retrieving federated data using SPARQL query as a Global Web Service.
04/13/23 NCACPR-14 2
My LINKED IN Profile
04/13/23 NCACPR-14 3
My International Conf. Title
04/13/23 NCACPR-14 4
Web 2.0 Architecture
04/13/23 NCACPR-14 5
Issues in integrating two different web services
• Different platforms.• Different languages/technologies.• Individual XML data viewed as Tree that may lead to
inconsistent responses when integrated.• Availability of web services.
04/13/23 NCACPR-14 6
Web 3.0 Architecture
04/13/23 NCACPR-14 7
How RDF overcomes this issue?
• The RDF model is made up of triples: subject-predicate-object.
• These triples are uniquely identified on the web through URI. [Like “PASSPORT NUMBER” to uniquely identify a person across the real world].
• This lets machines understand human knowledge statements. [Computer saying: Oh!]
• The RDF model is essentially the canonicalization of a (directed) graph, and so as such has all the advantages (and generality) of structuring information using graphs
• Any number of author profile and corresponding conference title data (federation) in combined format are linked and retrieved due to the query pointing to the generic subject node(s).
04/13/23 NCACPR-14 8
RDF Data Graph
04/13/23 NCACPR-14 9
RDF Tabulator Screenshot
04/13/23 NCACPR-14 10
SPARQL• I call SPARQL as a test bed which makes us to have clear idea about the result
accuracy (as a Web 3.0 learner).
• Queries RDF data. If your data is in RDF, then SPARQL can query it natively.
• Implicit join syntax. SPARQL queries RDF graphs, which consist of various triples expressing binary relations between resources. As all relationships are of a fixed size and data lives in a single graph, SPARQL does not require explicit joins that specify the relationship between differently structured data.
• The SPARQL query above has a similar structure:
SELECT <variable list>WHERE {<graph pattern> }
• FROM is used as a Base URL of the RDF Triple Store.
04/13/23 NCACPR-14 11
SPARQL QUERY IN TWINKLE
04/13/23 NCACPR-14 12
FEDERATED INFORMATION OF LINKEDIN AUTHOR AND ICHCI INTL. CONF. TITLE OF THE SAME PERSON DATA
SPARQL QUERY PROCESSING• SPARQL queries are executed against RDF datasets, consisting of RDF graphs.
• A SPARQL endpoint accepts queries and returns results via HTTP.
• SPARQL endpoints will query any Web-accessible RDF data.
• The results of SPARQL queries can be returned and/or rendered in a variety of formats:
– XML– JSON– RDF– HTML– CSV
04/13/23 NCACPR-14 13
ANALYSIS OF RDF DATA GRAPH FOR FEDERATED QUERY I. SCALABLE TRIPLES VISUALIZATION
04/13/23 NCACPR-14 14
II. RESPONSE TIME OF SPARQL QUERY ON ‘n’ (VIRTUAL) OBJECTS
04/13/23 NCACPR-14 15
SELECTIVITY: • sel(t) = sel(s) * sel(p) * sel(o)• sel(s) = 1/R, R - No. of Resources.• sel(p) = Tp/T, T – Total No. of triples, Tp – Triples matching predicate p.
• sel(o) = hc(p,oc)/Tp, where (p,oc) represents the class of the histogram for predicate p in which object o falls.
INFERENCE:When Selectivity parameter is used for analysis of SPARQL query, accessing the data using subject is encouraged when federated response is required.
04/13/23 NCACPR-14 16
REFERENCES[1] RDF and SOA by David Booth, Ph.D., HP Software.[2] TechnicaLeeSpeaking: Software designs, implementations, solutions,
and musings by Lee Feigenbaum. [3] Frank Manola, Eric Miller, W3C, RDF Primer.[4] David Booth, W3C Fellow / Hewlett-Packard, Hugo Haas, W3C Francis
McCabe, Fujitsu Labs of America, Eric Newcomer (until October 2003), Iona Michael Champion (until March 2003), Software AG, Chris Ferris (until March 2003), IBM, David Orchard (until March 2003), BEA Systems, Web Services Architecture.
[5] Eric Prud'hommeaux, W3C ,Andy Seaborne, Hewlett-Packard Laboratories, Bristol, SPARQL Query Language for RDF,W3C Recommendation 15 January 2008.
[6] Lesley Charles, November 23, 2009, SPARQL Query Optimization.[7] A. Bernstein, M. Stocker, and C. Kiefer. SPARQL Query Optimization Using
Selectivity Estimation. InPoster Proceedings of the 6th International Semantic Web Conference (ISWC), Lecture Notes in Computer Science.Springer, 2007.
04/13/23 NCACPR-14 17