adding semantic to web data and servicesbeaune/websem/cours2008_2009/... · data warehousing in...

27
1 Adding Semantic to Web Data and Services Part 4 – Use Cases Doctoral School, St Etienne January 2009 Alain Léger FT R&D Orange Labs Research DR Knowledge Processing (KRR) Manager Industry Area IST NoEs OntoWeb et Knowledgeweb (2000 -2007) Associated DR CNRS Lyon I - LIRIS Slide 2 © KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005 Plan Cours 1 (5 janv 09 13:30 – 17:15 / 6 janv 09 8:15 – 12:00 ) ¾ Why adding semantics to the Web ? (1h30) Introduction Take Away and References ¾ Foundations of Semantic Web (2h15) Introduction to Logics and Description Logics Standards and non-standards Inferences ¾ From XML, RDF to OWL (2h45) XML, RDF, RDF-S OWL ¾ Applications and Roadmap (1h00) Application Scenarios Visions prospectives et verrous technologiques

Upload: others

Post on 24-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

1

Adding Semantic to Web Data and ServicesPart 4 – Use Cases

Doctoral School, St Etienne January 2009

Alain Léger FT R&D Orange Labs ResearchDR Knowledge Processing (KRR)Manager Industry Area IST NoEs OntoWeb et Knowledgeweb (2000 -2007)

Associated DR CNRS Lyon I - LIRIS

Slide 2© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Plan Cours 1 (5 janv 09 13:30 – 17:15 / 6 janv 09 8:15 – 12:00)

Why adding semantics to the Web ? (1h30)

– Introduction

– Take Away and References

Foundations of Semantic Web (2h15)

– Introduction to Logics and Description Logics

– Standards and non-standards Inferences

From XML, RDF to OWL (2h45)

– XML, RDF, RDF-S

– OWL

Applications and Roadmap (1h00)

– Application Scenarios

– Visions prospectives et verrous technologiques

Page 2: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

2

Slide 3© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Plan

Semantic Web : The Big pictureHype or Reality?

Accelerate the take-upRealize Business Use Cases!Make them profitable!

And next …Relax ☺

Slide 4© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Industry permeates the Research

Page 3: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

3

Slide 5© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Fast take up in industry is key !!

Showing the value to Business Units– Do not oversell the technology (AI syndrom …)– Convincing benefits on Not toy scenarios !– Fast ROI

Hiding the complexity of technology to all usersFocusing the research effort on key Industry RoadblocksMaking available tools and compliant FrameworksSharing the knowledge and theoretical skill with industryStandardizing on key elements

Do not realize the full picture at once !

Slide 6© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Applications

Research

Technologies, Innovations

Deployed useful applications

&

Page 4: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

4

Slide 7© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Main GoalPromote greater awareness and faster take-up

of SWS technology by industry – in close and permanent synergy with research area

How? create a virtuous feedback loop between industry and research, including education and training

Academia

Industry“bridge the gap” between industry and research(bi-directional process!)

Slide 8© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

NoE Knowledge Web Industry Area process

SpecificIndustry Needs

Use Cases &Knowledge components

Technology SuitabilityEvaluation

Interoperable Framework& Ontology content

Ontology ContentRecommendation

certifying, and serving validated ontologies

1

Kweb

Indu

stry

Allian

ce

2 3

Cross-Network cooperation – joint program of activities & Joint Education, competence centers5

Kweb Portal6

Promotion, Deployment– Technology RoadMaps, Success stories, Technology show cases, Best Practices

4

Research Education

OOA Alliance

Page 5: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

5

Industry Support

Alain LégerRobert Meersman

Chairs of Industry Area Knowledgeweb

Slide 10© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Industry Board membersIncluding:- Client Industry- Technology push companies- Ontology content partners

Page 6: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

6

Slide 11© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Industry members

Human Ressources– SAP.de– EDS.nl– NBC.nl– Kenteq.nl– Ordina.nl– BT.co.uk– City of Antwerp.be– Randstad.nl– Klett Verlag.de– Editis.fr– IMC.de– IDS-Scheer.de– VDAB.be – Le FOREM.be

Technology– TXT e-Solutions– Acklin– Distributed Thinking– Onto Text Lab– Isoco– Bitext– Risaris– IKV++– Semtation– HR-XML consortium– Amper– Grupo Miramon– Labein– Robotiker Infotech– Berlecon– Computas Technology– Synergetics– Net Dynamics– Green Cacti– Expert System Language

Publishing, Content– Merrall Ross International Ltd– Office Line Engineering– NIWA

Manufacturing– WTCM

Banking– Tecnologia, Informacion and Finanzas– Cidem

Telecom– France Telecom– British Telecom– Telefonica– Neofonie

Energy– IFP– EDF (not finalised)

Transport– SNCF– TrenItalia– EADS AirBus (not finalized)

Health care and Biomedecine– Biovista– L&C Languge and Computing– AstraZeneca.se– Synthematix.usa– Roche.sz– Novartis.sz– Entelos Inc.usa

Automotive– Daimler Chrysler

Foods– Illy Caffè

Space Defense– Thalès– DSTL– Ministero de Defensa– Deimos Space Focusing FIRST on

very promising areas

0

1

2

3

4

5

6

7

8

9

10

Companies

Country

UKAustAllEsFrItGrBeBulDkIrlUSA

Slide 12© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Industry Partners facts

50 Industry partners on Board (Including OOA)50 prospects 15 nationalities14 economic sectors– Health, Telecom, Automotive, Energy, Food, Media, Transport, HR,

Space, Publishing, Banking, Manufacturing, Laws, and TechnologyFocused on areas– With rapid promising results

• Health Care and Life Sciences• Telecommunications• Human Resources management• Culture Museums

Give a « membership » statusVery demanding task

Core team partners very motivated

Page 7: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

7

Slide 13© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Contract Alliance

Knowledge Web Early Access Policy– A panel of industrial corporations working as a “laboratorial” target market– where all can

• study industrial requirements, • test the industrial value of their ideas,• collect inspiration for new research opportunities.

Early Knowledge transfer– "Early Releases“

• Results and deliverables from KW technical, are to be made available to Panellists at the respective owners' discretion;

– Early Releases have to be treated as confidential by the Panellist, – The Panelists agree

• To test the Early Releases diligently and to communicate to the Contractors sufficient progress and error reports,

• To provide inputs such as but not limited to, recommendations, or industrial requirements, feedback to the Contractors in relation to its own use of the Semantic Web services & technology,

Industry managers responsibility– ensuring timely achievement of the Panellists' tasks,– ensuring feedback and information for the benefit of all Contractors

Mutual obligations of results

Slide 14© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Ontology Outreach AuthorityFrom the POV of an enterprise, Semantic Web is an intrusive and a

disruptive technology. Need also to pay attention to roles of legacy systems,

methodology, and resulting research and education challenges

Ontologies are very difficult to standardize. Ontologies usually are

application-dependent; subjective; general evaluation criteria lacking

We suggest “standardization lite” by recommending ontologies:

Evaluate an ontology whether it is in concordance Evaluate an ontology whether it is in concordance with the claims of its developers.with the claims of its developers.

Goals, requirements, scope, reusability, usability, etc.

First sectors addressed:

• Human Resources and Employment • Healthcare and Life Sciences

Page 8: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

8

Semantic Web Use Cases

KnowledgeWeb : Industry Area – Research Area

Alain Léger, Lyndon Nixon, Malgorzata Mochol, Roberta Cuel, François Paulus, Mustafa Jarrar. Heraklion, 3 June 2005

Slide 16© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use case collection process

We send a short questionnaire to eachindustry board member asking for input

Page 9: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

9

Slide 17© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use case collection process

We follow up with a face-to-face meeting to collect furtherdetails and write up the use case in consultation with

industry

Slide 18© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Cases: Results so far

1. Recruitment2. Multimedia content analysis and annotation3. Peer-to-peer eScience portal4. News aggregration service5. Product lifecycle management6. Data warehousing in healthcare7. B2C marketplace for tourism8. Digital photo album management9. Geosciences project memory10. R&D support for coffee11. Co-ordination of Real Estate Management12. Hospital Information System13. Agent-based system for insurance14. Daimler Chrysler Semantic Web Portal15. Specialized Web Portals for Businesses16. Integrated access to Biological data

Page 10: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

10

Slide 19© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Knowledge processing tasks

Extract research challenges from use caseDirect research to industrial requirements

To package and purchasea nice week-end

To plan a nice week-end

Customer Content and Service Providers (C/S Ps)

ManageContent-Aggregation

MappingContent ControlContent Transform-XmlStream

Access ServicerequestSchemas

Manage-Content

loadXmlStream

transformContent

returnResult

Task: Data Translation

Component: Wrapper

Requirements: Static mappings are not scalable. Correspondences between

content need to be determined dynamically.

Task: Results Reconciliation

Component: Reconciler

Requirements: Integrated data should be checked for duplications or overlap in the

collected information.

Slide 20© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Analysis of use cases

Use Cases by Industry 6% 6%

6%

13%

18%13%

19%

13%6%

Automobile Energy

Food Industry Government & Public Sector

Media & Communications Pharmaceuticals & Health

Service Industry Technology Providers

Transport & Logistics

0

2

4

6

8

Solutions sought by industry

Matching Annotation Search

Navigation Integration of data Standardization of vocabulary

Data management Consistency checking Personalisation

Recruitment use case:

Solution – „matching between job offers and job seekers“

Why – „Employee recruitment is increasingly being carriedout online… Finding the best suited candidate in the fastest

time leads to cost cutting and resource sparing“

Data warehousing in heathcareuse case:

Solution – „introduce a commonterminology for healthcare dataand wrap all legacy data in this

terminology“

Why – „…allows for a dataintegration and consistency

checking solution… „

0

2

4

6

Technology locks

Sem antic query Ontology m atching Storage and retrieval

Knowledge extraction Ontology-based reasoning Sem antic Web ServicesOntology m apping Sem i-autom ated annotation Ontology authoring tools

Ontology developm ent Support for rules Trus tOntology m aintenance

R&D Support for Coffee use case:

Summary – Semantic search engine for ontology basedqueries in a Knowledge Management system

Technology locks – Corporate domain ontologybuilding and maintenance

Integration of Biological Data Repositoriesuse case:

Summary – An unified point of access to different biological data repositories

accessible through the Internet

Technology locks – Generation and extraction of knowledge from biological

data…

Using standards.. providing a unified entrypoint to different biological data

repositories (ontology-based reasoning)

Page 11: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

11

Slide 21© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Task typology

Directory ManagerDirectory Management

ProfilerPersonalization

Match ManagerProducing Explanations

Ontology ManagerSchema/Ontology Merging

Results ReconcilerResults Reconciliation

PlannerComposition of Web Services

Query ProcessorSemantic Query Processing

ReasonerReasoning

Annotation ManagerContent Annotation

Match Manager Matching Results Analysis

Match ManagerMatching

Ontology Manager Ontology Management

WrapperData Translation

ComponentsKnowledge processing tasks

Most tasks repeat over cases, suggesting a stable typology containing core tasks

stipulated by industry needs.

Drilling down into Use Cases

Page 12: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

12

Slide 23© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

OverviewOverviewD1.1.4: Use Case Summaries– Use Case 1: Recruitment, Worldwidejobs– Use Case 2: B2C portal, FT– Use Case 3: News Aggregration, Neofonie– Use Case 4: Product Lifecycle Management, Semtation– Use Case 5: Managing Knowledge at Trenitalia– Use Case 6: Access to Biological data, Robotiker– Use Case 7: Needs in Petroleum industry, IFP– Use Case 8: Hospital information systems, L&C global– Use Case 9: Multimedia Processing

Conclusions

Slide 24© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Selection of key use cases

Page 13: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

13

Slide 25© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 1: Recruitment

ApplicantEmployer

Job Posting annotated using controlled vocabularies

Semantic Matching of applicant’s profile with job postings

Recommended open positions

Job Application annotated using controlled vocabularies

Semantic Matching of applications with position’s requirements

Interview Recommendations

Automated Preselection

Job Portal

Slide 26© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

InformationIntegration +

MatchingInformationConsumers

SemanticMatchingEngine

Crawler

RDFRepository

Semantic Portal

InformationProviders

RDF-annotated Websites

RDFRepository

Non-RDFHR System

Wrapper

NetAPI

NetAPI

Use Case 1: Recruitment

Page 14: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

14

Slide 27© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Reputation as ranking criteriaMatch Manager

Trust

Guidelines, change tracking (versioning)

Ontology manager

Ontology managing

Mapping non-RDF, semi-automation

Wrapper; annotater

KnowledgeExtraction

Scalability, retrieval reliability&performance

Directory Manager

Storage& retrieval

Improved matching: weighting, measures…

Match Manager

Semantic Matching

Use Case 1: Recruitment

Slide 28© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 2: B2C portal

OffersCustomer

Request understanding

Semantic best Match of customer request and profile with current offers

Selection of matching combinations

Selected services and Data resources annotated using Ontologies

Knowledge fusion and summary of top best match

Travel Package thrustedRecommendations

Automated Knowledge fusion

Query plan generationDiscovery and composition

Plane booking

Train booking

Car rentals

Books and Guides

Next two weeks I’m goingto Heraklion. Could you

propose me a personalizedleasure package

and a concise guide in spanish?

Page 15: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

15

Slide 29© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 2: B2C portal

Reputation as ranking criteriaMatch Manager

Trust

Guidelines, change tracking (versioning), tool and methodology

Ontology manager

Ontology managing

Mapping non-RDF, semi-automation, ease of integration of new services / resources

Automated Wrapper

Knowledge Extraction

Semi-automated ontology Merging and mappings

Ontology Manager

Local schemas mappings

Adapted matchers,for DB and services

Match Manager

Semantic Matching

Slide 30© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 3: News Aggregration

The current solution is a semi-automatic approach consisting of two phases:• the manual creation by a source expert of a XSLT template for each news source,• the automatic processing of that news source through a thematicclustering algorithm (NLP) and classification with category mappings.

Provision of an aggregated news service that is able to provide business clients with accurate search, thematic clustering, classification of news stories, and e-mail notification of stories of interest.http://www.newsexpress.de

Page 16: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

16

Slide 31© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 3: News Aggregration

Usage rights, assesstrust

Annotation, wrapper, results

Security, Trust

Rich, efficient query. Matching

Query,reasoner, results reconcile

Search

Extraction from text, fuzziness

Ontology managerOntologyDevelopment

Dynamicism, performance

WrapperOntologyMapping

Tool integration, automation

Annotation managerAnnotation

Slide 32© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 4: Product Lifecycle

There is a high cost associated with the development and maintenance of product catalogues throughout the product lifecycle. Expensive and complex tools are used due to non standardized terminology of the structure of product catalogue data, causes difficulties in development and maintenance of that data.

A tool supports the representation of product configurations as Visio diagrams, in which the diagram components are tied to descriptions using a common terminology.The current tool could be extended to support Semantic Web standards such as OWL in the future.

Page 17: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

17

Slide 33© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 4: Product Lifecycle

Extensions (rules, measures), Efficiency (minimal subsets)

ReasonerLogic / rules

User profiles, access rights, collaboration

ProfilerPersonalisation

Good tool, visualisation, change tracking

Ontologymanager

Ontology development

Slide 34© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 5: Managing Knowledge at TrenitaliaInstitutional

K

KBase

Portale

Unique access

A unique Workflow of K contribution

A central Repository of

Content Management

A common taxonomy

content management

system

Intranet Site

Tools of personal

productivityP

P

P

Local Contribution, codification and storing

Distributed KM: Research and Development Peer to peer document sharing and reviewing systems; semantic information

retrieval tools; Expert maps; Social technologies that allow communities exchanging processes; Documents upload on official repositories

Page 18: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

18

Slide 35© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 5: Managing Knowledge at Trenitalia

Contribution and position within community as ranking criteria

Collaborative writing systems, and evaluation ranking

Collaborative and social creation and reviewing of documents

Reputation as ranking criteriaMatch ManagerTrust

Guidelines, change tracking (versioning)

Ontology manager

Ontology managing

Mapping non-RDF, semi-automation

Wrapper; annotater

Queries

Scalability, retrieval reliability&performance

Directory Manager

Storage& retrieval

Improved matching: weighting, measures…

Match ManagerSemantic Matching

Slide 36© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 6: Integrated access to biological data

Biological Data Repositories

Existing Ontologies

Semi-Automated merging and mapping

Annotations and wrappers generation

Selected Data resources

Knowledge mining and fusion

Automated Knowledge extraction

Ontology merging and mapping

RDF-annotated Websites

RDFRepository

Non-RDFBD System

(Nucleotide Sequences, amino acid sequences,…), corporate databases, results of experiments (DNA-chips),

health cards, medical literature sites…

a researcher wants to compare the

result of a experiment with the genome annotation

database

Non-RDFBD System

Non-RDFBD System

Inte

rnet

allow, combining and associating existing ontologies in the biological field, an integral modelling of the biological data sources

(genomics, proteomics, metabolomics and systems biology)

BiologistExpert

Semantic best Match

Selection of matching combinations

Query plan generation

Requestunderstanding

New knowledgegeneration

Page 19: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

19

Slide 37© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 6: Integrated access to biological data

InformationIntegration + Matching

Knowledge fusion

Biology experts

Semantic Query Matching Engine

Automated Ontology Merging and Mapping

Semantic Mediations

Biological Data Repositories

RDF-annotated Websites

RDFRepository

Non-RDFBD System

(Nucleotide Sequences, amino acid sequences,…), corporate databases, results of experiments (DNA-chips),

health cards, medical literature sites…

Non-RDFBD System

Non-RDFBD System

Inte

rnet

Automated wrapper generation

Slide 38© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 6 : Integrated access to biological data

Knowledge mining over ontology mediated sources

Knowledge miner

Knowledge Generation

Guidelines, change tracking (versioning), tool and methodology

Ontology manager

Ontology managing

Mapping non-RDF, semi-automation, ease of integration of new services / resources

Automated Wrapper

Knowledge Extraction

Semi-automated ontology Merging and mappings

Ontology Manager

Local schemas mappings

Adapted matchersfor very heterogeneous DB

Match Manager

Semantic Matching

Page 20: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

20

Slide 39© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 7: Geoscience semantic memory

ComplexMultimediaDocuments

From Energy industry partners

GeoscienceExperts

I am in charge of this new project with Total

company for energyexploration in Indian sea ,

could you find me the recent subsurface

documents of south-eastIndia?

In a KM context to give semantic access to multi projects documents and data (software, subsurface models) to practitioners

ComplexMultimediaProject documents

90% Automated deep annotation with available taxonomy/ontology

Deeply AnnotatedProjects documents

Selected Doc / Data

Extraction

Knowledge extraction and summary

Automated Knowledge summary

Automated annotation

Semantic best Match

Selection of matching combinations

Query plan generation

Requestunderstanding

Best availableInformation / knowledge

Intra

/Ext

rane

t

Non-RDFBD System

Non-RDFBD System

C HC

C

=> CCO²

CO

Slide 40© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

InformationIntegration + MatchingKnowledge summary

Geoscienceexperts

Semantic Query Matching Engine

Automated Ontology Annotation

Semantic Mediations

ComplexMultimedia

Documents Data Repositories

Non-RDFBD System

Non-RDFBD System

Automated summarization

Intra

/Ext

rane

t

C HC

C

=> CCO²

CO

Use Case 7: Geoscience semantic memory

Page 21: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

21

Slide 41© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Ontology-driven navigationOntologynavigator

Navigation

Large domain Ontology building and maintenance

Ontology manager

Ontology managing

Summarizing complex documents and data table

Summarizer Knowledge Summary

Automated on-the-fly of complex documents / DB / SW

AnnotatorAutomatedannotation

Selection of the best (part-of) documents and data

Match Manager

Semantic Matching

Use Case 7: Geoscience semantic memory

Slide 42© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 8: Hospital Information System

Dealing with the issues of database integration in the domain of healthcare

Semantic Mediator

Medical Ontology

Mappin

g/Ann

otatin

g

Database 1 Database n

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Query

Free text reports

I want to access all diagnoses and treatment

information about my patients, from all of these heterogeneous resource

easily and quickly

Page 22: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

22

Slide 43© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 8: Hospital Information System

Mediator

Ontology

Mappin

g/Ann

otatin

g

Database 1 Database n

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Xyz XyzXyz Xyz Xyz Xyz

Xyz Xyz Xyz Xyz Xyz Xyz Xy

z Xyz Xyz Xyz XyzXyz Xyz

Query

Identified Technology needs

How to build a good ontologyFoundational challenges in ontology modeling

How to deal with (free text) medical reports: - Extraction,- Indexing, - Annotating

How to agregate with other patient data:-Echograph-Cardiograms-….

Slide 44© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 8 : Hospital Information System

Knowledge extraction from medical reportsExtractorKnowledge

Extraction

Modeling methodology that guide ontology builders to have a high quality (and reusable) ontology content, i.e. facing foundational modeling challenges…

MethodologyOntology Engineering

To automatically merge medical facts about a patient.Merger

Automatedmapping

To automatically annotate medical reports and DB.Annotator

Automatedannotation

Page 23: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

23

Slide 45© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 9: Multimedia processing (1)

Knowledge structures (ontology modeling) for multimedia resources– Tools for enriching ontologies with multimedia information– Construction of domain ontologies for knowledge-assisted analysis

Knowledge-assisted multimedia content analysis tools to support concept detection and tracking

– Ontology learning and knowledge-assisted analysis– Person / face detection and recognition, mood detection – Ontological text analysis

Pre - ProcessingSegmentation

Descriptor ExtractionMultimedia Documents

Ontology Framework

ShotsPartitions

DescriptorsDescriptor Matching

ProcessingReasoning

Ontology Access & Querying

DetectedObjects/EventsACE Metadata

MergingSplittingFusion

TrackingConsistency

Checking

Indexing

DIRECTOR

SCENETAKE

TITLE

Pre - ProcessingSegmentation

Descriptor ExtractionMultimedia Documents

Ontology Framework

ShotsPartitions

DescriptorsDescriptor Matching

ProcessingReasoning

Ontology Access & Querying

DetectedObjects/EventsACE Metadata

MergingSplittingFusion

TrackingConsistency

Checking

Indexing

DIRECTOR

SCENETAKE

TITLE

KnowledgeBase

ContentAnnotation

Slide 46© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Use Case 9: Multimedia processing (2)

Algorithms for high-level semantic reasoning for multimedia contentUser query analysis tools and intelligent search, retrieval, ranking and relevance feedback mechanisms

– User query processing– Visual / conceptual hybrid search; relevance feedback

Context analysis

Contentanalysis

- object recognition (inference, reasoning)- querying- SW Services

Colours (LL features)

Sports

Football Basketball …

Player

Ball

Field

Give me the names of the players that made

the score of thisfootball match and

givel me their rating in the 2003 worldcup?

♦ Player A

♦ Player B

♦ Ball

Page 24: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

24

Slide 47© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Other prominent applications !

Bibster – A semantics-based Bibliographic P2P– http://bibster.semanticweb.org

CS AKTive space – Semantic data integration– http://cs.aktivespace.org (Winner 2003 SemWeb challenge)

Flink: SemWeb for analysis of Social Networks– http://www.cs.vu.nl/~pmika (Winner 2004 SemWeb challenge)

Museum Finland: Sem Web for cultural portal– http://museosuomi.cs.helsinki.fi (2nd prize 2004 SemWeb challenge)

ScienceDesk collaborative knowledge management system in NASA– http://sciencedesk.arc.nasa.gov/ (3rd prize 2004 SemWeb challenge)

Also see Applications and Demos at W3C SWG BPD– http://esw.w3.org/mt/esw/archives/cat_applications_and_demos.html

Slide 48© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Industry-Research (some) Challenges Key areas for semantic solutions are search and data integrationIndustry wants to better find and use the (legacy) databases (migration!)

Key technology locks are: – development of ontologies i.e. modelling of business domains, authoring,

best practices and guidelines, re-use of existing ontologies and simple tools!– knowledge extraction i.e. the population of ontologies by finding knowledge

within legacy data– mapping i.e. overcoming heterogeneity (use of different ontologies) by

determining how one ontology can be expressed in terms of another– Scalability: approximation, modularization, distribution– Matching: exact vs. fuzzy matching– Web services: where are they really needed? – Language extensions: what aspects are missing

• E.g. data types, expressiveness of rules, context, …

Page 25: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

25

Slide 49© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Transfer to industry in action

European Semantic Web research is working hard– to meet industrial requirements underway -

Industry members provide a concrete testbed for testing and evaluating research results– tools, ontologies, components and methodologies underway -

It is also important to support technology migration & industrial training– adapted courses to practitioners underway -

First phase of results– concrete technology transferance to industry already in 2006

Be pragmaticRealize the semantic web step by step

Slide 50© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Who have real Business today ?

Early Adopters !!

Page 26: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

26

Slide 51© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Key Technology announced August 2005!

Slide 52© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

NoE Knowledge Web : 3 Pillars

Industry Research Education

Ontologyrecommendation

body OOA

Virtual researchcentre

Virtual educationPlatform

Horizontal integration

KnowledgeWeb is an EU FP6 Network of Excellence– Runs from 2004 to 2007 (4 years)– Budget of approx €7 million– 18 academic partners and research centers from 11 countries

Page 27: Adding Semantic to Web Data and Servicesbeaune/websem/cours2008_2009/... · Data warehousing in heathcare use case: Solution – „introduce a common terminology for healthcare data

27

Slide 53© KnowledgeWeb Alain Léger – IASW2005, Jyväskylä, Finland, 26 August 2005

Selected referencesThe Semantic Web: research and Applications

– LNCS 2342 (ISWC 2002)– LNCS 2870 (ISWC 2003)– LNCS 3053 (ESWS 2004)– LNCS 3298 (ISWC 2004)– LNCS 3532 (ESWS 2005)

Journal of Web semantics (Elsevier)Thematic portal

– http://www.semanticweb.orgSemantic Technology Conference 2005

– http://www.semantic-conference.comWWW conferences

– http://www2004.org ProceedingsYearly semantic web applications challenge

– http://challenge.semanticweb.orgSIGsemis semantic web information systems journal

– http://www.sigsemis.orghttp://www.w3.org/2001/sw/

– All the subgroups !!– http://www.w3.org/2001/sw/BestPractices/ (Sem Web Best Practices)– http://www.w3.org/2004/12/rules-ws/report/ (Rules working draft)– http://www.w3.org/2004/swls-ws.html (Sem Web in Life sciences)– http://www.w3.org/2001/sw/Europe/ (Transfering W3C standards to Practioners)

http://knowledgeweb.semanticweb.org/– Education portal– Industry portal