with contributions by many west membersstaab/presentations/...kernel support vector machines hong et...

85
Web Science & Technologies University of Koblenz Landau, Germany Multimedia Web Steffen Staab With contributions by many WeST members

Upload: others

Post on 17-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Multimedia WebSteffen Staab

With contributions by many WeST members

Page 2: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web2 of 90

WeST

Semantic WebDr. Janik

Web RetrievalDr. Sizov

Interactive WebDr. Scherp

Multimedia WebDr. Grzegorzek

Software WebF. Silva Parreiras

Prof. Dr. Staab Prof. Dr. Sure

EU WeGovEU X-Media

HP Synth DocsEU WeKnowItDFG Multipla EU Most

BMBF CollabCloudEU NeOn

EU Tagora

Wer sind wir?

GESISProf. Sure

EU WeGovEU ASG

EU aceMediaEU kspace

EU Net2

Page 3: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web3 of 90

WeST

Groups

Caption

Multimedia in context

Time

Low-level

UserProfile

Favs

Comms

Geo Social

network

Tags

Page 4: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web4 of 90

WeST

Multimedia Web

Daten

Interaktion Personen

Wissen

Netzwerke

Retrieval

Metadaten

Signalverarbeitung

Verstehen

Präsentieren

Multimediadatenbanken

Web Science

Information Retrieval

Pattern Recognition

Interaktive Multimediasysteme

Page 5: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web5 of 90

WeST

Agenda

Multi-modal analysis of social media

Large Scale Tag Recommendation

Understanding the Semantics of Images

Spatial and OntologicalReasoning For Image Understanding

Page 6: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Large Scale Tag Recommendation

Rabeeh Abbasi

Page 7: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web7 of 90

WeST

Recommending Tags

Tag RecommenderTag Recommender

Big BenSt. Stephen's TowerClock TowerPalace of WestminsterHouses of ParliamentWestminsterLondonUnited Kingdom

Tags

InputInput OutputOutput

Geographical Coordinates

51°30'03"N0°7'24"W

Low-level Features

Which image features to use?

Page 8: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web8 of 90

WeST

System Overview

Tag RecommenderTag Recommender

51°30'03"N0°7'24"W

Big BenSt. Stephen's TowerClock TowerPalace of WestminsterHouses of ParliamentWestminsterLondonUnited Kingdom

TrainingData

Location 1

Location N

Location 1Clusters

Location NClusters

Model

Features

Features

Image Features

Tags

Page 9: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web9 of 90

WeST

• Tags• Flickr tags assigned by the users

• Geographical Coordinates• Latitude and Longitude

• Low-level Image Features

Features in Social Media

Page 10: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web10 of 90

WeST1 2 3 4 5 10 15 20

0

0.02

0.04

0.06

0.08

0.1

0.0751

0.0859

0.09140.0936

0.0892

0.0750

0.0645

0.0581

0.0272

0.0376

0.04350.0464 0.0474 0.0475

0.04480.0417

0.0244

0.0323 0.0338 0.0346 0.03490.0311

0.02810.0256

0.0183

0.02580.0300

0.0337 0.0357 0.0373 0.0358 0.0341

Geo EHD Tags Random

Number of tags recommended

F-M

easu

re

F-Measure

Considered ~400,000 images from 58 national capitals75% training / 25% test (gold standard) data

Geo CoordinatesLow-level FeaturesTagsBaseline (Random)

Page 11: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web11 of 90

WeST

Conclusions

New images can be automatically tagged provided their geographical coordinatesGeographical coordinates give best tag recommendations

Require less computational power (just 2 dimensions)

Page 12: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Multi-modal Analysis of Social Media

Dr. Dr. Sergej Sizov

Page 13: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web13 of 90

WeST

Groups

Caption

Multi-modal Analysis of Social Media

Time

Low-level

UserProfile

Favs

Comms

Geo Social

network

Tags

Page 14: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web14 of 90

WeST

1. Select characteristic aspects of social media: tags, timestamps, coordinates, low-level features, etc.)

2. Construct probabilistic modelsusing joint distributions of considered aspects(latent semantic analysis)

3. Compute topic-basedprobabilistic feature vectorsfor resources, tags, and users in our model

4. Construct applications that exploit these feature vectorsfor content categorization, filtering, or search

What we do with all this..

Page 15: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web15 of 90

WeST

Case 1: recommending cool photos to users

Global Flickr model Personal User-specific model

User postings are NOT characteristic for user interests!They are looking for something quite different ..

.. published at ACM Hypertext 2009

Can we predict and recommend favorites?

Lesson1

Page 16: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web16 of 90

WeST

Case 2: detecting events

„Unusual“ topics localized in time AND space indicate events- anomalies are interesting and mean something special ..

.. to appear in DB Spektrum, Special Issue on Social Media

Selected latent topics in the Flickr! Barcelona dataset 2005-2008These topics are only active in small time/space segments (1 week - 1x1 km)

microsoft 0.139teched 0.136autumn 0.132student 0.132partners 0.132msp 0.132nov 0.125water 0.031bycicle 0.017food 0.010

graffiti 0.199streetart 0.199art 0.199street 0.199mtn 0.012happy 0.012blameless 0.011montana 0.009parr 0.007spraypaint 0.003

girls 0.110show 0.108sexy 0.108ficeb 0.108erotic 0.107model 0.107festival 0.107ficeb07 0.107tattoo 0.105fishnet 0.019

3gsm 0.408fira 0.335fair 0.073trade 0.058montjuic 0.051fountains 0.037amazing 0.029show 0.027night 0.012water 0.011

deportivo 0.077sports 0.076camp 0.063soccer 0.055lacoru 0.055futbol 0.054liga 0.052deportes 0.041barca 0.038depor 0.036campnou 0.036

Lesson2

Page 17: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web17 of 90

WeST

Case 3: exploiting location awareness

Geo coordinates can supplement text featuresin search and retrieval apps

.. published at ACM WSDM 2010

Can we exploit tag-location correlations (i.e. text features and GPS coordinates)for better content organization? Example: Flickr London dataset

classifying photos clustering photos

automatic tag recommendation

Lesson3

Page 18: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Understanding the Semantics of Images

Dr. Marcin Grzegorzek

Page 19: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web19 of 90

WeST

The inability to access accurate and desired content can be as limiting as the lack of content itself!

Textual information : Human defined and precise in meaningAV content : Hidden component of human creative reasoning The task:

– Automatic annotation and retrieval of AV content using semantic structures natural to humans (e.g. words or sentences);

Challenges:– Subjective interpretation of content by different users

under different conditions; – Discover links between semantic based query and low-

level metadata.

Page 20: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web20 of 90

WeST

Motivation

Human user :Semantic similarity

Machine:Low-level similarity

The approach:• Merging different low-level features;• Dealing with the subjective interpretation of content;• Modeling semantic objects as mosaics made of irregular structures obtained

through low-level and spatial clustering of building blocks.

The end goal: Incorporating the obtained knowledge into a learning approach and teaching the machine to “reason” as a human.

Page 21: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web21 of 90

WeST

Descriptive Learning Models

Descriptive learning modelsrelevant class modelled with….

Single Gaussian distribution

Heuristic

Non-heuristicThe MARS system (Rui et al. 1998): re-weighting techniques and query point movement;

The PicHunter system (Cox et al. 1998), RF in a form of relative judgment and an extension of k-d trees.

Ishikawa et al. 1998: An optimization problem and considering correlation among features .

More complex distribution

Qian et al. 2002 : Gaussian Mixture Models, both relevant and irrelevant examples as well as unlabelled data

Parametric models

Non-parametric models

Non-parametric distribution for relevant samples

Parzen window density (Meilhac and Nastar 1999): RF problem as difference of densities for relevant and irrelevant samples; Incremental approach with use of user’s feedback for increasing precision of model estimation.

Page 22: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web22 of 90

WeST

Discriminative Learning Models

Discriminative learning modelsestimating the boundary between classes…

Non-linearity of visual space= Discriminative approaches + Kernel based algorithms;

Discriminative Analysis

Linear DA approach tries to cluster all irrelevant images into one class, whereas negative examples belong to many different classes….

Wu et al. 2000:Transformation maximizes the ratio between inter-class and intra-class scattering.Disadvantage: Each negative example is treated as from a different class

Linear DA

Multiple DA

Biased DA

Zhou & Huang 2001:

Positive examples are clustered in one class while negative examples can come from an uncertain number of classes; Disadvantage: Sensitivity to imbalance between classes; considerable amount of training required.

Kernel support vector machines

Hong et al. 2000 & Guo et al. 2002 : The input data is mapped into higher-dimensional feature space using a non-linear transform- kernel; In the newly defined space a boundary between classes can be easily found…

Page 23: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web23 of 90

WeST

System Overview

Multi-FeatureSpace

Automatic Feature

Extraction

Sub-System I

Learning Approach (SVM)‏

Adaptive Convolution Kernel

User

RetrievedImages

Sub-System III

RepresentativeStructured Multi-

Feature Sspace

Set of Blocks

Sub-System II

Page 24: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web24 of 90

WeST

Object Modelling

Semantic objects can be regarded as mosaics of small building blocks

STEP1: Elementary overlapping structures covering the whole picture;

STEP2: If a block is a member of two or more neighbouring structures then it is removed from all structures but the closest one (low-level similarity)Breakdown into non-regular, non-overlapping structures

The 3x3 neighbourhood used to build initial regular structures and its breakdown into non-regular, non-overlapping structures

ijS ijS%

(i,j) (i,j+1)

(i+1,j) (i+1,j+1)

(i-1,j-2) (i-1, j-1) (i-1,j)

(i,j-2) (i,j-1)

(i-1,j+1) (i-1,j+2)

(i,j+2)

(i+1,j+2)

(i-1,j-1) (i-1,j) (i-1,j+1)

(i,j-1) (i,j) (i,j+1)

(i+1,j-1) (i+1,j) (i+1,j+1)

STEP3: Key-representative structures within each image are extracted using k-medoids clustering method and low-level distance as well as the spatial proximity of generated structures;

Page 25: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web25 of 90

WeST

Databases for Evaluation

Two different databases are used for evaluationA subset from the Corel stock consisting of seven concepts, these arebuildings, clouds, cars, elephants, grass, lions and tigers:

25 distinct classes from the COIL database:

Page 26: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Spatial and OntologicalReasoning For Image Understanding

Carsten Saathoff

Page 27: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web27 of 90

WeST

Motivation

Local features often not sufficient for classificationExploit explicitly defined spatial knowledge to improve labelling

e.g. Sky not allowed left or right of SeaAllow for efficient training of classifiers and spatial knowledge

Good labelling performance with few training examples

Sky

Sea

Sea

Sand

SeaSeaSky ↯

Page 28: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web28 of 90

WeST

Analysis Framework

HypothesesGeneration

Spatial Relations Extraction

SpatialReasoning

Training

Analysis

(x,x,x,x)(x,x,x,x)(x,x,x,x)(x,x,x,x)

Low LevelClassifiers

SkySea above

Sky

Sand

SeaSea

Trained SpatialBackground Knowledge

Training Examples

Page 29: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web29 of 90

WeST

Exploiting Spatial Context

Create optimization problem fromRegionsSpatial RelationsHypotheses setsSpatial background knowledge

ApproachesFuzzy Constraint Satisfaction

• WIAMIS08Binary Integer Programming

• SAMT2008

Sand, 0.8Sea, 0.7Person,0.5…

Hypotheses

Background Knowledgeabove-of (Sky, Sea) -> 1.0

(Sea, Sand) -> 1.0(Sea, Sky) -> 0.0…

above above

above above

left-of

Page 30: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web30 of 90

WeST

Experimental Setup

Measure improvement achieved withFCSPLinear Programming

over low-level classification with different training set sizesData set with 923 natural and urban imagesSet of 10 concepts

building,foliage,mountain,person,road,sailing-boat,sand,sea,sky,snow5690 labelled regions568 labelled „unknown“ -> ignored

Ground truth defined on automatic segmentationRegions labelled with dominant conceptGround truth created for this work

Page 31: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web31 of 90

WeST

Overview of dataset

Available at: http://mklab.iti.gr/project/scef

Page 32: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web32 of 90

WeST

Classification Rate over Training Set Size

50 100 150 200 250 300 350 4000.50

0.55

0.60

0.65

0.70

0.75

0.80

0.85

Low-level classi-ficationFCSPBIP

Page 33: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web33 of 90

WeST

F-Measure per Concept

Training Set with 300 images (best performing one for BIP)

snow road mountain foliage person sand sky sailing-boat sea building average0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Low-levelFCSPBIP

Page 34: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web34 of 90

WeST

Ontological Background Knowledge

Question: Can we integrate general knowledge into theoptimization problem?Examples:

Beach Images contain some Sky region and some SearegionAn outdoor image taken on Mallorca in summer does notcontain any Snow

ToolsDescription LogicsBinary Integer Programs

Page 35: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web35 of 90

WeST

Exploiting general background knowledge

Generic image interpretationCreate optimization problem from

Hypotheses about imageRelations between individuals describing the imageOntological background knowledge

Approach based on extension of BIP reasoner

Sand, 0.8Sea, 0.7Person,0.5…

Background Knowledge

above above

above above

left-of

image

Beach, 0.8Mountain, 0.7Indoor,0.5…

depicts

region

Beach = depicts some Seaand depicts some Sand

Sea = above only (Sand orSea or Boat)

Page 36: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web36 of 90

WeST

Conclusion

Still more work:Dr. Ansgar Scherp

• Metadata standards for SMIL, Flash, etc.• Modeling of Events• User interaction with multimedia

Thomas Franz: Semantic relationships between information objectsSimon Schenk:Semantic backend technology….

Page 37: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Semaplorer

Page 38: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web38 of 90

WeST

Many sources – little context

Links Location

Persons

Knowledge Tags

low- to midlevel features

xxxxxxxxx

Page 39: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web39 of 90

WeST

Choosing between Koblenz – and Koblenz

Page 40: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web40 of 90

WeST

Contextual Information

Page 41: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web41 of 90

WeST

Tag-based refinement

Page 42: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web42 of 90

WeST

A tag view of „Koblenz“ & „Castle“

Page 43: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web43 of 90

WeST

Semantic Identity – Festung Ehrenbreitstein

Page 44: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web44 of 90

WeST

Persons – Celebrities, FOAFers & Flickr Users

Page 45: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web45 of 90

WeST

Some Shades of Multimedia Meaning

Links Location

Persons

Knowledge Tags

low- to midlevel features

xxxxxxxxx

GeoNames

Page 46: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web46 of 90

WeST

Conclusion

Multimedia Web isSignal Processing

Encoding, wavelets, bitmaps,…Machine Learning

Clustering, classification, Bayes, SVM, LDA,…Social Context

Recommender systems, favorites, Web 2.0 applications…Background Knowledge

Ontologies, logics, optimization, …Interactive Systems

Systems, user studies, graphics, …Fun!

Page 47: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Web Science & TechnologiesUniversity of Koblenz ▪ Landau, Germany

Thank You!

Page 48: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web48 of 90

WeST

Agenda

Semaplorer: Faceted Browsing of Semantic Multimedia Data

Linked Open Data by Collective Intelligence

Semaplorer Architecture

Linking High-Level and Low-Level Data

Representational Paradigm• COMM – Core Ontology for MultiMedia• F – Event Ontology

Computational Paradigm

Links Location

Persons

Knowledge Tags

low- to midlevel features

xxxxxxxxx

Page 49: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web49 of 90

WeST

Collective Intelligence

Collective datasetsHosted public datasetsGated datasets

• Social networks,…

Wikipedia styleActually includes

• Discussions• Editor hierarchies• Policies

Pagerank stylehighly effectiveno coordinationno control (modulo spamming)

Gene ontologyDBPedia, Public census dataFacebook, LinkedIn

WikiversityFAQs

Yahoo Answers, Lycos IQ

TaggingFlickr, Delicious, …geotagging

Different Flavors Linked Open Data

Page 50: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web50 of 90

WeST

Linked Open Data: Example Instances

Page 51: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web51 of 90

WeST

Linked Open Data: Instance Containers

Page 52: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web52 of 90

WeST

Linked Open Data: Classes

Page 53: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web53 of 90

WeST

Agenda

Semaplorer: Faceted Browsing of Semantic Multimedia Data

Linked Open Data by Collective Intelligence

Semaplorer Architecture

Linking High-Level and Low-Level Data

Representational Paradigm• COMM – Core Ontology for MultiMedia• F – Event Ontology

Computational Paradigm

Links Location

Persons

Knowledge Tags

low- to midlevel features

xxxxxxxxx

Page 54: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web54 of 90

WeST

Swoogle

RDFSrulesgeo...

Billion triples challenge: Use Linked Open Data

Common approach: Import dump to new data silo

Semantic Web?

Geoquerying

GeoNames

WordNet

GeoNames

flexible

scaleable

webby

extensible

RDFS Rules

inflexiblemonolithic

notscaleable

PlaceOfBirthbirthplace

birthplace

WordNet Swoogle

fulltext

12 months in 2005/06700M triples

+ ++

+ >1Gt

Since mid Nov: Flickr API

Since Dec, Amazonpart shut down

Page 55: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web55 of 90

WeST

A federated RDF repository

GeoNames

RDFS Rules

PlaceOfBirthbirthplace

birthplace

WordNet Swoogle

fulltext

12 months in 2005/06700M triples

+ ++

+ >1Gt

Since mid Nov: Flickr API

Page 56: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web56 of 90

WeST

Architecture of Federated Infrastructure

Semantic Annotation Tool KATExtended by generic and application-specific pluginsProviding SemaPlorer‘s UIPlugin: Download image content from flickrPlugin: Map component using OpenStreetMap

Page 57: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web57 of 90

WeST

Architecture of Federated Infrastructure

Views:?person birthplace ?city=>?person birthplace ?city UNION?person placeOfBirth ?city

Query Splitting:?geoSight ?geo:lat ?lat; geo:long ?long.?geoSight owl:sameAs ?dbpSight.<Category:VisitorAttraction> skos:broader ?dbpSight.=>1) ?geoSight ?geo:lat ?lat; geo:long ?long.

?geoSight owl:sameAs ?dbpSight.DISTRIBUTED JOIN2) <Category:VisitorAttraction> skos:broader ?dbpSight.

Locating endpoints:graphname endpoint

Page 58: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web58 of 90

WeST

Architecture of Federated Infrastructure

Endpoints:•Autonomous RDF repositories•Connected via SPARQL•Dynamic (added by reconfiguring SourceFinder)•25 repositories, > 400GB including fulltext indices

Inferencing and Querying:Can vary from repository to repository

Here:•SPARQL based Views•RDFS•transitivity rules for SKOS•Geo range queries in SPARQL•fulltext search in SPARQL via Lucene

Page 59: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web59 of 90

WeST

Architecture of Federated InfrastructureControl EC2 Instances:

Transparent Reconfiguration:

Administration Component updates Souce Finder Configuration, no need for the application to know.

Repositories:

10 running at Koblenz University, Germany

15 running at Amazon EC2, USA

Page 60: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web60 of 90

WeST

SemaplorerSemaplorer

� Scaleable � Flexible� Federated

Semantic Web infrastructure using� RDFS� Rules� Views� Geoqueries� Fulltext Search� Based on� Cloud Computing and� NetworkedGraphs

http://btc.isweb.uni-koblenz.de

Page 61: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web61 of 90

WeST

Agenda

Semaplorer: Faceted Browsing of Semantic Multimedia Data

Linked Open Data by Collective Intelligence

Semaplorer Architecture

Linking High-Level and Low-Level Data

Representational Paradigm• COMM – Core Ontology for MultiMedia

Computational Paradigm

Links Location

Persons

Knowledge Tags

low- to midlevel features

xxxxxxxxx

Page 62: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web62 of 90

WeST

Yalta Conference

Winston ChurchillRecognizer

Franklin D. RooseveltRecognizer

Josef StalinRecognizer

Linking Low-Level Data, Too: Multiple Tools

Page 63: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web63 of 90

WeST

http://en.wikipedia.org/wiki/Yalta_Conference

World War II

Yalta ...

...History Ontology

Creating a Multimedia Presentation

SR1 SR2 SR3

Winston ChurchillRecognizer

Franklin D. RooseveltRecognizer

Josef StalinRecognizer

ChurchillRooseveltStalin

Page 64: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web64 of 90

WeST

Issues<Mpeg7><Description xsi:type="ContentEntityType"><MultimediaContent xsi:type=„ImageType"><Image><SpatialDecomposition>

<StillRegion id=„SR1“><TextAnnotation><KeywordAnnotation xml:lang="en"><Keyword>Churchill</Keyword>

</KeywordAnnotation></TextAnnotation>

</StillRegion>

<StillRegion id=„SR2“><Semantic><Label><Name>Roosevelt</Name>

<Label></Semantic>

</StillRegion>

<StillRegion id=„SR3“><Semantic><Definition> <!-- Also TextAnnotation!! --><StructuredAnnotation><WhatObject><Name xml:lang="en">Stalin</Name>

</WhatObject></StructuredAnnotation>

</Definition></Semantic>

</StillRegion>...

How do you formulate a query to get images

showing Churchill et al.?

First Shot (Xpath)://StillRegion[.//Keyword=“Churchill” or

.//Keyword=”Roosevelt” or

.//Keyword=”Stalin”]

Winston ChurchillRecognizer

Franklin RooseveltRecognizer

Josef StalinRecognizer

Page 65: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web65 of 90

WeST

What is the Problem with MPEG-7?

Annotations are not interoperable!Ambiguities due to complementary description tools

Multiple ways to model semantically identical descriptions

Several alternatives for placing description tools inside an annotation

Complex queries needed to cover all alternatives!

Incompatible with (semantic) web technologies!You cannot link to the outside world!

Page 66: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web66 of 90

WeST

Capabilities and Maturity Levels

Integration Automation

Former Situation

Current Situation

Future / Desired Situation

no standard, no vocabularymanual 1:1 agreement on format and semanticstight coupling of data and applications

standard vocabularymanual 1:1 agreement onmpeg-7 vocabularytight coupling of data and applications

standard vocabularypre-defined meaningad-hoc coupling of data and applications

Formerly

MPEG-7

COMM

Page 67: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web67 of 90

WeST

MPEG-7

COMM

Requirements on a high quality

OntologyChallenge

BuildingBlock

Legend

Page 68: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web68 of 90

WeST

TextDescriptor

MusicManager

CompoundDocument

Requirements for COMM

ReusabilityDesign a core ontology for any multimediarelated applicationMPEG-7-ComplianceSupport most important description toolsExtensibilityEnable inclusion of further

• description tools(even those that are not part of MPEG-7!)‏

• media typesSeparation of ConcernsClear separation of domain knowledge andknowledge about structureModularityEnable customization of multimedia ontologyHigh degree of axiomatization Ensure interoperability throughmachine accessible semantics

ChurchillRecognizer

Josef StalinRecognizer

FaceDetector

PhotoManager

AuthoringTool

SemanticAnnotation

decomposition visual descriptorsaudio descriptors ...

<Mpeg7>...

</Mpeg7>

<Mpeg7>...

</Mpeg7>

Page 69: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web69 of 90

WeST

MPEG-7

COMM

Requirements on a high quality

MM Ontology

Quality of Ontologies

Challenge

BuildingBlock

Legend

Page 70: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web70 of 90

WeST

How to Design a High Quality Multimedia Ontology?

Approach from [Oberle, 2005], [Oberle et al., 2006]:Use a well designed foundational ontology as a modelling basis to avoid shortcomings

Foundational ontologies provideFormal precisionDomain independenceBroad scope

Building upon foundational ontologiesprevents easy inclusion of modeling artefactsreduces conceptual ambiguityinherit rich axiomatization

Page 71: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web71 of 90

WeST

Methodology

MPEG-7

COMM

Requirements: High Quality MM Ontology

Quality of Ontologies

Quality Measures for Ontologies

Reference Ontology

MPEG-7Compliance

Challenge

BuildingBlock

Legend

Page 72: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web72 of 90

WeST

Methodology for Design Pattern Definition

Identification of most important MPEG-7 functionalities[Arndt et al., 2007]:

Decomposition of multimedia content into segmentsAnnotation of segments with meta data (e.g. visual descriptor, media information, creation & production, …)‏General: • Identify repetitive structures and

describe them at an abstract level• Describe digital data by digital data

at an arbitrary level of granularity

• Additional patterns are needed for:Complex data types of MPEG-7Semantic annotation by using domain ontologies

Interface between reusable multimedia core and domain specific knowledge

Page 73: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web73 of 90

WeST

DOLCE Design Patterns: OIO and D&S

DOLCE is a library of foundational ontologies that provides 2 design patterns (extensions) that are especially important for MPEG-7:

Ontology of Information objects (OIO): Formalization of information exchange

Relevance for multimedia ontology:• MPEG-7 describes digital data (multimedia information objects) with

digital data (annotation) ‏• Digital data entities are information object

Descriptions & Situations (D&S): Formalization of ContextRelevance for multimedia ontology:

• Meaning of digital data depends on context • Digital data entities are connected through computational situations

(e.g. input and output data of an algorithm)• Algorithms are descriptions• Annotations and decompositions are situations that satisfy the rules

of an algorithm / method

Page 74: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web74 of 90

WeST

Descriptions & Situations (D&S)‏

Distinction between:DOLCE ground entities (regions, endurants, perdurants) ‏Descriptive entities (parameters, roles, courses) ‏

DescriptionsFormalize contextDefine descriptive concepts

SituationsAre explained by descriptionsAre settings for ground entities

Page 75: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web75 of 90

WeST

Putting it Together: Decomposition Pattern

Simply see the example on the next slide….

Page 76: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web76 of 90

WeST

Decomposition by Example

Image1 playsRole SegmInput

Segment2 playsRole SegmOutp

Segment4 playsRole SegmOutp

Segment1 playsRole SegmOutp

Segment3 playsRole SegmOutpSegment1 playsRole SegmInput

Via its role in a computational task the different parts may be arbitrarily nested and related to different computing algorithms

Querying for all subparts takes place along a well-defined pattern

Page 77: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web77 of 90

WeST

Methodology

MPEG-7

COMM

Requirements: High Quality MM Ontology

Quality of Ontologies

Quality Measures for Ontologies

Reference Ontologie

Identification of repetitive structuresMPEG-7

Compliance

Pattern definition through

Specialization

Challenge

BuildingBlock

Legend

Repr. of Context

Repr. of Information

Page 78: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web78 of 90

WeST

Modular Architecture

Multimedia ontology consists ofCore module that contains the design patternsModules that specialize the core module for different media typesModules that contain media independent MPEG-7 description tools such as media information or creation & productionData type module that formalizes MPEG-7 data types e.g. matrices, vectors, unsigned-int-5, float-vector, probability-vector, …

DOLCE

Descriptions & Situation

Ontology of Information

Objects

Core

Visual Audio

Datatype

Media

Text / LingInfo

Domain Ontolog

Connected by SemanticAnnotation Pattern

Localization

Multimedia Knowledge (COMM)

Fundamental Knowledge about the World

Knowledge about a specific Domain

Page 79: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web79 of 90

WeST

Does the Multimedia Ontology fulfil the Requirements?Reusability

Easy to queryMPEG-7-Compliance

Design patterns enable therepresentation of description tools

ExtensibilityDesign patterns are media independent possibility to include

• further media types• arbitrary descriptors

Extensions of multimedia ontologywill not affect legacy annotations dueto DOLCE+D&S+OIO

Separation of ConcernsClear separation between domainspecific and multimedia relatedknowledgeLink to Linked Open Data possible!

ModularityModular architecture allows customization

High degree of axiomatizationDesign patterns come with genericaxiomatization that is refined in derivedontology modules

ChurchillRecognizer

Josef StalinRecognizer

PhotoManager

COMM

One such extension has already beendone for Text Annotation.

Another one for compund documentannotation is currently developed!

Content & Media Annotation PatternSemantic Annotation Pattern

See slide before this slide!

OWL-DL version available for download.

Page 80: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web80 of 90

WeST

http://en.wikipedia.org/wiki/Yalta_Conference

World War II

Yalta ...

...History Ontology

“Creating a Multimedia Presentation” Revisited

SR1 SR2 SR3

Winston ChurchillRecognizer

Franklin D. RooseveltRecognizer

Josef StalinRecognizer

ChurchillRooseveltStalin

Sparql: select ?image where {?image plays AnnotatedDataRole. ?x plays SemanticLabelRole.?x rdf:type pol:President }

PhotoManager

AuthoringTool

Page 81: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web81 of 90

WeST

Agenda

Semaplorer: Faceted Browsing of Semantic Multimedia Data

Linked Open Data by Collective Intelligence

Semaplorer Architecture

Linking High-Level and Low-Level Data

Representational Paradigm• COMM – Core Ontology for MultiMedia• F – Event Ontology

Computational Paradigm

Links Location

Persons

Knowledge Tags

low- to midlevel features

xxxxxxxxx

Page 82: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web82 of 90

WeST

Computational Paradigm for Understanding High-level Semantics

Good to have: inside/outside, water,…EXIF!No „one approach fits all“Interesting directions:

Joining logical and probabilistic/fuzzy reasoning, e.g.– S. Dasiopoulou et al., SAMT 2008; Ralf Möller et al.

Joining collective intelligence and implicit high level semantics• Flickr, YouTube, etc.: An unprecedented set of training data!

– Several approaches for joining query log analysis / tags with low-level analysis

• Use semantic data that was not built for your task!

Just my two cents of speculation!

Page 83: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web83 of 90

WeST

Conclusion: Recipe

Link existing data sources

Join existing computationalresources

Take Semantic Web as commondenominator

Invest a couple of person months

There you are!

Canonical representationCOMM – Multimediahttp://comm.semanticweb.org

F – EventsX-COSIMO – CommunicationCOS – Software

Page 84: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web84 of 90

WeST

SSMS-2009

Advertisement: SSMS-2009Summer school on Social Media and Semantics

Primary target audience: PhD students & othersKoblenz, last week in August 2009

Thank you for your attention!http://isweb.uni-koblenz.de

http://btc.isweb.uni-koblenz.de

Page 85: With contributions by many WeST membersstaab/Presentations/...Kernel support vector machines Hong et al. 2000 & Guo et al. 2002: The input data is mapped into higher-dimensional feature

Steffen [email protected]

CV Integration: Multimedia Web85 of 90

WeST

Get COMM

COMM is available online: http://comm.semanticweb.orgOntologyAPI

• All description tools that are present in the COMM can be used in Java applications

Currently supported (MPEG-7) description tools:• All visual low level descriptors (MPEG-7 part 3)‏• All media information descriptors (MPEG-7 part 5, clause 8)‏• Decomposition tools for

– Images (StillRegions, SpatialDecomposition, …) ‏– Videos (VideoSegments, TemporalDecomposition, …) ‏– Text (ASCIITextSegments, ASCIIDecomposition, …)‏

Semantic Annotation