1p2p4mm workshop, vico equense 6. june 2008 information society technologies victory – a...
TRANSCRIPT
1P2P4mm workshop, Vico Equense6. June 2008
Information Society Technologies
VICTORY – a multimodal, cross-platform and distributed multimedia repository
Jernej Trnkoczy
1P2P4mm workshop, Vico Equense6. June 2008
Summary
• VICTORY project introduction• Use case example and requirements• Decisions on system architecture• Technology• Conclusions
1P2P4mm workshop, Vico Equense6. June 2008
VICTORY project
• Audio-VIsual ConTent search retrieval in a distributed p2p repositORY
• IST FP6 project• 9 partners• 425PMs, 3.8MEUR (2.2 – EC
contribution)
1P2P4mm workshop, Vico Equense6. June 2008
Use case example
Automotive industry:• Product designers assigned to design
specific automotive part• Discover and retrieve 3D CAD designs and
other associated content (query by example!)
• Exploit the retrieved content (e.g. modifying, combining, making decisions, etc.) in product development processes
• Publish/share new/modified content.
1P2P4mm workshop, Vico Equense6. June 2008
Main requirements
• Content and context based search of Multipedia objects
• Repository built by the users• Support for mobile devices (mobile phones and PDAs)• Serving industrial applications
– Security (content and services)– Reliability
• Scalability to growing number of users– Distributed data storage– Distributed network bandwidth– Sharing computational power of peers (low level features extraction –
indexing and searching, remote rendering...)
1P2P4mm workshop, Vico Equense6. June 2008
Multipedia – what is it?
• A collection of a 3D object along with its accompanied information (2D views, text, audio, video).
• Multipedia object example:
1P2P4mm workshop, Vico Equense6. June 2008
Content and context based search engine
• Content based search (search by example) = using low-level geometric characteristics
• Context based search = using high-level features – Ontological concepts- annotation options provided by
the system– Text mining and NLP on textual attachments
• Relevance feedback• A complex search engine – standalone
version is 1.5GB installation!
1P2P4mm workshop, Vico Equense6. June 2008
Common p2p search mechanism options
• DHT-based solution• Unstructured overlays (flooding,
random walks...)– 1.5GB search engine on every peer?– Search operation CPU intensive– Completeness of results
• Grid-like approach (Napster) .
1P2P4mm workshop, Vico Equense6. June 2008
Our choice – hybrid architecture
• Grid-like approach for search functionality – Searching and indexing (and other services) only on
reliable, high-power machines with public and static IP addresses – super peers
– Workload management mechanism
• P2P approach for download functionality– Distributed storage – files stored on edge peers– Distributed bandwidth - simultaneous download from
many peers
1P2P4mm workshop, Vico Equense6. June 2008
Architecture
MEP...mobile edge peer
EP...desktop edge peer
MGW...mobile gateway
Indexing
Search
Other
MGW
Indexing super peer
Search
OtherMGW
Super peer
Search
Other
MGW
Super peer
MEP
MEP MEP
MEP
MEPSearch
OtherMGW
Super peer
MEP
P2P communication
EP
EP EP
EP
EP
EP
EP
Lines representing client-server communication
1P2P4mm workshop, Vico Equense6. June 2008
Super peer services
• Searching/Indexing – Indexing performed on only one super peer– The same index on all super peers – periodic index updating– Request Broker
• Relays for peers hidden behind NAT and firewalls• Download service
– Indexing super peer needs to index textual attachments– Super peers act as seeding content providers.
• Gateway for mobile devices• Other services (rendering, DRM, Identity
management etc.)
1P2P4mm workshop, Vico Equense6. June 2008
Edge peer services
• Download service (client and server)• Low level features extraction
1P2P4mm workshop, Vico Equense6. June 2008
Selected technology
• Search engine– Modified Empolis e:IAS– Novel low-level features extraction algorithms
from ITI-CERTH
• P2P functionality based on JXTA protocols
• Client-server communication using web services over HTTP
1P2P4mm workshop, Vico Equense6. June 2008
Conclusions
• Hybrid architecture fits the requirements better than pure P2P architecture:– Sophisticated search engine– Control over performance and reliability– Strict security needs
• Completely distributed (pure P2P) search of this complexity is very difficult (if not impossible) to realize.
1P2P4mm workshop, Vico Equense6. June 2008
Questions