efficient video browsing

32
Efficient Video Browsing Using Multiple Synchronized Views Heymo Kou

Upload: odette

Post on 23-Feb-2016

46 views

Category:

Documents


0 download

DESCRIPTION

Efficient Video Browsing. Using Multiple Synchronized Views Heymo Kou. Question. What is the two main technologies applied for efficient video browsing? (one for audio, one for visual content). Table of contents. Background Current technology Advanced technology Summary Reference. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Efficient Video Browsing

Efficient Video Brows-ing

Using Multiple Synchronized ViewsHeymo Kou

Page 2: Efficient Video Browsing

What is the two main technologies applied for efficient video browsing? (one for audio, one for visual content)

Question

Page 3: Efficient Video Browsing

Background Current technology Advanced technology Summary Reference

Table of contents

Page 4: Efficient Video Browsing

Growth of digital contents data

Page 5: Efficient Video Browsing

Digital video market growth

Page 6: Efficient Video Browsing

From your◦ Smart phones◦ Notebooks◦ Webcams◦ Digital camera and camcorders◦ Security and monitoring cameras

With advanced streaming technology◦ Fast Internet access◦ MPEG-4 format

Digital video becomes ubiqui-tous

Page 7: Efficient Video Browsing

Search through categories◦ Similar to Internet shopping mall

We search for big categories Then smaller categories …and so on…

User should choose which to browse◦ Should check whether the selected data matches

what user was finding Time consuming!

Manual categorizing and annotation◦ One by one?

Current technology for finding a video data

Page 8: Efficient Video Browsing

Too complicated◦ Lack of efficient algorithm

Time consuming◦ Multimedia calculation ∝ exponential

Inaccuracy◦ Video data is increasing exponentially

Cataloging manual has a somewhat limit point◦ Manually cataloging is done by human hand that

mistakes can be happened

Problem with current video search and browsing technologies

Page 9: Efficient Video Browsing

MPEG-7 Standards Speech indexing Shot Boundary Detection Time Scale Modification of Audio Signals Storyboards, Moving Storyboards and Ani-

mation Adaptive Accelerating Fast Playback Streaming Synchronized Views

Technologies for advancedimage and video retrieval

Page 10: Efficient Video Browsing

Standardized by ISO/IEC◦ International Standard Organization◦ International Electrotechnical Commission

Not a video encoding format XML to store metadata

◦ Attached to timecode in multimedia By this tag

◦ Able to index and search efficiently Yet, improvement is needed

MPEG-7 standard

Page 11: Efficient Video Browsing

Search through speech transcripts◦ Finds familiar metaphor of free text search

Automatic speech recognition (ASR)◦ Indexed transcript → semantic information

Main advantage : Representation◦ Speech is built of words

Speech indexing

Page 12: Efficient Video Browsing

Frame

Key frame

Shot◦ Group of frames which represents similar frames

Definitions

Start key frame end key frame animation

Page 13: Efficient Video Browsing

Context◦ Meaningful information within multimedia data

3 levels of video browsing◦ Browsing a large collection of videos◦ Browsing a ranked list of videos◦ Browsing a single video to find relevant segments

Definitions

Page 14: Efficient Video Browsing

Shot Boundary Detection(SBD) algorithm◦ Completely automatic

Key frames are selected and extracted◦ Saved as JPEG files

High Accuracy and Efficiency◦ Still, fault detection problem is unsolved

Shot Boundary Detection

Page 15: Efficient Video Browsing

SBD algorithm

Page 16: Efficient Video Browsing

Similar to scene selection of dvd

Page 17: Efficient Video Browsing

Audio browsing is as important asvideo browsing◦ Except images, most digital contents are audible

Faster audio browsing is necessary Speeding up of audio signal by

◦ By deleting small audio segments◦ Especially, human speech signals are quasi-peri-

odic

Time Scale Modification ofAudio Signals

Page 18: Efficient Video Browsing

Improvement of TSMTime-Domain Harmonic Scal-

ing(TDHS) technique

Time-Domain, Pitch Synchro-nous Overlap Add

Time Scale Modification(TSM) algorithm

Waveform Synchronous Over-lap(WSOLA)

Page 19: Efficient Video Browsing

Synchronous Overlap-Add SOLA

Page 20: Efficient Video Browsing

Storyboard◦ a set of one or more pages, each consists of a two

dimensional array of key-frames, sorted in chrono-logical order.

Animation◦ a quick slide show, where each of the key-frames is

shown for a fixed short period (e.g., 0.6 seconds) Moving Storyboard (MSB)

◦ the animated key frames, fully synchronized with the original audio track. Each key-frame is shown for the entire duration of the associated shot.

Storyboards, Moving Story-boards and Animation

Page 21: Efficient Video Browsing

Very fast video playback (without audio) Ordinary fast forward depends only on

speed◦ There is a chance to miss important scene

Accelerates until new scene is met Requires less computation load

Adaptive Accelerating Fast Playback

Page 22: Efficient Video Browsing

Image for adaptive fast play-back

Page 23: Efficient Video Browsing

Example in surveillance camera

Real-use of adaptive fast play-back

Page 24: Efficient Video Browsing

Server preprocesses media◦ Keep same media, but different speed encoded

When user selects other speed◦ 1. pause current media◦ 2. open file with same content with selected

speed◦ 3. seek to the corresponding position◦ 4. play the selected view

Needs no extra computational load◦ However, requires more storage: Tradeoff

Streaming Synchronized Views

Page 25: Efficient Video Browsing

Can browse multiple videos at once

Split frames every given time◦ (i.e 10 seconds)

Strong information scent is visible◦ With aggregation of occurrences

Browsing Multiple Videos: MovieDNA

Page 26: Efficient Video Browsing

Image of movieDNA

Page 27: Efficient Video Browsing

Summary of main proper-ties

ViewVisual

AudioTypical

speedup rateStati

cDy-

namic

Full video (w/o TSM) ○ ○ 1 – 2XVideo Skim ○ ○ 2 – 20XSlide show (w/o TSM) ○ ○ 1 – 2XAdaptive Fast Playback ○ 5 – 30XAnimation ○ 10 – 40XStoryboard, mosaic ○ NA

Page 28: Efficient Video Browsing

Streaming synchronized views and movieDNA◦ Less computation, multiple videos at once

Active accelerating fast playback◦ Most useful at analyzing surveillance videos

SBD & TSM◦ Efficient for implementing above technologies

Then, what is current limitation?

Conclusion

Page 29: Efficient Video Browsing

Any questions?

Q & A

Page 30: Efficient Video Browsing

What is the two main technologies applied for efficient video browsing? (one for audio, one for visual content)

Answer : The two main technologies are Shot Boundary Detection(SBD) for visual content and Time Scale Modification(TSM) for audio signals

Answer