overview of multi-view video coding yo-sung ho; kwan-jung oh; systems, signals and image processing,...

35
Overview of Multi- view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Proces sing, 2007 and 6th EURASIP Confer ence focused on Speech and Image Processing, Multimedia Communicat ions and Services. 14th Internati onal Workshop on

Post on 21-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Overview of Multi-view Video Coding

Yo-Sung Ho; Kwan-Jung Oh;Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services. 14th International Workshop on

Page 2: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Outline

Introduction Applications of Multi-view Video Requirements for Multi-view Video Coding Test Data Sets and Test Conditions Joint Multi-view Video Model (JMVM) Conclusion

Page 3: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Introduction

Page 4: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Introduction

Multimedia Demands holography two-view stereoscopic system with special glasses multi-view video

Multi-view Video: FVV, FVT, 3DTV

What is multi-view video? Why we need multi-view video coding (MVC)? MVC has been studied in the past.

MVP, MCP, DCP, MPEG4 MAC, H.263/4

Page 5: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Application of MVC

Page 6: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Application of MVC

Free Viewpoint Television (FTV) Three-dimensional TV (3DTV) Immersive Teleconference

Page 7: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

FTV

What is FTV (Free Viewpoint Television)? Application of FTV:

Entertainment Education Sightseeing Surveillance Archive

Page 8: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

FTV

Page 9: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

3DTV

What is 3DTV? Interaction may not be required

To broadcast on 3DTV

Page 10: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

3DTV

Page 11: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

3DTV

Capture by various types of multiple cameras 1D parallel 2D parallel 1D arc …etc.

Intermediate view reconstruction (IVR)

Page 12: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Immersive Teleconference

What is immersive teleconference? Interaction

Page 13: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Requirements for Multi-view Video Coding

Page 14: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Requirements for Multi-view Video Coding

Requirements for multi-view video coding: Compression related requirements System support related requirements

Page 15: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Compression Related Requirements

Compression efficiency View scalability Free viewpoint scalability Spatial/Temporal/SNR scalability Backward compatibility Resource consumption Low delay Robustness

Page 16: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Compression Related Requirements

Resolution, bit depth, chroma sampling format

Picture quality among views Temporal random access View random access Spatial random access Resource management Parallel processing

Page 17: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

System Support Related Requirements

Synchronization View generation Non-planar imaging and display systems Camera parameters

Page 18: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Test Data Sets and Test Conditions

Page 19: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Test Data Sets and Test Conditions

Page 20: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Test Data Sets and Test Conditions

Page 21: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Joint Multi-view Video Model

Page 22: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Joint Multi-view Video Model

Page 23: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Random Access

GGOP contains

frames. For accessing any frame within a GGOP, we

have to decode maximum number of frames.

b4(S5/T7), following 18 referencing frames.

Page 24: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Time-first coding order

Page 25: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Encoder Complexity

Minimum decoded picture buffer (DPB)

EX: GOP_length=16, number_of_views=8, the DPB size = 42

MVC codec will have the same coding delay as single view video coding since time-first coding is mandated.

Page 26: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

GOP structures and view prediction structure

Page 27: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

GOP structures and view prediction structure

Page 28: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

GOP structures and view prediction structure

Page 29: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Illumination compensation

ICA MC: illumination change-adaptive motion compensation

Macroblocks(MB) mode in h.264/MPEG-4 AVC: Inter 16*16 mode, Direct 16*16 mode (include B_Skip), and P_Skip mode

DVIC: difference value of illumination change ICA ME: illumination change-adaptive motion

estimation

Page 30: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech
Page 31: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

SAD calculation for the motion estimation of S*T blocks:

In order to compensate the illumination change

Page 32: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

1

1 Illumination compensated residual signal

Page 33: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Other Technical Issues View-temporal prediction structure

Single video v.s. Multi-view video Three main coding structures

Encode multiple video sequences separately Utilizes inter-view correlation only Utilizes both temporal and inter-view correlation

View interpolation prediction Decoder side disparity estimation Computing depth at encoder side and transmitting this to t

he decoder Motion/Disparity vector coding

Highly correlated each other

Page 34: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Conclusion

Page 35: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech

Conclusion

The multi-view video includes multi-viewpoint video sequence captured by several cameras at the same time.

Compress multi-view video efficiently MPEG and JVT are leading the

standardization of MVC.