overview of multi-view video coding yo-sung ho; kwan-jung oh; systems, signals and image processing,...
Post on 21-Dec-2015
213 views
TRANSCRIPT
![Page 1: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/1.jpg)
Overview of Multi-view Video Coding
Yo-Sung Ho; Kwan-Jung Oh;Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services. 14th International Workshop on
![Page 2: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/2.jpg)
Outline
Introduction Applications of Multi-view Video Requirements for Multi-view Video Coding Test Data Sets and Test Conditions Joint Multi-view Video Model (JMVM) Conclusion
![Page 3: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/3.jpg)
Introduction
![Page 4: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/4.jpg)
Introduction
Multimedia Demands holography two-view stereoscopic system with special glasses multi-view video
Multi-view Video: FVV, FVT, 3DTV
What is multi-view video? Why we need multi-view video coding (MVC)? MVC has been studied in the past.
MVP, MCP, DCP, MPEG4 MAC, H.263/4
![Page 5: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/5.jpg)
Application of MVC
![Page 6: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/6.jpg)
Application of MVC
Free Viewpoint Television (FTV) Three-dimensional TV (3DTV) Immersive Teleconference
![Page 7: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/7.jpg)
FTV
What is FTV (Free Viewpoint Television)? Application of FTV:
Entertainment Education Sightseeing Surveillance Archive
![Page 8: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/8.jpg)
FTV
![Page 9: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/9.jpg)
3DTV
What is 3DTV? Interaction may not be required
To broadcast on 3DTV
![Page 10: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/10.jpg)
3DTV
![Page 11: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/11.jpg)
3DTV
Capture by various types of multiple cameras 1D parallel 2D parallel 1D arc …etc.
Intermediate view reconstruction (IVR)
![Page 12: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/12.jpg)
Immersive Teleconference
What is immersive teleconference? Interaction
![Page 13: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/13.jpg)
Requirements for Multi-view Video Coding
![Page 14: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/14.jpg)
Requirements for Multi-view Video Coding
Requirements for multi-view video coding: Compression related requirements System support related requirements
![Page 15: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/15.jpg)
Compression Related Requirements
Compression efficiency View scalability Free viewpoint scalability Spatial/Temporal/SNR scalability Backward compatibility Resource consumption Low delay Robustness
![Page 16: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/16.jpg)
Compression Related Requirements
Resolution, bit depth, chroma sampling format
Picture quality among views Temporal random access View random access Spatial random access Resource management Parallel processing
![Page 17: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/17.jpg)
System Support Related Requirements
Synchronization View generation Non-planar imaging and display systems Camera parameters
![Page 18: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/18.jpg)
Test Data Sets and Test Conditions
![Page 19: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/19.jpg)
Test Data Sets and Test Conditions
![Page 20: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/20.jpg)
Test Data Sets and Test Conditions
![Page 21: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/21.jpg)
Joint Multi-view Video Model
![Page 22: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/22.jpg)
Joint Multi-view Video Model
![Page 23: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/23.jpg)
Random Access
GGOP contains
frames. For accessing any frame within a GGOP, we
have to decode maximum number of frames.
b4(S5/T7), following 18 referencing frames.
![Page 24: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/24.jpg)
Time-first coding order
![Page 25: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/25.jpg)
Encoder Complexity
Minimum decoded picture buffer (DPB)
EX: GOP_length=16, number_of_views=8, the DPB size = 42
MVC codec will have the same coding delay as single view video coding since time-first coding is mandated.
![Page 26: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/26.jpg)
GOP structures and view prediction structure
![Page 27: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/27.jpg)
GOP structures and view prediction structure
![Page 28: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/28.jpg)
GOP structures and view prediction structure
![Page 29: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/29.jpg)
Illumination compensation
ICA MC: illumination change-adaptive motion compensation
Macroblocks(MB) mode in h.264/MPEG-4 AVC: Inter 16*16 mode, Direct 16*16 mode (include B_Skip), and P_Skip mode
DVIC: difference value of illumination change ICA ME: illumination change-adaptive motion
estimation
![Page 30: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/30.jpg)
![Page 31: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/31.jpg)
SAD calculation for the motion estimation of S*T blocks:
In order to compensate the illumination change
![Page 32: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/32.jpg)
1
1 Illumination compensated residual signal
![Page 33: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/33.jpg)
Other Technical Issues View-temporal prediction structure
Single video v.s. Multi-view video Three main coding structures
Encode multiple video sequences separately Utilizes inter-view correlation only Utilizes both temporal and inter-view correlation
View interpolation prediction Decoder side disparity estimation Computing depth at encoder side and transmitting this to t
he decoder Motion/Disparity vector coding
Highly correlated each other
![Page 34: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/34.jpg)
Conclusion
![Page 35: Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech](https://reader035.vdocument.in/reader035/viewer/2022062714/56649d575503460f94a35b3a/html5/thumbnails/35.jpg)
Conclusion
The multi-view video includes multi-viewpoint video sequence captured by several cameras at the same time.
Compress multi-view video efficiently MPEG and JVT are leading the
standardization of MVC.