1 web viewthe following documents were approved. 9647 doc on iso ... going forward with this...
TRANSCRIPT
INTERNATIONAL ORGANISATION FOR STANDARDISATIONORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC 1/SC 29/WG 11CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC 1/SC 29/WG 11 N9558Antalya, TR – January 2008
Source: Leonardo Chiariglione Title: Report of 83rd meetingStatus
Report of 83rd meeting
1 Opening The 83rd MPEG meeting was held on 14 – 18 January 2008 in Antalya, Turkey.
2 Roll call of participants Annex 1 provides the attendance list
3 Approval of agenda Annex 2 provides the adopted agenda
4 Allocation of contributions Annex 3 provides the list of input contributions.
5 Communications from Convenor The Convenor announced that Jörn Ostermann was appointed as Chairman of the Requirements group.
6 Report of previous meeting This was approved
7 Processing of NB Position Papers NB Position Papers were presented and discussed. Where relevant a response was provided.
1
8 Work plan management
8.1 Media coding
8.1.1 MPEG-2 Main Profile Level for 1080@50/60pThe following documents were approved
9563 Request for 13818-2:2000/Amd.39564 Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080@50/60p
8.1.2 MPEG-4 Visual Simple Studio Profile Levels 5 and 6The following document was approved
9565 Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio Profile Levels 5 and 6
8.1.3 AAC-ELD The following document was approved
9619 Workplan for AAC-ELD Verification Test
8.1.4 New Profiles for Professional Applications
8.1.5 Scalable Video CodingThe following document was approved
9577 Report on SVC Verification Tests
8.1.6 Multiview Video CodingThe following documents were approved
9575 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 9576 Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding9578 Joint Multiview Video Model (JMVM) 79579 JMVM 7 Software9580 Overview of Multiview Video Coding (MVC)
8.1.7 AFX The following document was approved
9649 WD2.0 of AFX 3rd Edition
8.1.8 Frame-based Animated Mesh CompressionThe following documents were approved
2
9647 DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated Mesh Compression)9648 Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Mesh Compression)
8.1.9 Low-complexity 3D mesh compressionThe following documents were approved
9650 Requirements for low-complexity 3D mesh compression9651 CfP for low-complexity 3D mesh compression
8.1.10 Open Font FormatThe following documents were approved
9683 Request for 14496-22 2nd Edition9684 Text of ISO/IEC CD 14496-22 2nd Edition
8.1.11 Codec Configuration RepresentationThe following documents were approved
9584 Study Text of ISO/IEC CD 23001-4 Codec Configuration Representation9585 Reconfigurable Video Coding Requirements V 4.09586 Overview of Reconfigurable Video Coding (RVC)9589 Description of Core Experiments in RVC9590 RVC Simulation Model (RSM) V7.09591 RVC Work Plan and FU Development Status
8.1.12 Video Tool LibraryThe following documents were approved
9587 Study Text of CD ISO/IEC 23002-4 Video Tool Library
9588 Extensions of Video Tool Library under consideration9593 Description of Exploration Experiments in RVC9594 Methodologies for Video Toolbox Extension V2.0
8.1.13 Spatial Audio Object CodingThe following documents were approved
9636 Status and Workplan on SAOC Core Experiments9637 WD on SAOC Text and Reference Software
8.1.14 Post Production Deliverable FormatsThe following documents were approved
9710 Requirements for MPEG Post Production Deliverable Formats
3
9711 Gap Analysis between Post Production Deliverable Requirements and Proposed Working Draft
9712 Text of WD1.0 MPEG Post Production Deliverable Formats
8.1.15 Free Viewpoint TV coding The following documents were approved
9595 Call for Contributions on 3D Video Test Material (Update)9596 Description of Exploration Experiments in 3D Video
8.1.16 Unified speech and audio coding The following documents were approved
9638 Evaluation Guidelines for Unified Speech and Audio Proposals9639 Workplan on Speech and Audio Material Selection 9640 Draft Workplan on Subjective Testing of Unified Speech and Audio Coding Proposals
8.1.17 Media Value Chain OntologyThe following document was approved
9658 Requirements for a Media Value Chain Ontology
8.1.18 Representation of Sensory ExperienceThe following document was approved
9659 Requirements on RoSE Framework
8.2 Composition coding
8.2.1 Scene representationThe following documents were approved
9675 WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems Architecture9676 Request for Amendment of ISO/IEC 14496-119677 ISO/IEC 14496-11 PDAM6 Scene Partitioning
8.2.2 Presentation of Structured InformationThe following documents were approved
9715 Requirements for Presentation of Structured Information9716 Preliminary WD of Presentation of Structured Information
4
8.3 Description coding
8.3.1 Visual Descriptions ExtensionsThe following document was approved
9582 Description of Core Experiments for MPEG-7 New Visual Extensions
8.3.2 Visual Signature Tools The following document was approved
9581 Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools
8.4 IPMPThe following documents were approved
9686 DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile9687 Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile9688 MPEG-21 REL Profiles Software Implementation Plan v.9
8.5 Transport and File formats
8.5.1 Carriage of SVC in MPEG-2 Systems The following documents were approved
9669 Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems9670 Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of AVC
8.5.2 ISO Base Media File FormatThe following documents were approved
9678 Text of ISO/IEC 14496-12 3rd Edition9680 Updated Technology under Consideration for Part 12
8.5.3 AVC File Format extensions for SVCThe following documents were approved
9681 DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension9682 Text of ISO/IEC 14496-15/FDAM2 SVC File Format Extension
8.6 Multimedia architecture
8.6.1 3D Graphics Compression ModelsThe following document was approved
5
9652 Study of CD of ISO/IEC 14496-25
8.6.2 WIM TVThe following document was approved
9717 Requirements on WIM TV
8.6.3 MPEG eXtensible MiddlewareThe following document was approved
9713 Requirements for MXM (MPEG eXtensible Middleware)
8.7 Application formats
8.7.1 MAF generalThe following documents were approved
9689 MAF Overview Document9690 MAF Overview Presentation
8.7.2 Musical Slide Show Application FormatThe following documents were approved
9691 Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition9692 Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format Conf. and Ref.
Software
8.7.3 Media Streaming Application Format The following document was approved
9693 Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming Application Format
8.7.4 Professional Archival MAFThe following documents were approved
9694 Requirements on Professional Archival Application Format9696 Text of ISO/IEC CD 23000-6 Professional Archival Application Format
8.7.5 Open Release Application FormatThe following documents were approved
9697 DoC of ISO/IEC FCD 23000-7 Open Access Application Format9698 Text of ISO/IEC FDIS 23000-7 Open Access Application Format
6
9699 Request of Amendment for ISO/IEC 23000-7 9700 Text of ISO/IEC PDAM1 23000-7 Conformance and Reference
Software
8.7.6 Portable Video Player MAFThe following documents were approved
9701 Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format
9702 Workplan for Portable Video Application Format Conformance and Ref. Soft.
8.7.7 Video Surveillance Application Format The following documents were approved
9705 DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format)9706 Text of ISO/IEC FCD 23000-10 (Video Surveillance Application
Format)9708 Future Work on Surveillance AF's – collection of requirements
8.7.8 Video Stereoscopic Application FormatThe following document wa approved
9709 Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format)
8.8 Reference implementation
8.8.1 Symbolic Music Representation Reference SoftwareThe following documents were approved
9671 DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation Ref. Soft.9672 Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. Soft.
8.8.2 BSAC Extensions Reference SoftwareThe following document was approved
9630 Study on ISO/IEC 14496-5:2001/FPDAM 20, Reference Software for MPEG-1/2 Audio in MPEG-4 and BSAC Extensions
8.8.3 AAC-ELD Reference Software The following document was approved
9629 ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw.
7
8.8.4 SVC Reference SoftwareThe following documents were approved
9572 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 199573 Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video
Coding
8.8.5 3D Graphics Compression Model Reference SoftwareThe following document was approved
9645 ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft)
8.8.6 LASeR Reference SoftwareThe following documents were approved
9673 DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft.9674 Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft.
8.8.7 DMB AF Reference SofwareThe following document was approved
9704 Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference Software
8.8.8 Video Surveillance Application Format Reference SofwareThe following document was approved
9707 Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software
8.8.9 MPEG Surround Reference SoftwareThe following documents were approved
9634 DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround Reference Sw.9635 ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw.
8.9 Conformance
8.9.1 MPEG-2 Main Profile Level for 1080@50/60p ConformanceThe following documents were approved
9583 Request for 13818-4:2004/Amd.39618 Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p Conformance Testing
8
8.9.2 MPEG-4 Visual Simple Profile Level 6 Conformance The following document was approved
9567 Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio Profile Levels 5 and 6 Conformance Testing
8.9.3 Scalable Video Coding Conformance The following documents were approved
9568 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 319569 Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for Scalable Video
Coding
8.9.4 Symbolic Music Representation Conformance The following documents were approved
9625 DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance9626 ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance
8.9.5 Audio Scalable to Lossless Conformance The following documents were approved
9620 DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance9621 ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance
8.9.6 AAC-ELD, OAFI and additional AAC ConformanceThe following document was approved
9624 ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and additional AAC Conformance
8.9.7 Frame-based Animated Mesh Compression ConformanceThe following document was approved
9642 Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC Conformance)
8.9.8 Multiresolution Profile ConformanceThe following document was approved
9643 Study on PDAM of ISO/IEC 14496-4:2004 AMD33 (MultiResolution Profile Conformance)
9
8.9.9 3D Graphics Compression Model ConformanceThe following document was approved
9644 ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance)
8.9.10 DMB AF ConformanceThe following document was approved
9704 Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference Software
8.9.11 Video Surveillance Application Format ConformanceThe following document was approved
9707 Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software
8.9.12 MPEG Surround ConformanceThe following documents were approved
9631 DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround Conformance
9632 ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance9633 Workplan on further issues for MPEG Surround Conformance
8.9.13 Video Tool Library ConformanceThe following document was approved
9592 RVC Conformance Testing Working Draft V4.0
8.9.14 MPEG-4 Audio Conformance RollupThe following document was approved
9627 MPEG-4 Audio Conformance Rollup
8.10 Maintenance
8.10.1 Systems coding standardsThe following document was approved
9679 WD1.0 of Corrigendum on ISO/IEC 14496-12
10
8.10.2 Video coding standards The following documents were approved
9566 Study Text of ISO/IEC 14496-2:2004/DCOR3 9570 Disposition of Comments on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR 19571 Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 19574 Text of ISO/IEC 14496-10:200X/DCOR 1
8.10.3 Audio coding standards The following documents were approved
9622 ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo9623 ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS9628 ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS
8.10.4 3DG coding standardsThe following document was approved
9646 Study of ISO/IEC 14496-16:2006/AMD1/DCOR1
8.10.5 MPEG-21 standardsThe following document was approved
9685 Items for consideration for Corrigendum or Amendment of MPEG-21 DIA
8.10.6 MAF standardsThe following document was approved
9703 Text of ISO/IEC 23000-9/DCOR1 (DMB Application Format)
9 Organisation of this meeting
9.1 Tasks for subgroups The following tasks were assigned
Requirements Carriage of AVS on MPEG-2 Systems IPTV Media value chain ontologies Framework for representation of sensory effects information ? Information exchange with virtual worldsSystems 2 1 3 Carriage of SVC 26 Open Font Format Conformance 27 Laser v.2 conformance
11
5 14 Open Font Format Reference Software 16 Symbolic Music Representation Reference Software 17 Laser Reference Software 15 1 File Format 7 12 Query Format Schemas 21 5 REL amendment OAC 6 Media value chain ontologies 8 1 Reference software Schemas 9 1 Mime type registration 15 1 Security in Event Reporting A 4 1 Musical Slide Show MAF conformance & RS 2 Protected Musical Slide Show MAF 5 1 Media Streaming MAF conformance & RS 6 Professional Archival MAF 7 OA MAF 8 1 Portable Video Player MAF conformance & RS 9 DMB MAF conformance & RS 10 Video Surveillance MAF 11 Stereoscopic MAF E 8 M3W Reference Software and Conformance X Post production delivery format U MPEG eXtensible Middleware V Information exchange with virtual worlds W Framework for representation of sensory effects information Y Joint management of content description and presentation IPTVVideo 2 2 New levels for 1080/60 P support 4 2 5 Studio Profile level 5 and 6 4 4 35 Studio Profile level 5 and 6 Conformance 7 3 3 Image Signature Tools 7 3 4 Video Signature Tools 3 Video augmentation by metadata A 3 1 Photo Player Reference Software 2 Photo Player Conformance B 4 Codec Configuration Description C 4 Video Tool Library 4 1 Video Tool Library Conformance FTVJVT 4 10 New AVC Profiles for Professional Applications Conformance 10 New AVC Profiles for Professional Applications Reference SW 10 Scalable Video Coding Conformance 10 Scalable Video Coding Reference SW 10 1 Multi-View Video Coding 4 10 3 verification tests Audio 4 4 20 SLS conformance
12
29 SMR Conformance 5 16 SMR Reference Software 20 MPEG-1/-2 on MPEG-4 reference software D 1 1 MPEG Surround Reference Software 2 MPEG Surround Conformance 2 Spatial Audio Object Coding X Unified Speech and Audio Coding3DG 4 21 Geometry and shadow Conformance 4 4 32 FAMC (Frame based Animated Mesh Compress.) Conformance 33 Multiresolution profile conformance 3DG Compression Model Conformance 5 13 Geometry and shadow Reference Software 21 FAMC (Frame based Animated Mesh Compress.) Reference software 21 Multiresolution profile Reference Software 3DG Compression Model Reference Software 16 2 Frame-based animated mesh compression 3 3D Multiresolution profile 4 Space partitioning 25 3D Graphics Compression model ? Metaverse
9.2 Joint meetings The following joint meetings were held
Groups What Where Day TimeSys, 3DG Scene partitioning Systems Mon 17:00-18:00Req, Sys IPTV Systems Tue 09:00-09:00Req, Sys, 3DG Metaverse Systems Tue 09:30-10:00Req, Sys Rose Systems Tue 10:00-11:00Req, Sys, Vid AVS Systems Tue 11:00-12:00Vid, JVT, Req MVC, bit depth reqs, FTV JVT Wed 16:00-17:30
10 WG management
10.1 Terms of referenceThe following document was approved
9600 Terms of reference
10.2 OfficersJörn Ostermann was appointed as Chairman of the Requirements Group
10.3 EditorsThe following document was approved
13
9604 Editors of MPEG standards
10.4 LiaisonsThe following liaison statements were issued
9714 Liaison to JPEG on ISO Base Format9718 Response to DVB on File Format9719 Response to DVB on Carriage and Storage of SVC9720 Response to JPEG on Query Format9721 Liaison to JTC1/SWG-ARM on PA Application Format9722 Liaison to SMPTE on PA Application Format9723 Liaison to TC20/SC13 on PA Application Format9724 Liaison to JPEG on PA Application Format9725 Response to JTC1/SC349726 Liaison to ITU-T SG16 on IPTV9727 Liaison to Creative Common on Open Access Application Format 9728 Liaison to SMPTE on Post-Production Deliverables9729 Liaison to NAB on Post-Production Deliverables9730 Liaison to ATSC on Post-Production Deliverables9731 Liaison to MPAA on Post-Production Deliverables9732 Liaison to EBU on Post-Production Deliverables9733 Liaison to IEC TC100 TA6 on Post-Production Deliverables9734 Liaison to IFPI on Post-Production Deliverables9735 Liaison to DMP on Presentation of Structured Information9736 Liaison to ITU-T TC 9 WG43 on Video Surveillance AF9614 Liaison Statement to SMPTE re RVC 9615 Liaison Statement to ITU-T SG 9 re FTV 9616 Liaison Statement to ITU-T SG 9 re Bitstream Splicing9617 Liaison Statement template for various organizations re SVC verification testing report9641 Liaison Statement to ETSI TC DECT9660 Liaison Statement to ITU-T SG 16
10.5 Work item assignment
10.6 Ad hoc groupsThe following ad hoc groups were established
w9664 Ad Hoc Group on Application Format
w9663 Ad Hoc Group on MPEG File Formats
w9665 Ad Hoc Group on Presentation of Structured Information
w9662 Ad Hoc Group on Scene Representation
14
w9661 AHG on 3DG documents and software maintenance
w9653 AHG on Audio Standards Maintenance
w9668 AHG on Font Format Representation
w9613 AHG on FTV
w9655 AHG on Information Exchange with Virtual Worlds
w9597 AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
w9667 AHG on MPEG Query Format
w9599 AHG on MPEG-7 Visual
w9598 AHG on Reconfigurable Video Coding
w9657 AHG on Requirements for Media Value Chain Ontology
w9666 AHG on Requirements for MPEG Post Production Deliverable Formats
w9656 AHG on the RoSE Framework
w9654 AHG on Unified Speech and Audio Coding and SAOC and AAC-ELD
10.7 Asset management The following documents were approved
9605 Schema assets9606 Software assets9607 Conformance assets9608 Content assets9609 URI assets
10.8 IPR managementThe following document was approved
9610 Standards under development for which a call for patent statements is issued
15
10.9 Work planThe following documents were approved
9601 MPEG Standards9602 Table of unpublished FDISs9603 Work plan and time line
11 Administrative matters
11.1 Schedule of future MPEG meetings The following schedule was approved
# City Country yy mm dd-dd83 Antalya TR 08 01 14-1884 Archamps FR 08 04-05 28-0285 Hannover DE 08 07 21-2586 Busan KR 08 10 13-1787 Archamps FR 09 01-02 26-3088 ? US? 09 04 20-2489 London? UK? 09 06-07 29-0390 Xian CN 09 10 26-30
11.2 Promotional activitiesThe following document was approved
9561 Antalya press release
12 Resolutions of this meetingThese were approved
13 A.O.B
14 Closing
16
Annex A – Attendance list
First name Last name Affiliation NBChristian Timmerer Klagenfurt University ATDan Cernea Vrije Universiteit Brussel BEJan De Cock Ghent University BERik Van de Walle Ghent University - IBBT BEKenneth Vermeirsch Ghent University BETouradj Ebrahimi EPFL CHMarco Mattavelli EPFL CHweizhong Chen Huawei technologies CO.,LTD CNDandan Ding Zhejiang University CNSixin Lin Haitao Yang Huawei Tech. Ltd Co. CNTiejun Huang Peking University CNShan Gao Lianhuan Xiong Huawei Technologies Co. Ltd. CNYingjia Liu Huawei CNSiwei Ma Peking University CNHonggang Qi Institute of Computing Technology CNxuemin Wang Huawei technologies CO., LTD CNLianhuan Xiong Huawei CNXiaozhong Xu Tsinghua University CNHaitao Yang Xidian University CNLijing Xu Yingjia Liu HUAWEI Technologies Co., Ltd. CNLu Yu Zhejiang University CNGang Zhu Tsinghua University CNPeter Amon Siemens AG DEGero Bäse Siemens DEJohannes Boehm Thomson DEStefan Doehla Fraunhofer IIS DERalf Geiger Fraunhofer IIS DESebastian Gerke Fraunhofer HHI DEBernhard Grill Fraunhofer IIS DEOliver Hellmuth Fraunhofer IIS DETilman Liebchen LG Electronics DEMarkus Multrus Fraunhofer IIS DEKarsten Müller Fraunhofer HHI DEMatthias Narroschke Panasonic DETobias Oelbaum Technische Universität München DEJens-Rainer Ohm RWTH Aachen DEJoern Ostermann Leibniz Universität Hannover DEThomas Schierl Fraunhofer HHI DEAndreas Schneider Dolby Germany GmbH DEMarkus Schnell Fraunhofer IIS DEFlorian Schreiner Technische Universität München DEHeiko Schwarz Fraunhofer HHI DEAljoscha Smolic Aljoscha Smolic DERalph Sperschneider Fraunhofer IIS DELeonid Terentiev Fraunhofer IIS DE
17
Herbert Thoma Fraunhofer IIS DEThomas Wiegand Fraunhofer HHI DESteffen Wittmann Panasonic DEPablo Carballeira López Universidad Politécnica de Madrid ESJaime Delgado DMAG-UPC ESMarc Gauvin sDae ESLeonardo Lizcano Telefonica R&D ESFrancisco Morán Burgos Universidad Politécnica de Madrid ESYing Chen Tampere University of Technology FIMiska Hannuksela Nokia Corporation FIJani Lainema Nokia FIJustin Ridge Nokia FIKemal Ugur Nokia FIJuha Vartiainen The Finnish Standards Association, SFS FIxianglin wang Nokia Inc. FIBertrand BERTHELOT France Telecom FRYann Bodo Joost Technologies BV FRVincent Bottreau Thomson FRSebastien Brangoulo Joost Technologies BV FRAlice de Casanove Actimagine FRJean-Claude Dufourd Streamezzo FRPatrick GIOIA Orange Labs FRMarc GUEZ VUCHER FRANCE FRJoel Jung Orange - France Telecom R&D FRJean Francois Nezan IETR / INSA FRavaro olivier Streamezzo FRStephane Pateux Orange Labs FRPierrick Philippe France Telecom R&D FRMarius Preda Institut Telecom FRFrançoise PRETEUX Institut TELECOM FRMickael RAULET IETR / INSA FRJerome vieron Thomson R&D France FRDavid Virette France Télécom FRPierfrancesco Bellini University of Florence ITsabina brufani SISVEL ITLeonardo Chiariglione Cedeo.net ITGiovanni Cordara Telecom Italia Lab ITKohtaro Asai Mitsubishi Electric JPYukihiro Bando NTT JPTakeshi Chujoh Toshiba Corporation JPToshiaki Fujii Nagoya University JPNoboru Harada NTT JPTakashi Ito Fujitsu Laboratories Ltd. JPKota Iwamoto NEC Corporation JPHideaki Kimata NTT JPNAOKI KOBAYASHI NTT JPTakuyo Kogure Matsushita Electric Ind. Co. Ltd JPTakehiro Moriya NTT JPTokumichi Murakami Mitsubishi Electric JP
18
Hiroya Nakamura JVC (Victor Company of Japan, Limited) JPTakahiro Nishi Matsushita Electric (Panasonic) JPToshiyuki Nomura NEC JPTakeshi Norimatsu Panasonic JPYukiko Ogura IPSJ/ITSCJ JPShun-ichi Sekiguchi Mitsubishi Electric Corporation JPTakanori Senoh National Institute of Info & Comm Tech JPOsamu Shimada NEC Corporation JPShinya Shimizu NTT JPAkihiko Sugiyama NEC Corporation JPTeruhiko Suzuki Sony Corp. JPMasashi Takahashi Hitachi Ltd JPTK Tan NTT DoCoMo, Inc. JPMasayuki Tanimoto Nagoya University JPAkiyuki Tanizawa Toshiba Corporation JPYasuhiro Toguri Sony Corporation JPYoshihisa Yamada Mitsubishi Electric JPTOMOO YAMAKAGE TOSHIBA Corporation JPTomoyuki Yamamoto Sharp JPTakahiro Yamasaki Oki Electric Industry Co., Ltd. JPTomonobu Yoshino KDDI JPHyouk Jean Cha LG Electronics KRJihun Cha ETRI KRSuhee Cho ETRI KRYoonsik Choe Yonsei University KRbumsuk choi ETRI KRHaechul Choi ETRI KRJin Soo Choi ETRI KRMiran Choi ETRI KRWOONG IL CHOI Samsung KRYungho Choi Konkuk University KRDong-Hoon Han Sejong University KRJong-Ki Han Sejong University KRKi-Hun Han Sejong University KRYO-SUNG HO GIST KRSeoYoung Hwang Samsung Electronics CO., LTD KRByeong-Moon Jeon LG Electronics KRByeungwoo Jeon SKKU KRYong-Joon Jeon LG Electronics KRDong-Seok Jeong Inha University KRjechang jeong hanyang university KRSeyoon Jeong ETRI KRJie Jia Sejong University KRSanghyun Joo ETRI KRYang-Won Jung LG Electronics KRJung Won Kang ETRI KRChan=Young Kim VARO VISION KRDAEYEON KIM Sejong university KRDaiyong Kim Hanyang University KR
19
Dong Soo Kim LG Electronics KRHa Yoon Kim SK Telecom KRHae Kwang KIM Sejong University KRHansang Kim Samsung Electronics KRHui Yong Kim ETRI KRHyungyu Kim Hanyang University KRInkwon Kim VARO VISION CO., LTD KRJae-Gon Kim Korea Aerospace University KRJong Lak Kim DSP Group KRJONGYOUN KIM Net&TV KRJungHoe Kim Samsung AIT KRKyuheon Kim Kyung Hee Univ. KRMiyoung Kim Samsung AIT KRMunchurl Kim Information and Communications University KRSeong-wan Kim Yonsei Univ. KRSikyung Kim Hangyang Univ. KRYong-Goo Kim Yonsei Univ. KRYong Han Kim University of Seoul KRYong Tae Kim Samsung Electronics KRSeung Ryong Kook Kyunghee University KREunkyung Kwak HUMAX KRAlex Lee Humax Co., Ltd KRHyobin Lee Yonsei Univ. KRJangwon Lee Kyung Hee Univ. KRSANG HOON LEE DSPG KRSinwook Lee Hanyang University KRYoonjin Lee Kyunghee UNIV. KRYung-Ki Lee Sejong University KRYung Lyul Lee Sejong University KRJungEun Lim LG Electronics KRTaebeom Lim KETI KRYoung-Kwon Lim net&tv Inc. KRJoo Hee Moon Sejong University KRHenney Oh LG Electronics KRKwan-Jung Oh GIST KRWeon Geun Oh ETRI KRGwang-Hoon Park Kyung Hee University KRHyoungMee Park Sejong University KRJiho Park KETI KRJongtae Park Kyunghee Univ, KRJoonyoung Park LG Electronics KRJUKYUNG PARK Net&TV KRMin Cheol Park Sejong University KRMin Woo Park Kyunghee University KRSeung-Wook Park LG Electronics KRMuhammad Syah Houari Sabirin Information and Communications University KRJeongil Seo ETRI KRSeungYong Shim Sejong Univ. KRDong-Gyu Sim Kwangwoon University KR
20
hyung sik(sean) suh LG ELECTRONICS Inc. KRLim Sung-Chang Sejong University KRHendry Tan Information and Communications University KRGi-Mun Um ETRI KRKwanghyun Won Sungkyunkwan University KRSHIM WOO SUNG Samsung electronics KRJungyoup Yang Sungkyunkwan University KRJEONG-JU YOO ETRI KRYoungJoe Yoo Sejong Univ. KRDaeil Yoon Sejong University KRKyoungro Yoon Konkuk University KRSungyong Yoon LG Electronics KRKug Jin Yun ETRI KRFons Bruls Philips NLWiebe de Haan Philips NLJean H.A. Gelissen Philips Research NLWerner oomen Philips Applied Technologies NLGisle Bjontegaard Tandberg NOArild Fuldseth Tandberg NOMarek Domanski Poznan University of Technology PLKrzysztof Klimaszewski Poznań Univ. of Technology PLMarian Muczko Telekomunikacja Polska S.A. PLLukasz Pikula Telekomunikacja Polska S.A. PLKenneth Andersson Ericsson AB SEPer Fröjdh Ericsson SEKristofer Kjörling Coding Technologies AB SEHeiko Purnhagen Coding Technologies AB SEAnisse Taleb Ericsson AB SELekha Chaisorn Institute for Infocom Research SGTi Eu Chan Institute For Infocomm Research (A*STAR) SGFarzam Farbiz A*STAR Insitute for Infocomm Research SGWei Siong Lee Institute for Infocomm Research SGChong Soon Lim Panasonic Singapore Labs SGCorey Manders A*STAR Institute for Infocomm Research SGWei Yao Institute for Infocomm Research SGYongwei Zhu Institute for Infocomm Research SGJames Annesley Kingston University UKTanya Beech QintiQ UKMiroslaw Bober Mitsubishi Electric ITE-VIL UKPaul Brasnett Mitsubishi Electric ITE-VIL UKLeszek Cieplinski Mitsubishi Electric UKCatherine Grant Nine Tiles UKMike Nilsson BT UKMadhukar Budagavi Texas Instruments Inc. USYi-Jen Chiu Intel Corp. USOscar Divorra Escoda Thomson USAlex Eleftheriadis Vidyo, Inc. USCristina Gomila Thomson USOnur Guleryuz DoCoMo USA Labs US
21
Michael Horowitz Vidyo, Inc. USShih-Ta Hsiang Motorola, Inc. USYi Hu Conexant Systems USWalt Husak Dolby / SMPTE USFaisal Ishtiaq Motorola USMarta Karczewicz Qualcomm USGwo Giun (Chris) Lee National Cheng Kung University USVladimir Levantovsky Monotype Imaging Inc. USHe-Yuan Lin National Cheng Kung University USJulie Lofton Hot Potato, Inc. USAjay Luthra Motorola USKyle McAdoo Conexant Systems USSam Narasimhan Motorola USPurvin Pandit Thomson USSchuyler Quackenbush Audio Research Labs USArturo Rodriguez Scientific Atlanta, a Cisco Company USJesus Sampedro Polycom USAndrew Segall Sharp Labs of America USSuman Sharma Intel Corporation USDavid Singer Apple Inc., USA USGary Sullivan Microsoft Corp. USHuifang Sun Mitsubishi Electric Research Labs USPankaj Topiwala FastVDO USAnthony Vetro Mitsubishi Electric USXin Wang ContentGuard, Inc. USYong Yu Broadcom Corp US
22
Annex B – Agenda
Item
1
Opening
2
Roll call of participants
3
Approval of agenda
4
Allocation of contributions
5
Communications from Convenor
6
Report of previous meeting
7
Processing of NB Position Papers
8
Work plan management
1 1 Media coding
2 MPEG-4 Visual Simple Profile Level 6
3 AAC-ELD
4 New Profiles for Professional Applications
5 Scalable Video Coding
6 Multiview Video Coding
7 Geometry and Shadow
8 Binary Format for XML (Prefixes and Wild Card extensions)
9 Bitstream Syntax Description Language
10 Fixed point implementation of DCT/IDCT
11 Video Tool Library
12 Spatial Audio Object Coding
13 Free Viewpoint TV coding
14 Audio and speech coding
23
15 Ontology
16Video coding exploration
2
Composition coding
1 Lightweight Scene Representation
3
Description coding
1 Schema definition
2 Visual Descriptions Extensions
3 Visual Signature Tools
4 Technologies for digital photo management using MPEG-7 visual tools
5 Improvements to Geographic Descriptor
6 MPEG-7 Query Format
4
Systems support
1 Fragment Request Unit
5
IPMP
1 REL MAM (Mobile And optical Media) Profile
2 REL DAC (Dissemination And Capture) Profile
3 REL ORC (Open Release Content) Profile
4 IPMP XML Messages
6
Digital Item
1 Schema files for MPEG-21 standards
2 Security in Event Reporting
3 Review of DI
7
Transport and File formats
1 Carriage of SVC in MPEG-2 Systems
2 Transport of MPEG Surround data in AAC
24
3 MP4FF box for Original Audio File Information
4 File Format extensions for Description of Timed Metadata
5 Flute Hint Track
6 AVC File Format extensions for FRExt
7 AVC File Format extensions for SVC
8 AVC File Format extensions for MVC
9 Digital Item File Format
10 Digital Item Streaming
8
Multimedia architecture
1 Codec Configuration Representation
2 3D Graphics Compression Models
3 Media Streaming MAF Protocols
4 IPTV
5 Extensible Multimedia Platform
6 Metaverse
9
Application formats
1 Musical Slide Show Application Format
2 Media Streaming Application Format
3 Professional Archival MAF
4 Open Release Application Format
5 Portable Video Player MAF
6 Digital Multimedia Broadcasting Application Format
7 Video Surveillance MAF
8 Stereoscopic MAF
9 Cross media interactive presentation
25
10
Reference implementation
1 Symbolic Music Representation Reference Software
2 MPEG-1 and -2 on MPEG-4 Reference Software
3 BSAC Extensions Reference Software
4 Reference Hardware Description
5 New Profiles for Professional Applications Reference Software
6 SVC Reference Software
7 File Format Reference Software
8 Geometry and Shadow Reference Software
9 Frame-based Animated Mesh Compression Reference Software
10 MPEG-J GFX Reference Software
11 LASeR Reference Software
12 Open Font Format Reference Software
13 MPEG-7 Systems Reference Software
14 MPEG-21 REL Reference Software
15 Photo Player MAF Reference Software
16 Musical Slide Show MAF Reference Software
17 Binary MPEG format for XML Reference Software
18 Prefixes and wild card extensions Reference Software
19 MPEG Surround Reference Software
20 M3W Reference Software
11
Conformance
1 Audio BIFS v3 Conformance
2 Symbolic Music Representation Conformance
3 MPEG-4 Visual Simple Profile Level 6 Conformance
26
4 New Profiles for Professional Applications Conformance
5 SVC Profiles Conformance
6 MPEG-1 and -2 Audio in MPEG-4 Conformance
7 BSAC Conformance
8 1-bit Oversampled Audio Conformance
9 Audio Scalable to Lossless Conformance
10 File Format Conformance
11 Geometry & Shadow Conformance
12 Frame-based Animated Mesh Compression Conformance
13 MultiResolution Profile Conformance
14 Synthesized Texture Conformance
15 MPEG-J GFX Conformance
16 Laser Conformance
17 Open Font Format Conformance
18 Perceptual 3D Shape Conformance
19 Improvements to Geographic Descriptor Conformance
20 Binary MPEG format for XML Conformance
21 MPEG Surround Conformance
22 M3W Conformance
23 Video Tool Library Conformance
12
Maintenance
1 Systems coding standards
2 Video coding standards
3 Audio coding standards
4 Visual description coding standards
27
5 Audio description coding standards
6 MDS standards
9
Organisation of this meeting
1
Tasks for subgroups
2
Joint meetings
10
WG management
1
Terms of reference
2
Officers
3
Editors
4
Liaisons
5
Work item assignment
6
Ad hoc groups
7
Asset management
1 Reference software
2 Conformance
3 Test material
4 URI
8
IPR management
9
Work plan
11
Administrative matters
1
Responses to National Bodies
2
Schedule of future MPEG meetings
3
Promotional activities
12
Resolutions of this meeting
13
A.O.B
28
14
Closing
29
Annex C– Input contributions
Number Source Title
m15029 Webmaster Antalya document register
m15030 Noboru HaradaHendry
Ad Hoc Group on Professional Archival Application Format
m15031 Jaime DelgadoXin Wang
Ad Hoc Group on Requirements of Media Value Chains Ontologies
m15032 Yi-Shin TungTeruhiko Suzuki
Ad Hoc Group on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
m15033Euee S. JangMarco MattavelliYoshihisa Yamada
Ad Hoc Group on Reconfigurable Video Coding
m15034Miroslaw BoberRyoma OamiRobert O'Callaghan
Ad Hoc Group on MPEG-7 Visual
m15035 Miroslaw BoberThomas Wiegand Ad Hoc Group on Video Augmentation by Metadata
m15036 Hideaki KimataKarsten Müller Ad Hoc Group on Free-Viewpoint Television
m15037 Young-Kwon LimJean Lefeuvre Ad Hoc Group on Scene Representation
m15038 David SingerVisharam Mohammed Ad Hoc Group on MPEG File Formats
m15039Kyuheon KimHui Yong KimJean Cha
Ad Hoc Group on Application Format
m15040Young-Kwon LimJihun ChaJean-Claude Dufourd
Ad Hoc Group on Digital Item Presentation
m15041 Tobias OelbaumMathias Wien Ad Hoc Group on SVC Verification Test
m15042 Jeong-Hwan AhnNikolce Stefanoski
Ad Hoc Group on 3DG documents, experiments and software maintenance
m15043 R. Sperschneider Ad Hoc Group on Audio Standards Maintenance
m15044 S. QuackenbushEunmi Oh
Ad Hoc Group on Unified Speech and Audio Coding and SAOC
30
m15045 Julie Lofton Ad Hoc Group on Requirements for MPEG Post Production Deliverable Formats
m15046 Kyoungro YoonMario Doeller Ad Hoc Group on MPEG Query Format
m15047
Taka [email protected]@[email protected]@nict.go.jp
Consideration of Depth Format
m15048 SC 29 Secretariat Liaison Statement from SMPTE [SC 29 N 8899]
m15049 SC 29 Secretariat Liaison Statement from DVB [SC 29 N 8901]
m15050 SC 29 Secretariat Table of Replies on ISO/IEC 23001-1:2006/FDAM 1 [SC 29 N 8902]
m15051 SC 29 Secretariat Table of Replies on ISO/IEC 14496-4:2004/FDAM 14 [SC 29 N 8909]
m15052 SC 29 Secretariat Table of Replies on ISO/IEC 14496-4:2004/FDAM 18 [SC 29 N 8910]
m15053 SC 29 Secretariat Table of Replies on ISO/IEC 14496-4:2004/FDAM 19 [SC 29 N 8911]
m15054 SC 29 Secretariat Table of Replies on ISO/IEC 14496-1:2004/FDAM 3 [SC 29 N 8912]
m15055 SC 29 Secretariat Table of Replies on ISO/IEC 21000-4:2006/FDAM 1 [SC 29 N 8913]
m15056 SC 29 Secretariat Liaison Statement from ITU-T SG 9 [SC 29 N 8919]
m15057 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-16:2006/PDAM 2 [SC 29 N 8920]
m15058 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 29 [SC 29 N 8925]
m15059 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 16 [SC 29 N 8926]
m15060 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 17 [SC 29 N 8927]
m15061 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR 1 [SC 29 N 8938]
m15062 SC 29 Secretariat IEC CDV 62360 [SC 29 N 8941]
m15063 SC 29 Secretariat Liaison Statement from SC 29/WG 1 [SC 29 N 8956]
m15064 SC 29 Secretariat Liaison Statement from SC 29/WG 1 [SC 29 N 8957]
31
m15065 SC 29 Secretariat Table of Replies on ISO/IEC 14496-5:2001/FDAM 11 [SC 29 N 8960]
m15066 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-15:2004/FPDAM 2 [SC 29 N 8961]
m15067 SC 29 Secretariat Table of Replies on ISO/IEC FDIS 21000-14 [SC 29 N 8980]
m15068 SC 29 Secretariat ISO/IEC FCD 19776-1 2nd Edition [SC 29 N 8985]
m15069 SC 29 Secretariat ISO/IEC FCD 19776-3 2nd Edition [SC 29 N 8986]
m15070 SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23001-2 [SC 29 N 9001]
m15071 SC 29 Secretariat ISO/IEC CD 19775-2 2nd Edition [SC 29 N 9002]
m15072 SC 29 Secretariat Liaison Statement from ITU-T SG 9 [SC 29 N 9004]
m15073 SC 29 Secretariat Table of Replies on ISO/IEC 14496-12:2005/FDAM 2 [SC 29 N 9006]
m15074 SC 29 Secretariat Table of Replies on ISO/IEC FDIS 14496-23 [SC 29 N 9007]
m15075 SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23000-2 (2nd Edition) [SC 29 N 9033]
m15076 SC 29 Secretariat Liaison Statement from ITU-T IPTV Focus Group (FG IPTV) [SC 29 N 9034]
m15077 SC 29 Secretariat Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N 9035]
m15078
Pierfrancesco BelliniPaolo NesiGiorgio ZoiaMaurizio Capanai
Editors Study on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance
m15079
Pierfrancesco BelliniPaolo NesiGiorgio ZoiaMaurizio Campanai
Editor Study on ISO/IEC 14496-5:2001/FPDAM 16 Symbolic Music Representation reference software
m15080Gwo Giun LeeHe-Yuan LinMing-Jiun Wang
Functional units of AVC inter-prediction for adaptive interlace coding
m15081 Andy Tescher for the USNB USNB Contribution: Proposed amendment to ISO/IEC 14496-22
m15082Simon DanielsMichelle HillVladimir Levantovsky
The proposal for amendment of ISO/IEC 14496-22 (in support of USNB comment m15081)
m15083 Sanghyun Joo Requirements on RoSE Framework
32
Bumsuk ChoiMunchurl Kim
m15084Benoit Le BonhommeMarius PredaFrançoise Preteux
Online platform for 3D graphics compression benchmarking
m15085Blagica JovanovaMarius PredaFrancoise Preteux
Software Implementation for P25
m15086Blagica JovanovaMarius PredaFrancoise Preteux
Conformance dataset for P25
m15087Ivica ArsovMarius PredaFrancoise Preteux
MPEG-4 3D graphics player for N93 and N95
m15088Masayuki TanimotoToshiaki FujiiKazuyoshi Suzuki
Available Technologies for FTV
m15089
Masayuki TanimotoToshiaki FujiiKazuyoshi SuzukiNorishige Fukushima
Contribution of Nagoya University on FTV Test Material
m15090Masayuki TanimotoToshiaki FujiiKazuyoshi Suzuki
Improvement of Depth Map Estimation and View Synthesis
m15091 Jean GelissenMark Verberkt Requirements on Framework for RoSE
m15092
[email protected]. Marc [email protected]. Jaime [email protected]. Victor Rodriguez
A Common Core IP Model
m15093 Per FröjdhDavid Singer Proposed re-structured ISO Base Media File Format
m15094 S. Quackenbush 82nd MPEG Audio Report
m15095 S. Quackenbush Collected Set of Possible Evaluation Guidelines
m15096 S. Quackenbush Draft Workplan for Testing of SA Proposals
m15097
Teruhiko SuzukiNick SaundersJohn StonePaul Gardiner
Proposal for MPEG-4 visual studio profile level 5 and 6
33
m15098
Gang ZhuXiaozhong XuPing YangYun He
Inter-View Skip Mode for FTV using Depth Information
m15099 Teruhiko Suzuki Proposal for MPEG-4 visual studio profile conformance testing
m15100Teruhiko SuzukiAjay LuthraYi-Jen Chiu
Proposal of new level to support 1080@50p/60p for MPEG-2 video
m15101
Aljoscha SmolicHeribert BrustKarsten Mueller Marcus MuellerThomas Wiegand
Corrected Camera Parameters for N9468 ?Call for Contributions on FTV Test Material?
m15102
Ingo FeldmannMarcus MuellerFrederik ZillyRalf TangerKarsten MuellerAljoscha SmolicPeter KauffThomas Wiegand
Progress Report on 3DTV Video Acquisition
m15103
Sangki KimHyobin LeeSeongwan KimSangyoun LeeMyungil Gil JangHyun Ki KimJeong Heo jeong
CE Report for VCE-5
m15104 Kota IwamotoRyoma Oami Text/Logo Mask Image Generation Software for VCE-7
m15105 Julie LoftonJeff Steele MPEG-M under the MPEG 21 Reflector
m15106
Weon Genu OhDaeil YoonJie JiaHae Kwang Kim
Contribution of video test material for MPEG-7 video signature CE
m15107Kenji OtoiYoshihisa YamadaKohtaro Asai
Proposed text of the RVC FUs for MPEG-2
m15108 [email protected] Subjective results for the SVC Verification Test
m15109 Ruben TousJaime Delgado
Proposal of Reference Software for MPQF. Validation of embedded XQuery expressions.
34
m15110
Osamu ShimadaToshiyuki NomuraAkihiko SugiyamaOsamu Hoshuyama
A core experiment proposal for an additional SAOC functionality of separating real-environment signals into multiple objects
m15111 Yang-Won JungHenney Oh A proposed CE on object parameter estimation in SAOC
m15112 Henney OhYang-Won Jung Comments on SAOC applications and architectures
m15113
Sinwook LeeJaebum JunByeongjun KimChungku YieEuee S. Jang
The results of RVC CE 1.2
m15114
Ju-Kyong JinWeon-Geun OhDong-Jin SeoSang-il NaJae-Hyun HuhDong-Seok Jeong
Proposal on Frame-Reduction video clip format
m15115 James Annesley Late UKNB comments on the Study of CD for the Video Surveillance Application Format R.1
m15116 Sang-Beom LeeKwan-Jung Oh -based Multi-view Depth Map Estimation for FTV
m15117
Byeongjun KimJaebum JunHyungyu KimChungku YieEuee S. Jang
Study of Application Requirements Related to RVC
m15118Miyoung KimEunmi OhJungHoe Kim
Comments on Unified Speech and Audio CfP Evaluation Guidelines
m15119Yo-Sung HoSang-Beom LeeKwan-Jung Oh
Segment-based Multi-view Depth Map Estimation for FTV
m15120Yo-Sung HoSang-Tae NaKwan-Jung Oh
Virtual View Synthesis for FTV
m15121 Tilman Liebchen Update of ALS Conformance
m15122 James Annesley Errors and Corrections for MPEG-7: Part 3 - Visual Reference Software
m15123 Oliver HellmuthJohannes Hilpert
Information and Verification Results for CE on Karaoke/solo System Improving Performance of MPEG
35
Andreas HölzerLeonid TerentievCornelia Falch
SAOC RM0
m15124 Houari SabirinMunchurl Kim
Use cases for content protection in Musical slide show Application Format 2nd Edition
m15125
Hyungyu KimSikyung KimMyungjoong LeeChungku YieEuee S. Jang
The results of RVC CE 1.1
m15126 HendryMunchurl Kim Proposed Editorial Update for ISO/IEC 23000-6 WD 1.0
m15127Hyouk-Jean ChaTae Hyeon KimJisoo Hong
Editor's study text of ISO/IEC 23000-4/PDAM1 Musical slide show application format
m15128 HendryMunchurl Kim
Proposal for Pre-Processing Tool Location Reference in Professional Archival Application Format
m15129HendryHouari SabirinMunchurl Kim
Set of MPEG-7 Tools for Professional Archival Applications Format
m15130
Weon-Geun OhAyoung ChoWon-Keun YangIk-Hwan ChoJu-Kyong JinJun-Woo LeeDong-Seok Jeong
Experiment Results of Image Signature for Complex Conditions
m15131
Weon-Geun Oh.Won-Keun Yang.Ayoung Cho.Dong-Seok Jeong
The Extra Experiment Result to Verify the Method of Performance Measure on MPEG-7 VCE-6
m15132 Mathias Wien Verification of new SVC Verification Test Streams
m15133 Jani PeltotaloMiska M. Hannuksela Proposed corrections to ALC/FLUTE server file format
m15134 Jani PeltotaloMiska M. Hannuksela Proposed additions to ALC/FLUTE server file format
m15135 Christian Timmerer MPEG-21 schema assets update
m15136 Florian Schreiner Study Text of ISO/IEC FCD 23000-7 Open access application format
m15137 Min-Jeong LeeHeung-Kyu Lee Cross verification result for ETRI VCE-6 proposal
36
m15138Ingo KoflerChristian TimmererHermann Hellwagner
Multiple MPEG-21 DIA AdaptationQoS Descriptions within a Digital Item
m15139
[email protected] Zheng [email protected] Tiejun Huangyhtian@ @pku.edu.cn Yonghong Tian
Video Signature based on Inter-frame Correlation Coefficients
[email protected]@[email protected]
Visual Signature based on Waston Perceptual Model
m15141Hyouk-Jean ChaTae Hyeon KimHerbert Thoma
Editor's study text of ISO/IEC 23000-8/FCD Portable video application format
m15142
Jihun ChaInjae LeeYoung-Kwon LimKyungAe MoonJinwoo Hong
Considerations on Integrating LASeR and DID Technologies for WIM TV
m15143
Jeongil SeoSeungkwon BeackKwang-ki KimKyoungok Kang
CE on efficient decoding of a controllable object and an MBO
m15144
Jeongil SeoSeungkwon BeackKwang-ki KimKyeoungok Kang
Consideration on enhanced Karaoke processing for stereo FGO
m15145 Oliver WuebboltJohannes Boehm Thoughts on Speech and Audio Evaluation Guidelines
m15146 Stefan DöhlaMiska M. Hannuksela MPEG2-TS and RTP reception hint tracks
m15147 Stefan DöhlaMiska M. Hannuksela
Extended sample grouping mechanism for the ISO Base Media File Format
m15148 Jani PeltotaloMiska M. Hannuksela
Proposed conformance files for ALC/FLUTE server file format
m15149Khaled MamouTitus ZahariaFrançoise Prêteux
FAMC decoder conformance
m15150
Khaled MamouTitus ZahariaMarius PredaFrançoise Prêteux
FAMC integration into the MPEG-4 RefSoft
37
m15151 Markus SchnellRalf Geiger Update on AAC-ELD Verification Test
m15152 Gero Bäse Study Text of ISO/IEC 23000-10/CD Video Surveillance Application Format
m15153
Khaled MamouTitus ZahariaMarius PredaFrançoise Prêteux
Low-complexity approach for static mesh compression
m15154 Andreas Schneider Update on MPEG Surround Conformance
m15155 Werner OomenErik Schuijers
Evaluation criteria and test items for unified speech and audio coding
m15156
Dandan DingMarco MattavelliChristophe LucarzLu Yu
Update of Classification of Tokens for FUs of MPEG-4 SP and MPEG-4/AVC in RVC Framework
m15157 James AnnesleyJames Orwell
Video Surveillance Application Format: Reference Software
m15158 Kristofer KjörlingHeiko Purnhagen
Homework according to the joint speech and audio workplan
m15159
Christophe LucarzDandan DingJianjun LiMarco Mattavelli
BSDL Description of MPEG-4 SP and AVC BP Bitstream Syntax for RVC Framework
m15160 Kristofer KjörlingHeiko Purnhagen
Thoughts on evaluation criteria for joint speech and audio workitem
m15161 Andreas SchneiderHeiko Purnhagen
Proposed correction to PS conformance and reference software
m15162 Jonas Engdegard Cross Verification of SAOC CE on Karaoke enhancement
m15163
Christophe LucarzJianjun LiMarco MattavelliDandan Ding
Auto-generation of RVC Parser from BSDL Syntax Description: Variable Length Decoding
m15164
Christophe LucarzJianjun LiMarco MattavelliDandan Ding
Functional Units for RVC Toolbox: Variable Length Decoding
m15165Ralf GeigerMarkus MultrusBernhard Grill
Comments on Speech and Audio Evaluation Guidelines
m15166 Dandan Ding Function Units for Conversion from Syntax to Sequence
38
Christophe LucarzMarco MattavelliLu Yu
of Tokens: BTYPE
m15167
M. RauletG. RoquierM. WipliezJF. NezanO. Deforges
Update of CAL2C code generation
m15168 Florian Schreiner Open Access Application Format: Reference Software
m15169 Paul BrasnettMiroslaw Bober Correction to Image Signature XM Software
m15170 Paul BrasnettMiroslaw Bober
Performance Evaluation of Image Signature on Extended Database
m15171 Florian Schreiner GENB comments on the Study of the MPEG-21 REL Open Access Profile FPDAM
m15172 Paul BrasnettMiroslaw Bober
Extending the Trace Transform Image Signature to Complex Conditions
m15173Dave SingerYe-Kui WangThomas Rathgen
Editors' Input to ISO/IEC 14496-15/FPDAM 2 (SVC File Format)
m15174 Olgierd Stankiewicz.Krzysztof Wegner. Depth Map Estimation Software
m15175 Olgierd Stankiewicz.Krzysztof Wegner. Depth Map Estimation Software
m15176Masanori SanoHideki SumiyoshiNobuyuki Yagi
Paging function in MPEG Query Format
m15177Masanori SanoHideki SumiyoshiNobuyuki Yagi
Interpretation Consistency for SpatialQuery and TemporalQuery
m15178 David Singer Codec-independent color information in part 12 files
m15179 DW Singer Backwards-compatibility for alternate groups
m15180 Manuela SchinnRalph Sperschneider WD on Audio part of MPEG-4 Conformance
m15181 Karol Wnukowicz Cross verification result of Image Signature (VCE-6)
m15182Noboru HaradaTakehiro MoriyaYutaka Kamamoto
Updated requirements on Professional Archival Application Format
m15183 Noboru Harada Proposed update of MPEG-4 ALS reference software for
39
Takehiro MoriyaYutaka Kamamoto OAFI
m15184 Hyouk-Jean Cha Proposed workplan for Portable video application format conformance
m15185
Shun-ichi SekiguchiKenji OtoiYoshihisa YamadaKohtaro AsaiTokumichi Murakami
4:4:4 video coding performance with adaptive motion vector coding
m15186 KNB KNB Comments on RVC
m15187Hui Yong KimHouari SabirinMunchurl Kim
Proposed text of ISO/IEC 23000-9/PDAM1 DMB AF: Conformance and Reference software
m15188
Hui Yong KimMyungSeok KiGun BangYong Han Kim
Proposed text of ISO/IEC 23000-9/DCOR1 DMB AF: timescale of TS
m15189
Hui Yong KimGun BangMyungSeok KiHan-Kyu LeeYong Han Kim
Proposed WD on 14496-12 ISO-FF Amendment: MPEG-2 TS storage
m15190 SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23001-5
m15191
Gi-Mun UmTaeone KimNamho HurJinwoong Kim
Segment-based Disparity Estimation using Foreground Separation
m15192 SC 29 Secretariat Table of Replies on ISO/IEC 14496-20:2006/FDAM 1
m15193 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/PDAM 31
m15194 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/PDAM 19
m15195Xiaozhong XuXilin ChenTiejun Huang
Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1
m15196Hideaki KimataHiroya NakamuraTakashi Itoh
Proposal on Profiles for MVC (Multi-view Video Coding)
m15197 [email protected]@jdl.ac.cn. Proposal for Video Signature
m15198 Jeong-Hwan Ahn. KNB Comment on 14496-16:2006/AMD1.Corr1 (3D
40
Daiyong Kim.Euee S. Jang
Mesh Coding Extension Correction)
m15199
Hyungyu KimGiseok SonByeongjun KimSinwook LeeChungku YieEuee S. Jang
Proposed text of CCR CD: A section for DD transmission
m15200
Dandan DingLu YuHonggang QiTiejun HuangWen Gao
BSDL Description of AVS Bitstream Syntax for RVC Framework
m15201 Nikolce StefanoskiJörn Ostermann
GNB comments on ISO/IEC 14496-16:2006/PDAM 2 (FAMC)
m15202
Honggang QiTiejun HuangWen GaoDandan DingLu Yu
Text Description for Bitstream Parser FU of AVS
m15203 Kyuheon Kim Updated WD 23000-11 for Stereoscopic Video Application Format
m15204 National Body of KOREA KNB Response to Clause 3.2.2 of 82nd MPEG Shenzhen Meeting Resolution
m15205 Filippo Chiariglione Proposed Working Draft of ISO/IEC 23000-5 2nd Edition
m15206
L. ChiariglionePhilip MerrillLuntian MouOlivier AvaroXin Wang
WIM TV Trial at Beijing Olympics
m15207 L. ChiariglioneOlivier Avaro Requirements for Digital Item Presentation
m15208 L. Chiariglione Requirements for MPEG eXtensible Middleware (MXM)
m15209 Wendy Aylsworth Response to sc29n8883 Liaison from JVT on potential extension of SVC
m15210 China National Body (CNNB) China NB Comments on Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1
m15211 DVB via SC 29 Secretariat Liaison Statement from DVB [SC 29 N 9045]
41
m15212 KNB KNB Comments on ISO/IEC 23000-4 2nd Edition FCD
m15213Zheng HuangTiejun HuangYonghong Tian
Video Signature based on Mutual Infomation
m15214 ITU-T SG 16 via SC 29 Secretariat Liaison Statement from ITU-T SG 16
m15215 the DVD Forum via SC 29 Secretariat Liaison Statement from the DVD Forum
m15216 Julie LoftonJeff Steele
Working draft for proposed MPEG-M Production Deliverables standard
m15217 Paul BrasnettMiroslaw Bober
Updated Results on Extended Trace Transform Image Signature
m15218 Jean H.A. Gelissen Information exchange with Virtual Worlds (Metaverse1) Presentation
m15219 Sikyung KimEuee S. Jang Table of 3D models in the MPEG 3DGC repository
m15220 Thomas Schierl for the GNB GNB comment on ISO/IEC 13818-1:2007/FDAM3
m15221 Jean-Claude DufourdOlivier Avaro Joint LASeR/BIFS scene representation
mxxxx webmaster Input contribution template
r1000 OhmSullivan Video Subgroup report for Friday Plenary
r1001 SullivanOhm JVT Report - Friday Plenary
r1002 SullivanOhm JVT Report - Wednesday Plenary
r1003 RVC RVC Report - Friday Plenary
r1004 Tobias Oelbaum SVC Verification Report - Friday Plenary
r1005 OhmSullivan Video Subgroup Report - Wednesday Plenary
r1006 Miroslaw Bober MPEG-7 Visual Report - Friday Plenary
w9555 Convener List of Documents from the 83rd Meeting in Antalya, Turkey
w9556 Convener Resolutions of the 83rd Meeting in Antalya, Turkey
w9557 Convener List of AHGs Established at the 83rd Meeting in Antalya, Turkey
w9558 Convener Report of the 83rd Meeting in Antalya, Turkey
42
w9559 Convener Guidelines for Electronic Distribution of MPEG M and N Documents
w9560 Convener Press Release of the 83rd Meeting in Antalya, Turkey
w9561 Convener Meeting Notice of the 84th Meeting in Archamps, Switzerland
w9562 HoD Guide for WG 11 Meeting Hosts
w9563 video Request for 13818-2:2000/Amd.3
w9564 Video Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080p/60 Support
w9565 Video Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio Profile Levels 5 and 6
w9566 Video Study Text of ISO/IEC 14496-2:2004/DCOR3
w9567 Video Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio Profile Levels 5 and 6 Conformance Testing
w9568 Video Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 31
w9569 Video Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for Scalable Video Coding
w9570 Video Disposition of Comments on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR 1
w9571 Video Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1
w9572 Video Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 19
w9573 Video Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding
w9574 Video Text of ISO/IEC 14496-10:200X/DCOR 1
w9575 Video Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1
w9576 Video Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding
w9577 Video Report on SVC Verification Tests
w9578 Video Joint Multiview Video Model (JMVM) 7
w9579 Video JMVM 7 Software
w9580 Video Overview of Multiview Video Coding (MVC)
w9581 Video Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools
43
w9582 Video Description of Core Experiments for MPEG-7 New Visual Extensions
w9583 Video Request for 13818-4:2004/Amd.3
w9584 Video Study Text of ISO/IEC CD 23001-4 Codec Configuration Representation
w9585 Video Reconfigurable Video Coding Requirements V 4.0
w9586 Video Overview of Reconfigurable Video Coding (RVC)
w9587 Video Study Text of CD ISO/IEC 23002-4 Video Tool Library
w9588 Video Extensions of Video Tool Library under consideration
w9589 Video Description of Core Experiments in RVC
w9590 Video RVC Simulation Model (RSM) V7.0
w9591 Video RVC Work Plan and FU Development Status
w9592 Video RVC Conformance Testing Working Draft V4.0
w9593 Video Description of Exploration Experiments in RVC
w9594 Video Methodologies for Video Toolbox Extension V2.0
w9595 Video Call for Contributions on 3D Video Test Material (Update)
w9596 Video Description of Exploration Experiments in 3D Video
w9597 Convener AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
w9598 Convener AHG on Reconfigurable Video Coding
w9599 Convener AHG on MPEG-7 Visual
w9600 Convener Terms of reference
w9601 Convener MPEG Standards
w9602 Convener Table of unpublished FDISs
w9603 Convener Work plan and time line
w9604 Convener Editors of MPEG standards
w9605 Convener Schema assets
w9606 Convener Software assets
w9607 Convener Conformance assets
w9608 Convener Content assets
w9609 Convener URI assets
44
w9610 Convener Standards under development for which a call for patent statements is issued
w9611 Convener List of Organisations with which MPEG entertains liaisons
w9612 DELETED DELETED
w9613 Convener AHG on FTV
w9614 Convener Liaison Statement to SMPTE re RVC
w9615 Convener Liaison Statement to ITU-T SG 9 re FTV
w9616 Convener Liaison Statement to ITU-T SG 9 re Bitstream Splicing
w9617 Convener Liaison Statement template for various organizations re SVC verification testing report
w9618 Video Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p Conformance Testing
w9619 Audio Workplan for AAC-ELD Verification Test
w9620 Audio DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance
w9621 Audio ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance
w9622 Audio ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo
w9623 Audio ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS
w9624 Audio ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and additional AAC Conformance
w9625 Audio DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance
w9626 Audio ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance
w9627 Audio MPEG-4 Audio Conformance Rollup
w9628 Audio ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS
w9629 Audio ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw.
w9630 Audio Study on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 and -2 Audio in MPEG-4 and BSAC Extensions
w9631 Audio DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround Conformance
w9632 Audio ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance
45
w9633 Audio Workplan on further issues for MPEG Surround Conformance
w9634 Audio DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround Reference Sw.
w9635 Audio ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw.
w9636 Audio Status and Workplan on SAOC Core Experiments
w9637 Audio WD on SAOC Text and Reference Software
w9638 Audio Evaluation Guidelines for Unified Speech and Audio Proposals
w9639 Audio Workplan on Speech and Audio Material Selection
w9640 Audio Draft Workplan Evaluation Subjective Tests
w9641 Convener Liaison Statement to ETSI TC DECT
w9642 3DGC Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC Conformance)
w9643 3DGC Study on PDAM of ISO/IEC 14496-4:2004 AMD33 (MultiResolution Profile Conformance)
w9644 3DGC ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance)
w9645 3DGC ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft)
w9646 3DGC Study of ISO/IEC 14496-16:2006/AMD1/DCOR1
w9647 3DGC DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated Mesh Compression)
w9648 3DGC Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Mesh Compression)
w9649 3DGC WD2.0 of AFX 3rd Edition
w9650 3DGC Requirements for low-complexity 3D mesh compression
w9651 3DGC CfP for low-complexity 3D mesh compression
w9652 3DGC Study of CD of ISO/IEC 14496-25
w9653 Convener AHG on Audio Standards Maintenance
w9654 Convener AHG on Unified Speech and Audio Coding and SAOC and AAC-ELD
w9655 Convener AHG on Information Exchange with Virtual Worlds
w9656 Convener AHG on the RoSE Framework
w9657 Convener AHG on Requirements for Media Value Chain Ontology
46
w9658 Requirements Requirements for a Media Value Chain Ontology
w9659 Requirements Requirements on RoSE Framework
w9660 Convener Liaison Statement to ITU-T SG 16
w9661 Convener AHG on 3DG documents and software maintenance
w9662 Convener Ad Hoc Group on Scene Representation
w9663 Convener Ad Hoc Group on MPEG File Formats
w9664 Convener Ad Hoc Group on Application Format
w9665 Convener Ad Hoc Group on Presentation of Structured Information
w9666 Convener AHG on Requirements for MPEG Post Production Deliverable Formats
w9667 Convener AHG on MPEG Query Format
w9668 Convener AHG on Font Format Representation
w9669 Systems Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems
w9670 Systems Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of AVC
w9671 Systems DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation Ref. Soft.
w9672 Systems Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. Soft.
w9673 Systems DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft.
w9674 Systems Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft.
w9675 Systems WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems Architecture
w9676 Systems Request for Amendment of ISO/IEC 14496-11
w9677 Systems ISO/IEC 14496-11 PDAM6 Scene Partitionning
w9678 Systems Text of ISO/IEC 14496-12 3rd Edition
w9679 Systems WD1.0 of Corrigendum on ISO/IEC 14496-12
w9680 Systems Updated Technology under Consideration for Part 12
w9681 Systems DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension
w9682 Systems Text of ISO/IEC 14496-15/FDAM2 SVC File Format Extension
47
w9683 Systems Request for 14496-22 2nd Edition
w9684 Systems Text of CD ISO/IEC 14496-22 2nd Edition
w9685 Systems Items for consideration for Corrigendum or Amendment of MPEG-21 DIA
w9686 Systems DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile
w9687 Systems Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile
w9688 Systems MPEG-21 REL Profiles Software Implementation Plan v.9
w9689 Systems MAF Overview Document
w9690 Systems MAF Overview Presentation
w9691 Systems Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition
w9692 Systems Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format Conf. and Ref. Software
w9693 Systems Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming Application Format
w9694 Systems Requirements on Professional Archival Application Format
w9695 DELETED DELETED
w9696 Systems Text of ISO/IEC CD 23000-6 Professional Archival Application Format
w9697 Systems DoC of ISO/IEC FCD 23000-7 Open Access Application Format
w9698 Systems Text of ISO/IEC FDIS 23000-7 Open Access Application Format
w9699 Systems Request of Amendment for ISO/IEC 23000-7
w9700 Systems Text of ISO/IEC PDAM1 23000-7 Conformance and Reference Software
w9701 Systems Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format
w9702 Systems Workplan for Portable Video Application Format Conformance and Ref. Soft.
w9703 Systems Text of ISO/IEC 23000-9/DCOR1 (MAF Application Format)
w9704 Systems Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance
48
and Reference Software
w9705 Systems DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format)
w9706 Systems Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format)
w9707 Systems Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software
w9708 Systems Future Work on Surveillance AF's - collection of requirements
w9709 Systems Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format)
w9710 Systems Requirements for MPEG Post Production Deliverable Formats
w9711 Systems Gap Analysis between Post Production Deliverable Requirements and Proposed Working Draft
w9712 Systems Text of WD1.0 MPEG Post Production Deliverable Formats
w9713 Systems Requirements for MXM (MPEG eXtensible Middleware)
w9714 Convener Liaison to JPEG on ISO Base Format
w9715 Systems Requirements for Presentation of Structured Information
w9716 Systems Preliminary WD of Presentation of Structured Information
w9717 Systems Requirements on WIM TV
w9718 Convener Response to DVB on File Format
w9719 Convener Response to DVB on Carriage and Storage of SVC
w9720 Convener Response to JPEG on Query Format
w9721 Convener Liaison to JTC1/SWG-ARM on PA Application Format
w9722 Convener Liaison to SMPTE on PA Application Format
w9723 Convener Liaison to TC20/SC13 on PA Application Format
w9724 Convener Liaison to JPEG on PA Application Format
w9725 Convener Response to JTC1/SC34
w9726 Convener Liaison to ITU-T SG16 on IPTV
w9727 Convener Liaison to Creative Common on Open Access Application Format
49
w9728 Convener Liaison to SMPTE on Post-Production Deliverables
w9729 Convener Liaison to NAB on Post-Production Deliverables
w9730 Convener Liaison to ATSC on Post-Production Deliverables
w9731 Convener Liaison to MPAA on Post-Production Deliverables
50
Annex D– Output documents
Number Source Title
w9555 Convener List of Documents from the 83rd Meeting in Antalya, Turkey
w9556 Convener Resolutions of the 83rd Meeting in Antalya, Turkey
w9557 Convener List of AHGs Established at the 83rd Meeting in Antalya, Turkey
w9558 Convener Report of the 83rd Meeting in Antalya, Turkey
w9559 Convener Guidelines for Electronic Distribution of MPEG M and N Documents
w9560 Convener Press Release of the 83rd Meeting in Antalya, Turkey
w9561 Convener Meeting Notice of the 84th Meeting in Archamps, Switzerland
w9562 HoD Guide for WG 11 Meeting Hosts
w9563 video Request for 13818-2:2000/Amd.3
w9564 Video Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080p/60 Support
w9565 Video Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio Profile Levels 5 and 6
w9566 Video Study Text of ISO/IEC 14496-2:2004/DCOR3
w9567 Video Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio Profile Levels 5 and 6 Conformance Testing
w9568 Video Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 31
w9569 Video Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for Scalable Video Coding
w9570 Video Disposition of Comments on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR 1
w9571 Video Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1
w9572 Video Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 19
w9573 Video Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding
w9574 Video Text of ISO/IEC 14496-10:200X/DCOR 1
w9575 Video Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1
w9576 Video Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding
w9577 Video Report on SVC Verification Tests
w9578 Video Joint Multiview Video Model (JMVM) 7
51
w9579 Video JMVM 7 Software
w9580 Video Overview of Multiview Video Coding (MVC)
w9581 Video Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools
w9582 Video Description of Core Experiments for MPEG-7 New Visual Extensions
w9583 Video Request for 13818-4:2004/Amd.3
w9584 Video Study Text of ISO/IEC CD 23001-4 Codec Configuration Representation
w9585 Video Reconfigurable Video Coding Requirements V 4.0
w9586 Video Overview of Reconfigurable Video Coding (RVC)
w9587 Video Study Text of CD ISO/IEC 23002-4 Video Tool Library
w9588 Video Extensions of Video Tool Library under consideration
w9589 Video Description of Core Experiments in RVC
w9590 Video RVC Simulation Model (RSM) V7.0
w9591 Video RVC Work Plan and FU Development Status
w9592 Video RVC Conformance Testing Working Draft V4.0
w9593 Video Description of Exploration Experiments in RVC
w9594 Video Methodologies for Video Toolbox Extension V2.0
w9595 Video Call for Contributions on 3D Video Test Material (Update)
w9596 Video Description of Exploration Experiments in 3D Video
w9597 Convener AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
w9598 Convener AHG on Reconfigurable Video Coding
w9599 Convener AHG on MPEG-7 Visual
w9600 Convener Terms of reference
w9601 Convener MPEG Standards
w9602 Convener Table of unpublished FDISs
w9603 Convener Work plan and time line
w9604 Convener Editors of MPEG standards
w9605 Convener Schema assets
w9606 Convener Software assets
w9607 Convener Conformance assets
w9608 Convener Content assets
52
w9609 Convener URI assets
w9610 Convener Standards under development for which a call for patent statements is issued
w9611 Convener List of Organisations with which MPEG entertains liaisons
w9612 DELETED DELETED
w9613 Convener AHG on FTV
w9614 Convener Liaison Statement to SMPTE re RVC
w9615 Convener Liaison Statement to ITU-T SG 9 re FTV
w9616 Convener Liaison Statement to ITU-T SG 9 re Bitstream Splicing
w9617 Convener Liaison Statement template for various organizations re SVC verification testing report
w9618 Video Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p Conformance Testing
w9619 Audio Workplan for AAC-ELD Verification Test
w9620 Audio DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance
w9621 Audio ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance
w9622 Audio ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo
w9623 Audio ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS
w9624 Audio ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and additional AAC Conformance
w9625 Audio DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance
w9626 Audio ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance
w9627 Audio MPEG-4 Audio Conformance Rollup
w9628 Audio ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS
w9629 Audio ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw.
w9630 Audio Study on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 and -2 Audio in MPEG-4 and BSAC Extensions
w9631 Audio DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround Conformance
w9632 Audio ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance
w9633 Audio Workplan on further issues for MPEG Surround Conformance
w9634 Audio DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround Reference Sw.
53
w9635 Audio ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw.
w9636 Audio Status and Workplan on SAOC Core Experiments
w9637 Audio WD on SAOC Text and Reference Software
w9638 Audio Evaluation Guidelines for Unified Speech and Audio Proposals
w9639 Audio Workplan on Speech and Audio Material Selection
w9640 Audio Draft Workplan Evaluation Subjective Tests
w9641 Convener Liaison Statement to ETSI TC DECT
w9642 3DGC Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC Conformance)
w9643 3DGC Study on PDAM of ISO/IEC 14496-4:2004 AMD33 (MultiResolution Profile Conformance)
w9644 3DGC ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance)
w9645 3DGC ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft)
w9646 3DGC Study of ISO/IEC 14496-16:2006/AMD1/DCOR1
w9647 3DGC DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated Mesh Compression)
w9648 3DGC Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Mesh Compression)
w9649 3DGC WD2.0 of AFX 3rd Edition
w9650 3DGC Requirements for low-complexity 3D mesh compression
w9651 3DGC CfP for low-complexity 3D mesh compression
w9652 3DGC Study of CD of ISO/IEC 14496-25
w9653 Convener AHG on Audio Standards Maintenance
w9654 Convener AHG on Unified Speech and Audio Coding and SAOC and AAC-ELD
w9655 Convener AHG on Information Exchange with Virtual Worlds
w9656 Convener AHG on the RoSE Framework
w9657 Convener AHG on Requirements for Media Value Chain Ontology
w9658 Requirements Requirements for a Media Value Chain Ontology
w9659 Requirements Requirements on RoSE Framework
w9660 Convener Liaison Statement to ITU-T SG 16
w9661 Convener AHG on 3DG documents and software maintenance
w9662 Convener Ad Hoc Group on Scene Representation
54
w9663 Convener Ad Hoc Group on MPEG File Formats
w9664 Convener Ad Hoc Group on Application Format
w9665 Convener Ad Hoc Group on Presentation of Structured Information
w9666 Convener AHG on Requirements for MPEG Post Production Deliverable Formats
w9667 Convener AHG on MPEG Query Format
w9668 Convener AHG on Font Format Representation
w9669 Systems Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems
w9670 Systems Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of AVC
w9671 Systems DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation Ref. Soft.
w9672 Systems Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. Soft.
w9673 Systems DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft.
w9674 Systems Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft.
w9675 Systems WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems Architecture
w9676 Systems Request for Amendment of ISO/IEC 14496-11
w9677 Systems ISO/IEC 14496-11 PDAM6 Scene Partitionning
w9678 Systems Text of ISO/IEC 14496-12 3rd Edition
w9679 Systems WD1.0 of Corrigendum on ISO/IEC 14496-12
w9680 Systems Updated Technology under Consideration for Part 12
w9681 Systems DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension
w9682 Systems Text of ISO/IEC 14496-15/FDAM2 SVC File Format Extension
w9683 Systems Request for 14496-22 2nd Edition
w9684 Systems Text of CD ISO/IEC 14496-22 2nd Edition
w9685 Systems Items for consideration for Corrigendum or Amendment of MPEG-21 DIA
w9686 Systems DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile
w9687 Systems Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile
w9688 Systems MPEG-21 REL Profiles Software Implementation Plan v.9
w9689 Systems MAF Overview Document
55
w9690 Systems MAF Overview Presentation
w9691 Systems Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition
w9692 Systems Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format Conf. and Ref. Software
w9693 Systems Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming Application Format
w9694 Systems Requirements on Professional Archival Application Format
w9695 DELETED DELETED
w9696 Systems Text of ISO/IEC CD 23000-6 Professional Archival Application Format
w9697 Systems DoC of ISO/IEC FCD 23000-7 Open Access Application Format
w9698 Systems Text of ISO/IEC FDIS 23000-7 Open Access Application Format
w9699 Systems Request of Amendment for ISO/IEC 23000-7
w9700 Systems Text of ISO/IEC PDAM1 23000-7 Conformance and Reference Software
w9701 Systems Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format
w9702 Systems Workplan for Portable Video Application Format Conformance and Ref. Soft.
w9703 Systems Text of ISO/IEC 23000-9/DCOR1 (MAF Application Format)
w9704 Systems Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference Software
w9705 Systems DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format)
w9706 Systems Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format)
w9707 Systems Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software
w9708 Systems Future Work on Surveillance AF's - collection of requirements
w9709 Systems Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format)
w9710 Systems Requirements for MPEG Post Production Deliverable Formats
w9711 Systems Gap Analysis between Post Production Deliverable Requirements and Proposed Working Draft
w9712 Systems Text of WD1.0 MPEG Post Production Deliverable Formats
w9713 Systems Requirements for MXM (MPEG eXtensible Middleware)
w9714 Convener Liaison to JPEG on ISO Base Format
w9715 Systems Requirements for Presentation of Structured Information
w9716 Systems Preliminary WD of Presentation of Structured Information
56
w9717 Systems Requirements on WIM TV
w9718 Convener Response to DVB on File Format
w9719 Convener Response to DVB on Carriage and Storage of SVC
w9720 Convener Response to JPEG on Query Format
w9721 Convener Liaison to JTC1/SWG-ARM on PA Application Format
w9722 Convener Liaison to SMPTE on PA Application Format
w9723 Convener Liaison to TC20/SC13 on PA Application Format
w9724 Convener Liaison to JPEG on PA Application Format
w9725 Convener Response to JTC1/SC34
w9726 Convener Liaison to ITU-T SG16 on IPTV
w9727 Convener Liaison to Creative Common on Open Access Application Format
w9728 Convener Liaison to SMPTE on Post-Production Deliverables
w9729 Convener Liaison to NAB on Post-Production Deliverables
w9730 Convener Liaison to ATSC on Post-Production Deliverables
w9731 Convener Liaison to MPAA on Post-Production Deliverables
w9732 Convener Liaison to EBU on Post-Production Deliverables
w9733 Convener Liaison to IEC TC100 TA6 on Post-Production Deliverables
w9734 Convener Liaison to IFPI on Post-Production Deliverables
w9735 Convener Liaison to DMP on Presentation of Structured Information
w9736 Convener Liaison to IEC TC 9/WG 43 on Video Surveillance AF
57
Annex E – Requirements report
Source: Jörn Ostermann (Leibniz Universität Hannover)
1. Requirements documents approved at this meeting
w9658 Requirements for a Media Value Chain Ontology
w9659 Requirements on RoSE Framework
2. MPEG-V: Information exchange with virtual worlds
Several use cases were discussed. Until the next meeting, these use cases have to be verified. A call for proposals and evaluation methods for the proposals have to be prepared for the next meeting such that a call for proposals can be issued. The tentative work plan set is WD in July or October 2008 and CD in January 2009.
3. Explorations
3.1. IPTV RequirementsNo input on this topic was brought to the meeting. Since IPTV is an important area of work for MPEG, a joint meeting with Systems was held clarifying the contributions MPEG (codecs, IPMP, streaming) could make in this field. As a result, liaison from SG16 on Meta data, M3W, and WimTV are requested. The IPTV Requirements document (N9167) has not been updated.
3.2. Rose
Requirements for Rose, the representation of sensory effects, were discussed. At the next meeting, the group plans to issue a Call for Proposals.
Input documents: m15083 Requirements on RoSE Frameworkm15091 Requirements on Framework for RoSE
3.3. Media Value Chain OntologyThis topic has been discussed within MPEG for several meetings. It is now time to clarify a time line for standardization. Therefore, it was decided to prepare a requirements document and a Call for Proposals until the next meeting. At the next meeting, the decision on the documents will determine whether MPEG is going forward with this activity.
58
Input documents: m15031 Ad Hoc Group on Requirements of Media Value Chains Ontologies m15092 A Common Core IP Model
3.4. Future Work ItemsA review of the work areas of MPEG was started. Currently, MPEG focuses on the consumer market. Professional profiles of MPEG are provided such that consumer content can be produced efficiently. The standards define only the decoder, they do not define rendering. MPEG does not require or profile the use of error resilience features since the transport channel is assumed transparent.
In order to widen the usage of MPEG standards, MPEG members are requested to bring proposals for new work items to the next meetings. Possible items identified at the meeting are:
Tools for consumer content creation, manipulation, annotation, distribution and privacy High efficiency video coding considering new colour spaces High efficiency audio coding Control of display setting from contents Accessibility including e-inclusion and alternative output devices for disabled Capture and presentation of smell, touch, vibrations and emotions
At the next meeting, potential work items will be discussed further.
59
Annex F – Systems report
Source: Systems Chair and Break-out group ChairsEditor: Olivier Avaro (Streamezzo)
Contributors: David Singer (Apple), Young-Kwon Lim (Net&TV), Jean Gelissen (Philips), Gero Baese (Siemens)
1 OverviewThe main outputs of the meeting from the Systems Sub-group perspective are:
No. TitleX 13818-1 MPEG-2 Systems9669 Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems9670 Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of AVC X 14496-5 Reference Software9671 DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation Ref. Soft.9672 Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. Soft.9673 DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft.9674 Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft.X 14496-11 Scene Representation9675 WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems Architecture9676 Request for Amendment of ISO/IEC 14496-119677 ISO/IEC 14496-11 PDAM6 Scene PartitionningX 14496-12 ISO Base Media File Format9678 Text of ISO/IEC 14496-12 3rd Edition9679 WD1.0 of Corrigendum on ISO/IEC 14496-129680 Updated Technology under Consideration for Part 12 X 14496-15 AVC File Format9681 DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension9682 Text of ISO/IEC 14496-15/FDAM2 SVC File Format ExtensionX 14496-22 Open Font Format9683 Request for 14496-22 2nd Edition9684 Text of ISO/IEC 2nd Edition 14496-22X 21000 General9685 Items for consideration for Corrigendum or Amendment of MPEG-21 DIAX 21000-5 Rights Expression Language9686 DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile9687 Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile9688 MPEG-21 REL Profiles Software Implementation Plan v.9X 23000 General9689 MAF Overview Document9690 MAF Overview PresentationX 23000-4 Musical Slide Show Application Format9691 Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition9692 Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format Conf. and Ref.
SoftwareX 23000-5 Media Streaming Application Format
9693 Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming Application Format
X 23000-6 Professional Archival Application Format9694 Requirements on Professional Archival Application Format9695 Request for ISO/IEC 23000-6 Professional Archival Application Format
60
9696 Text of ISO/IEC CD 23000-6 Professional Archival Application FormatX 23000-7 Open Access Application Format9697 DoC of ISO/IEC FCD 23000-7 Open Access Application Format9698 Text of ISO/IEC FDIS 23000-7 Open Access Application Format9699 Request of Amendment for ISO/IEC 23000-7 9700 Text of ISO/IEC PDAM1 23000-7 Conformance and Reference SoftwareX 23000-8 Portable Video Application Format9701 Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format9702 Workplan for Portable Video Application Format Conformance and Ref. Soft.X 23000-9 Digital Multimedia Broadcasting Application Format9703 Text of ISO/IEC 23000-9/DCOR1 (DMB Application Format)9704 Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference SoftwareX 23000-10 Video Surveillance Application Format9705 DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format)9706 Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format)9707 Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference
Software9708 Future Work on Surveillance AF's – collection of requirementsX 23000-11 Stereoscopic Video Application Format9709 Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format)X XXX Post Production Deliverable Formats9710 Requirements for MPEG Post Production Deliverable Formats9711 Gap Analysis between PPD Requirements and Proposed Working Draft9712 Text of WD1.0 MPEG Post Production Deliverable FormatsX Exploration9713 Requirements for MXM (MPEG eXtensible Middleware)9715 Requirements for Presentation of Structured Information9716 Preliminary WD of Presentation of Structured Information9717 Requirements on WIM TVX Assets and Standing Documents9605 MPEG Schema Assets UpdatesX Liaison9718 Response to DVB on File Format9719 Response to DVB on Carriage and Storage of SVC9720 Response to JPEG on Query Format9721 Liaison to JTC1/SWG-ARM on PA Application Format9722 Liaison to SMPTE on PA Application Format9723 Liaison to TC20/SC13 on PA Application Format9724 Liaison to JPEG on PA Application Format9725 Response to JTC1/SC349726 Liaison to ITU-T SG16 on IPTV9727 Liaison to Creative Common on Open Access Application Format 9728 Liaison to SMPTE on Post-Production Deliverables9729 Liaison to NAB on Post-Production Deliverables9730 Liaison to ATSC on Post-Production Deliverables9731 Liaison to MPAA on Post-Production Deliverables9732 Liaison to EBU on Post-Production Deliverables9733 Liaison to IEC TC100 TA6 on Post-Production Deliverables9734 Liaison to IFPI on Post-Production Deliverables9735 Liaison to DMP on Presentation of Structured Information9736 Liaison to ITU-T TC 9 WG43 on Video Surveillance AF9714 Liaison to JPEG on ISO Base Format
61
2 General issues
2.1 GeneralThe meeting report from Shenzhen has been approved.The following demonstrations have been made:
None.
2.2 List of standards under developmentPr Pt Edit. Project Description CfP WD CD FCD FDIS
2 1 2006 Amd.3 SVC in MPEG-2 Systems
07/07 08/01 08/07
2 1 2006 Cor.2 Transport of AVC Specification
08/04 08/07
4 1 200x Amd.4 Registration Authority 07/10 08/04 08/074 4 2007 Amd.26 Open Font Format Conf. 07/04 07/10 08/044 4 2007 Amd.27 LASeR Amd.1
Conformance06/10 07/07 07/10 08/04
4 4 2007 Amd.xx SVC File Format Conf. TBS4 5 2007 Amd.14 Open Font Format Ref.
Soft07/10 08/04 08/10 09/01
4 5 2007 Amd.xx AVC File Format Ref. Soft
TBS
4 5 2007 Amd.xx SVC File Format Ref. Soft
TBS
4 5 2007 Amd.xx Synthesized Texture Ref. Soft
08/04 08/07 09/01
4 11 2005 Amd.6 Scene Partitionning 08/01 08/04 08/107 5 2008 Amd.4. Improvements to
geographic descriptor 08/04
7 7 2008 Amd.3. Improvements to geographic descriptor conformance
08/04
7 12 2008 1st Ed. MPEG Query Format 07/10 08/0421 8 200x Amd.1 Minor Enhancement 07/10 08/04 08/0721 9 200x Amd.1 MP21 Mime Type 07/04 07/10 08/0421 15 200x Amd.1 Security in Event
Reporting08/04
A 4 200x 2nd Ed. Protected MSS AF 07/04 07/07 07/10 08/04A 4 200x Amd.1 MSS AF Conf. and Soft 07/07 07/10 08/04 08/07A 5 200x 2nd Ed. MS AF 08/01 08/04 08/10 09/04A 6 200x 1st Ed. Professional Archival
AF07/10 08/01 08/04 08/10
A 7 200x Amd.1 OA AF Ref. Soft and Conf.
08/01 08/04 08/10
A 8 200x 1st Ed. Portable Video Player AF
06/10 07/04 07/10 08/04
A 8 200x Amd.1 PVP AF Ref. Soft. And Conf.
A 9 200x Amd.1 DMB AF Ref .Soft. And
62
Conf.A 10 200x 1st Ed. Video Surveillance AF 07/04 07/07 08/01 08/07A 10 200x Amd.1 Video Surveillance AFA 11 200x 1st Ed. Stereoscopic Video AF 07/04 08/01 08/04 08/10A 11 200x 1st Ed. SV AF Ref. Soft. And
Conf.08/07 08/10 09/04 09/07
B 2 200x Amd.1 Fragment Request Unit Ref. Soft. And Conf.
E 8 200x 1st Ed. Ref. Soft. and Conformance
07/01 07/07 08/04 08/07
V 1 200x 1st Ed. Interface with Virtual World
08/07 08/10 09/01 09/04 09/10
63
2.3 Standing Documents
Pr Pt Documents No. Meeting1 1 MPEG-1 White Paper – Multiplex Format N7675 05/07 Nice1 1 MPEG-1 White Paper – Terminal Architecture N7676 05/07 Nice1 1 MPEG-1 White Paper – Multiplexing and
SynchronizationN7677 05/07 Nice
2 1 MPEG-2 White Paper – Multiplex Format N7678 05/07 Nice2 1 MPEG-2 White Paper – Terminal Architecture N7679 05/07 Nice2 1 MPEG-2 White Paper – Multiplexing and
SynchronizationN7680 05/07 Nice
2 11 MPEG-2 White Paper – MPEG-2 IPMP N7503 05/07 Poznan4 1 MPEG-4 White Paper – MPEG-4 Systems N7504 05/07 Poznan4 1 MPEG-4 White Paper – Terminal Architecture N7610 05/10 Nice4 1 MPEG-4 White Paper – M4MuX N7921 06/01 Bangkok4 1 MPEG-4 White Paper – OCI N8148 06/04 Montreux4 6 MPEG-4 White Paper – DMIF N8149 06/04 Montreux4 11 MPEG-4 White Paper – BIFS N7608 05/10 Nice4 12 MPEG-4 White Paper – ISO File Format N8150 06/04 Montreux4 14 MPEG-4 White Paper – MP4 File Format N7923 06/01 Bangkok4 15 MPEG-4 White Paper – AVC FF N7924 06/01 Bangkok4 13 White Paper on MPEG-4 IPMP N7505 05/07 Poznan4 13 MPEG IPMP Extensions Overview N6338 04/03 München4 17 White Paper on Streaming Text N7515 05/07 Poznan4 18 White Paper on Font Compression and Streaming N7508 05/07 Poznan4 20 Presentation Material on LASER N6969 05/01 Hong-
Kong4 20 White Paper on LASeR N7507 05/07 Poznan4 22 White Paper on Open Font Format N7519 05/07 Poznan7 1 MPEG-7 White Paper - MPEG-7 Systems N7509 05/07 Poznan7 1 MPEG-7 White Paper – Terminal Architecture N8151 06/04 Montreux21 9 MPEG-21 White Paper – MPEG-21 File Format N7925 06/01 BangkokA X MPEG Application Format Overview N9421 07/10 ShenzhenA X MAF Overview Document N9691 08/01 AntalyaA X MAF Overview Presentation N9690 08/01 AntalyaB X MPEG-B White Paper – BinXML N7922 06/01 BangkokE X MPEG Multimedia Middleware Context and
ObjectivesN6335 04/03 München
E X 1rst M3W White paper N7510 05/07 PoznanE X 2nd M3W White Paper : Architecture N8152 06/04 MontreuxE X Tutorial on M3W N8153 06/04 MonreuxE X M3W White Paper : Multimedia Middleware
ArchitectureN8687 06/10 Hanzhou
E X M3W White Paper : Multimedia API N8688 06/10 HanzhouE X M3W White Paper : Component Model N8689 06/10 HanzhouE X M3W White Paper : Resource and Quality
ManagementN8690 06/10 Hanzhou
E X M3W White Paper : Component Download N8691 06/10 HanzhouE X M3W White Paper : Fault Management N8692 06/10 Hanzhou
64
E X M3W White Paper : System Integrity Management
N8693 06/10 Hanzhou
65
2.4 Mailing Lists Reminder
Topic Information Kindly Managed by
General Systems
List
Liste Reflector : [email protected]:
http://lists.uni-klu.ac.at/mailman/listinfo/gen-sysmailto:[email protected]?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/gen-sysList-Help: mailto:[email protected]?subject=help
University of Klagenfurt
BiM
Liste Reflector : [email protected]:
http://lists.uni-klu.ac.at/mailman/listinfo/mpeg7-sysmailto:[email protected]?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg7-sysList-Help: mailto:[email protected]?subject=help
University of Klagenfurt
File Format
Liste Reflector : [email protected]:
http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sysmailto:[email protected]?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/mp4-sysList-Help: mailto:[email protected]?subject=help
University of Klagenfurt
LASeR
Liste Reflector : [email protected]:
http://lists.uni-klu.ac.at/mailman/listinfo/mpeg-lasermailto:[email protected]?subject=subscribe
List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg-laserList-Help: mailto:[email protected]?subject=help
University of Klagenfurt
MAF
Liste Reflector : [email protected]:http://lists.uni-klu.ac.at/mailman/listinfo/maf-sysmailto:[email protected]?subject=subscribeList-Archive: http://lists.uni-klu.ac.at/pipermail/maf-sysList-Help: mailto:[email protected]?subject=help
University of Klagenfurt
MPEG-2 on File Format
#1: Please subscribe via http://lists.uni-klu.ac.at/mailman/listinfo/isoff-transport. Please use only that email address for posting messages with which you're subscribed. Otherwise the email won't be delivered. #2: The email address for posting messages is: [email protected]
University of Klagenfurt
66
#3: The archive is accessible via http://lists.uni-klu.ac.at/mailman/private/isoff-transport/ for list members only.
2.5 FAQThe FAQ were updated as needed.
2.6 AOBNone.
67
3 MPEG-2 Systems (13818-1)
3.1 GeneralM15210: China NB Comments on Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1. The Systems sub-group thanks the China NB for their input contribution on the carriage of AVS in MPEG-2 Systems. The Systems sub-group recommends to use mechanisms already in place to carry data in formats defines by organization external to MPEG (i.e. using the SMPTE registration authority). To the knowledge of the Systems sub-group, these mechanisms will fully satisfy the requirements of the China NB. In addition, the Systems sub-group noted the interest of the carriage of RVC on MPEG-2 Systems and welcome contributions in this area.
M15195: Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1. Noted.
3.2 13818-1:2005 Amd.3 Carriage of SVC
3.2.1 Topics1. Transport of Scalable Video Coding
3.2.2 ContributionsM15220: Late GNB on ISO/IEC 13818-1:2007/FDAM3. Joint work was done to produce Study text of the Carriage of SVC.
Technical Work in Progress.
3.3 13818-1:2005 DCOR.2
3.3.1 Topics1. Coorigendum on the carriage of AVC
3.3.2 ContributionsNone.
Technical Work in Progress.
4 MPEG-4 Conformance (14496-4)
4.1 14496-4 Amd.26 Open Font Format Conformance
4.1.1 Topics1. Open Font Format Conformance
4.1.2 ContributionsNone.
Technical Work in Progress.
68
4.2 14496-4 Amd.27 LASeR V2 Conformance
4.2.1 Topics1. LASeR V2 Conformance
4.2.2 ContributionsNone.
Technical Work in Progress.
5 MPEG-4 Reference Software (14496-5)
5.1 14496-5 Amd.14
5.1.1 Topics1. Open Font Format Reference Software
5.1.2 ContributionsNone.
Technical Work In Progress.
5.2 14496-5 Amd.16
5.2.1 Topics1. Symbolic Music Representation Reference Software
5.2.2 ContributionsM15059: Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 16 [SC 29 N 8926]. See DoC.M15079: Editor Study on ISO/IEC 14496-5:2001/FPDAM 16 Symbolic Music Representation reference software. Taken as input to produce final text.
Technical Work Completed.
5.3 14496-5 Amd.17
5.3.1 Topics1. LASeR Reference Software
5.3.2 ContributionsM15060: Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 17 [SC 29 N 8927].
Technical Work Completed.
69
6 MPEG-4 ISO Base File Format (14496-12)
6.1 14496-12 ISO Base Media File format General
6.1.1 15093 editor's re-structure part 12Thank you. Please publish this ASAP with the help of the secretariat. Two fixes needed to movie fragments, and a sentence fragment “A track” needs removing in track selection.
6.1.2 15147 extended sample groupsThis attacks a number of problems; having compact group ‘definition’ in-line in the mapping box, a different way to compress the mapping (absolute sample numbers), multiple group definitions for the same type (with separated mapping tables), and extended, possibly variable-length in-line ‘values’ in the mapping box. All except the last seem fairly straightforward. The absolute sample numbers was a stylistic point (no other sample table has absolute numbers).This design is also somewhat complicated by a desire to use the same definition material for both sample groups and timed meta-data.We’d like a resolution asking for input on improving the sample group design, overall, taking into account these (and maybe other) issues and opportunities.
6.1.3 15211 indexingThis explains the timed meta-data use of the same structures.
6.1.4 15178 color infoTo systems plenary, please.
6.1.5 15179 alt. groups and backwards compatibilityInteresting, but we have some compatibility issues here, and we ought to define better what the track header flags mean and what their required behavior is, as well. It may be time to lift the ‘track is disabled if it’s a hint track’ rule, also. Hold this to the next meeting also (since we’re holding the possible Corr.).
6.1.6 w9379 Deriving from Part 12 updatedPlease add this to the TuC (or amendment if it becomes one).
6.1.7 Other amendment materialWe agree to include the information on RTP recording hint tracks (including the RTCP format), from the DVB liaison.
6.2 14496-12:2008/Amd2
6.2.1 Topics1. New Edition & Amd.2
6.2.2 ContributionsM15093 : Proposed re-structured ISO Base Media File Format. Adopted as starting point for new edition.
6.2.3 15073 Part 12 FDAM 2 repliesThank you for 100% approval.
70
6.2.4 15133 corr. to alc/fluteActually this is a Corr. to part 12 in general. We probably want to wait to issue this as a Corr. to the 2008 edition of part 12.
6.2.5 15134 Additions to alc/fluteThe first item seems needed; interleaving to construct source symbols from source files is very awkward right now.The second may be (?) a mis-understanding; every hint sample is attached to a sample entry that provides this data.This goes into the TuC for the upcoming amendment.
6.2.6 15148 Alc/flute conformanceThank you. We’re not sure about the file extension, but also not sure what is best. The spreadsheet needs updating, and we need to find out how to handle a ‘large’ (12MB) conformance file. Then we can open an amendment to the conformance part. We’ll do that in Archamps, with all the other Corr.s and Amd.s.
6.3 General consensus on the MPEG-2 TS approach
6.3.1 What is a sequence?multiple program, of which single is a special case
6.3.2 Hint track overalltimescale recommended at 90kHz, or an integer division or multiple thereof.
6.3.3 What is a sample?6.3.3.1 either multi-program, single TS packet per sample6.3.3.2 or single-program, multi TS packet per sample
In case (a), sync sample table is present but empty. Sample groups may be used to mark the sync points of the programs.In case (b), samples that contain GOP boundaries should have a GOP boundary at the start of a sample. The sync sample table marks the samples which start GOPs, and if the sync sample table is absent, all the samples are at the start of GOPs. If the sync sample table is present but empty, the GOP positions are unknown and may be not at the start of samples.Case (b) covers the (unusual) case of one sample for the sequence.
6.3.3.3 what about preceding and trailing bytes? (TS and FEC) as examples
6.3.3.4 how do you tell the difference?sample-size/(188+prec+trail) = N
6.3.3.5 are PMT, PAT, OD etc. also still in-stream?yes, probably. the sample entry documents the initial OD/PAT/PMT conditions for all samples associated with it. If these change, a new sample entry is needed for the first sample at or after the change. If they are not in the sample entry, then they are dynamic and the stream must be scanned.
71
6.3.4 What is in a sample entry?6.3.4.1 0 or more PMTs6.3.4.2 0 or 1 PAT6.3.4.3 0 or more OD6.3.4.4 indication for whether sample times are exactly PCR times6.3.4.5 Transport offset
there is an issue that this only applies to one sample, and isn’t safe under say editing (or random access). we should warn that this field may need updating after e.g. editing.
6.3.4.6 optional boxes for format of preceding and trailing bytes (not defined here)
6.3.5 What are the hint track timestamps (stts)?They may be reception/transmission times or PCR times. But there is a recommendation that the PCR times be used, as these are piece-wise linear and the stts table then compacts sensibly. The big question is, do we have a packet structure that allows the documentation of a reception/transmission offset from the PCR time?
6.3.6 Special issues for recording support?de-hinting issues (error concealment)
6.3.7 Track referencesto associated media tracks (“de-hinting”, linking at authoring time)
6.3.8 Constructors that use track refsyes
6.3.9 Other matters to correct/amd in part 12definition of a hint tracks
6.4 Part 12 MPEG-2 TS Storage
6.4.1 15146 MPEG-2 TS Hint TracksThank you for the introduction. We do need to think about de-hinting and error concealment.
6.4.2 15211 DVB MPEG-2 TS, (Indexing?)We need to decide whether to reply to the liaison.
6.4.3 15189 Proposed MPEG-2 TS storageThank you for this initial start on the combined specification, to all on the reflector and particularly those in the Sunday pre-meeting. The editing team will take the notes developed in the meeting and make an input to the next meeting of the proposed amendment. The editors should prepare their best effort by Feb 8th. The editing team is Hui Yong Kim, Stefan Döhla, David Singer.
7 MPEG-4 AVC File Format (14496-15)
7.1 14496-15:2004/Amd.2
7.1.1 Topics
1. SVC File Format Extensions
72
7.1.2 15066 Part 15 FPDAM repliesProcessed, thank you.
7.1.3 15173 editor's SVC FF draftAccepted, according to the Finnish request, as the basis of future work.
Technical Work Completed.
7.2 14496-15:2004/Amd.3? MVC FF
7.2.1 Topics1. MVC File Format Extensions
7.2.2 ContributionsLacking contributions, we’ll ask again, otherwise take things slowly.
Technical Work in Progress.
73
8 LASeR (14496-20)
8.1 14496-20/Amd.xxx
8.1.1 Topics1. LASeR Extensions
8.1.2 ContributionsNone.
Technical Work in Progress.
9 LASeR (14496-22) Open Font Format
9.1 14496-22/Amd.1
9.1.1 Topics
2. Open Font Format Extension
9.1.2 ContributionsM15081: USNB Contribution: Proposed amendment to ISO/IEC 14496-22. And M15082 : The proposal for amendment of ISO/IEC 14496-22 (in support of USNB comment m15081). Decision to start a new work item following USNB request. Decision to take text of M15082 as a basis to produce CD text of new edition of the Open Font Format Specification.
M15077: Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N 9035]. Request to update reference in our specification. Accepted and integrated in the text of 2nd Edition CD.
Technical Work in Progress.
10 15938-12 MPEG-7: Query Format
10.1 GeneralDiscussion on the location of MPEG Query Format Specification. Decision to keep it as Part 12 of MPEG-7 AND to integrate conformance and reference software as Amendment of Part 12.
M15176, M1577 : Nobody to present these contributions. Postponned to next meeting.M15109 : Not enough people to progress this specification. Postponned to next meeting.
74
11 21000 MPEG-21
11.1 GeneralM15135: MPEG-21 schema assets update. Taken as a basis to produce the related output document on Schema Assets.M15138: Multiple MPEG-21 DIA AdaptationQoS Descriptions within a Digital Item. Used to produce Items for consideration for Corrigendum or Amendment of MPEG-21 DIA.
11.2 MPEG-21 File Format Amendment
11.2.1 Topics1. Mime Type
11.2.2 ContributionsNone.
12 MPEG-A MAF (23000)
12.1 23000-4 Musical Slide Show MAF
12.1.1 Topics1. Protected Musical Slide Show MAF
12.1.2 ContributionsM15212: KNB Comments on ISO/IEC 23000-4 2nd Edition FCD. See DoC. M15124: Use cases for content protection in Musical slide show Application Format 2nd Edition. Integrated in study text of 2nd Edition.M15127: Editor's study text of ISO/IEC 23000-4/PDAM1 Musical slide show application format. Use as the basis to produce Study text.
Technical Work in Progress.
12.2 23000-5 Media Streaming MA
12.2.1 Topics1. Media Streaming MAF
12.2.2 ContributionsM15205: Proposed Working Draft of ISO/IEC 23000-5 2nd Edition. Proposed text for the:
a. Reference Software (using Chillout)b. Conformance Testingc. Informative Annex
Taken as a basis to produced output document. Decision to keep the informative annex on example Technical Work in Progress.
12.3 23000-6 Professional Archival AF
12.3.1 Topics1. Professional AF
75
12.3.2 ContributionsM15182: Updated requirements on Professional Archival Application Format. Taken as basis for producing requirements document.M15126: Proposed Editorial Update for ISO/IEC 23000-6 WD 1.0. Approved. Take as a basis to produce CD text.M15128: Proposal for Pre-Processing Tool Location Reference in Professional Archival Application Format. And M15129 : Set of MPEG-7 Tools for Professional Archival Applications Format. Accepted for introduction in FCD.
Technical Work in Progress.
12.4 23000-7 Open Access Application Format
12.4.1 Topics1. Open Access Application Format
12.4.2 ContributionsM15168: Open Access Application Format: Reference Software. Taken as basis to produce WD.M15171: GENB comments on the Study of the MPEG-21 REL Open Access Profile FPDAM. All comments where disposed of. See DoC.M15136: Study Text of ISO/IEC FCD 23000-7 Open access application format. Used to produce FDIS text.
Technical Work Completed.
12.5 23000-8 Portable Video Player MAF
12.5.1 Topics1. Portable Video Player MAF
12.5.2 ContributionsM15141: Editor's study text of ISO/IEC 23000-8/FCD Portable video application format. Taken as input for producing study text.M15184: Proposed workplan for Portable video application format conformance. Taken as input for producing output workplan.
Technical Work in Progress.
12.6 23000-9 DMB AF
12.6.1 Topics1. DMB MAF
12.6.2 ContributionsM15187: Proposed text of ISO/IEC 23000-9/PDAM1 DMB AF: Conformance and Reference software. Use as basis to produce WD.M15188: Proposed text of ISO/IEC 23000-9/DCOR1 DMB AF: timescale of TS.
Technical Work in Progress.
76
12.7 23000-10 Video Surveillance MAF
12.7.1 Topics1. Video Surveillance MAF 1st Edition
12.7.2 ContributionsM15157: Video Surveillance Application Format: Reference Software. Taken as basis for producing WD of reference software. See DoC.M15152: Study Text of ISO/IEC 23000-10/CD Video Surveillance Application Format. Taken as basis to produce text of FCD.M15115: Early UKNB comments on the Study of CD for the Video Surveillance Application Format.
Technical Work in Progress.
12.8 23000-11 Stereoscopic Video AF
12.8.1 Topics1. Vide
12.8.2 ContributionsM15203: Updated WD 23000-11 for Stereoscopic Video Application Format. Taken as input to produce text of the CD.
Technical Work in Progress.
13 MPEG-E Multimedia Middleware (23004)
13.1 Multimedia Middleware
13.1.1 Topics1. MPEG Multimedia Middleware
13.1.2 ContributionsNone.
Technical Work in Progress.
77
14 Requirements and Exploration
14.1 Standing DocumentsNone.
14.2 New Proposals
14.2.1 WIM TVIPTV : Informal report from MPEG experts who contributed to IPTV Focus Group. We will continue to liaise with ITU-SG16 and provide various MPEG specifications that would be relevant to the group.
M15206: WIM TV Trial at Beijing Olympics. Use case noted and will be use as a basis to drive requirements for WIM TV.
14.2.2 Proposal for Standardization of MPEG eXtensible MiddlewareM15208: Taken as a basis for the production of the requirements for MXM. Decision on a timeline (cf. Requirements document).
14.2.3 Proposal for Standardization of ROSE M15083: Requirements on RoSE Framework.M15091: Requirements on Framework for RoSE. Both contributions taken as input to produce updated requirement document.
14.2.4 Proposal for Standardization of Content Deliverables for Professionally Produced Film, TV, VideoM15216, M15105 : Working draft for proposed MPEG-M Production Deliverables standard & MPEG-M under the MPEG 21 Reflector.
The Systems sub-group, noting that :1. it has generated a Post-production Deliverable (PPD) requirements document at the 82nd WG11
meeting that was not made publicly available2. it has reviewed the results of the Ad hoc group on Requirements for MPEG Post Production
Deliverable Formats (N9549) 3. it has developed a Post-production Deliverable (PPD) working draft (WD) at the 83rd WG 11
meeting4. it has made a gap analysis between the current requirements and the PPD WD concluding that the
WD provides a broad coverage of PPD requirements, but thata. Some requirement are not yet formulated in a way that allows the development of a technical
solution;b. Some requirement were insufficiently documented to be able to perform the gap analysis;c. Some requirements were not fully satisfied
recommends making the PPD Requirement document as well as the WD publicly available and to liaise with relevant SDOs and trade organizations that may have an interest in this activity inviting them to comment on the two documents and to join in the development of the PPD standard.
The Systems sub-group recommends promoting the proposed PPD specification to WD and progressing the PPD standard with the following time line:
- CD : 2008/04
78
- FCD : 2008/10- FDIS : 2009/04
The WD will be progressed to CD pending the successful completion of the gap analysis between the current requirements and the CD.
14.2.5 Proposal for Standardization of Interfaces with Virtual Worlds None.
14.2.6 Presentation of Structured Information M15207: Requirements for Digital Item Presentation. Taken as input for the production of the requirement for this activity.
M15142: Considerations on Integrating LASeR and DID Technologies for WIM TV. Taken as input for the production of the WD on the Presentation of Structured Information.
14.2.7 Carriage of RVCJoint meeting with Video. Review requirements on the carriage of RVC on MPEG-2 Systems. The activity will be started at the next meeting. Technical inputs are welcomed.Known open issues : Updatability of descriptors in MPEG-2 Systems.
14.2.8 Scene PartitionningJoint meeting with 3DGC. Decision to integrate the Scene Partitionning specification in Part 11 and start a committee draft in Antalya.
15 LiaisonCf. Liaison output.
79
16 Latest References and Publication Status
Pr Pt Standard No. Issue Status Doc. With Purpose ISO Award
2 1 ISO/IEC 13818-1/Amd.7 Published 2000/12 Done2 1 ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition) 00/12 Published 2000/12 Proposed2 1 ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.) N3844 01/01 Pisa Published 2002/03 N/A2 1 ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor) N4404 01/12 Pattaya Published 2002/12 N/A2 1 ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1 N5867 03/07
TrondheimPublished 2003/12 Proposed
2 1 ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2) N5604 03/03 Pattaya Published 2004/03 N/A2 1 ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2) N5771 03/07
TrondheimPublished XXXX Proposed
2 1 ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP) N6847 04/10 Palma FDAM ITTF to be published N/A2 1 ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.) N6585 04/07
RedmondFDAM ITTF to be published N/A
2 1 ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture) N6845 04/10 Palma COR ITTF to be published N/A2 1 ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point) N7469 05/07 Poznan COR ITTF to be published N/A2 1 ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.) N7895 06/01
BangkokCOR ITTF to be published N/A
2 1 ISO/IEC 13818-1:2006 (MPEG-2 Systems 3rd Edition) 06/xx Published ITTF TBP2 1 ISO/IEC 13818-1:2006/Amd.1 (Transport of Streaming text) N8369 06/07
KlagenfurtFDAM ITTF to be published TBP
2 1 ISO/IEC 13818-1:2006/Amd.2 (Carriage of Auxialiry Video Data) N8798 07/01 Marrakech
FDAM ITTF to be published TBP
2 1 ISO/IEC 13818-1:2006/Cor.1.2 (Reference to AVC Specification) N9365 07/10 Shenzhen
FDAM ITTF to be published TBP
2 11 ISO/IEC 13818-1:2003 (IPMP on 2) N5607 03/03 Pattaya Published 2003/12 Proposed4 1 ISO/IEC 14496-1 (MPEG-4 Systems 1st Ed.) N2501 98/10 Atl. City Published 1999/12 Done
80
4 1 ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J) N3054 99/12 Hawaii Published 2001/11 Done4 1 ISO/IEC 14496-1/Cor.1 N3278 00/03
Noordwijk.Published 2001/11 N/A
4 1 ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.) N3850 01/01 Pisa Published 2001/11 N/A4 1 ISO/IEC 14496-1:2001/Amd.1 (Flextime) Published 2002/10 Done4 1 ISO/IEC 14496-1:2001/Cor.1 N4264 01/07 Sydney COR ITTF N/A4 1 ISO/IEC 14496-1:2001/Cor.2 N5275 02/10 Shangai COR ITTF N/A4 1 ISO/IEC 14496-1:2001/Cor.3 N6587 04/07
RedmondCOR ITTF N/A
4 1 ISO/IEC 14496-1:2001/Amd.2 (Textual Format) N4698 02/03 Jeju Island
AMD ITTF N/A
4 1 ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions) N5282 02/10 Shanghai
Published 2004-05 N/A
4 1 ISO/IEC 14496-1:2001/Amd.4 (SL Extension) N5471 02/12 Awaji Published 2003/12 N/A4 1 ISO/IEC 14496-1:2001/Amd.7 (AVC on 4) N5976 03/10
BrisbannePublished 2004-08 N/A
4 1 ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points) N6202 03/12 Hawaii AMD ITTF to be published N/A4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan PDAM ITTF Final Text
EditingN/A
4 1 ISO/IEC 14496-1:200x/Cor4 (Node Coding Table) N7473 05/07 Poznan PDAM ITTF to be published N/A4 1 ISO/IEC 14496-1 (MPEG-4 Systems 3rd Ed.) N5277 02/10
ShanghaiIS ITTF to be published Proposed
4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan PDAM ITTF Final Text Editing
N/A
4 1 ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior) N8117 06/04 Montreux
COR ITTF Final Text Editing
N/A
4 1 ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions) N8372 06/07 Klagenfurt
PDAM ITTF to be published N/A
4 1 ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies) N8646 06/10 Hangzhou
COR ITTF to be published N/A
4 1 ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems) N8860 07/01 Marrakech
PDAM ITTF to be published N/A
81
4 4 ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance) N8861 07/01 Marrakech
PDAM ITTF to be published N/A
4 4 ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance) N9295 07/07 Lausanne
PDAM ITTF to be published N/A
4 4 ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance) N9369 07/10 Shenzhen
PDAM ITTF to be published N/A
4 4 ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance) N9370 07/10 Shenzhen
PDAM ITTF to be published N/A
4 4 ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance) N9372 07/10 Shenzhen
PDAM ITTF to be published N/A
4 5 ISO/IEC 14496-1:200x/Amd.12 (File Format) N9020 07/04 San Jose PDAM ITTF to be published N/A4 5 ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft) N9672 08/01 Antalya PDAM ITTF to be published N/A4 5 ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft) N9674 08/01 Antalya PDAM ITTF to be published N/A4 6 ISO/IEC 14496-6:2000 Published 2000/12 N/A4 8 ISO/IEC 14496-8 (MPEG-4 on IP Framework) N4712 02/03 Jeju Published 2004-05 Proposed4 11 ISO/IEC 14496-11 (MPEG-4 Scene Description 3rd
Edition) N6960 05/01
HongKongFDIS SC29 Final Text
EditingProposed
4 11 ISO/IEC 14496-11/Amd.1 (AFX) N5480 02/12 Awaji FDAM ITTF Integration in 1st
Ed.N/A
4 11 ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics) N6205 03/12 Hawaii FDAM ITTF Integration in 1st
Ed.N/A
4 11 ISO/IEC 14496-11/Cor.1 N6203 03/12 Hawaii COR SC29 N/A4 11 ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594 04/07
RedmondCOR ITTF Integration in 1st
Ed.N/A
4 11 ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions N6591 04/07 Redmond
FDAM ITTF Integration in 1st
Ed.Proposed
4 11 ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions N6959 05/01 HongKong
FDAM ITTF Integration in 1st
Ed.N/A
4 11 ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3rd Edition) N7230 05/04 Busan COR ITTF Final Text Editing
N/A
4 11 ISO/IEC 14496-11/Cor.5 (Misc Corrigendum) N8383 06/07 Klagenfurt
COR SC29 N/A
82
4 11 ISO/IEC 14496-11/Amd.5 Symbolic Music Representation
N8657 06/10 Hangzhou
FDAM ITTF TBP
4 11 ISO/IEC 14496-11/Cor.6 (AudioFx Correction) N9021 07/04 San Jose COR SC29 N/A4 12 ISO/IEC 14496-12 (ISO Base Media File Format) N5295 02/10
ShanghaiPublished 2004-02 Proposed
4 12 ISO/IEC 14496-12/Amd.1 ISO FF Extension N6596 04/07 Redmond
FDAM ITTF FDAM 04/11/30 N/A
4 12 ISO/IEC 14496-12/Cor.1 (Correction on File Type Box)
N7232 05/04 Busan COR ITTF Final Text Editing
N/A
4 12 ISO/IEC 14496-12/Cor.2 (Miscellanea) N7901 06/01 Bangkok
COR ITTF Final Text Editing
N/A
4 12 ISO/IEC 14496-12/Amd.1 (Description of timed metadata)
N8659 06/10 Hangzhou
FDAM ITTF N/A
4 12 ISO/IEC 14496-12/Cor.3 (Miscellanea) N9024 07/04 San Jose COR ITTF Final Text Editing
N/A
4 12 ISO/IEC 14496-12/Amd.2 (Flute Hint Track) N9023 07/04 San Jose FDAM ITTF N/A4 13 ISO/IEC 14496-13 (IPMP-X) N5284 02/10
ShanghaiIS ITTF to be published Proposed
4 14 ISO/IEC 14496-14 (MP4 File Format) N5298 02/10 Shanghai
Published 2003-11 Proposed
4 14 ISO/IEC 14496-14/Cor.1 (Audio P&L Indication) N7903 06/01 Bangkok
COR ITTF Final Text Editing
N/A
4 15 ISO/IEC 14496-15 (AVC File Format) N5780 03/07 Trondheim
Published 2004-04 Proposed
4 15 ISO/IEC 14496-15/Amd.1 (Support for FREXT) N7585 05/10 Nice FDAM ITTF Final Text Editing
N/A
4 15 ISO/IEC 14496-15/Cor.1 N7575 05/10 Nice COR ITTF N/A4 15 ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction) N8387 06/07
KlagenfurtCOR ITTF N/A
4 15 ISO/IEC 14496-15/Amd.2 (SVC File Format Extension)
N9682 08/01 Antalya FDAM ITTF N/A
4 17 ISO/IEC 14496-17 (Streaming Text) N7479 05/07 Poznan FDAM ITTF TBP
83
4 18 ISO/IEC 14496-18 (Font Compression and Streaming) N6215 03/12 Hawaii Published 2004-07 Proposed4 18 ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and
clarification)N8664 06/10
HangzhouCOR ITTF N/A
4 19 ISO/IEC 14496-19 (Synthesized Texture Stream) N6217 03/12 Hawaii Published 2004-07 Proposed4 20 ISO/IEC 14496-20 (LASeR) N7588 05/10 Nice FDAM Editor TBP4 20 ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and
clarification)N8666 06/10
HangzhouCOR ITTF N/A
4 20 ISO/IEC 14496-20/Amd.1 (LASeR Extension) N9029 07/04 San Jose FDAM ITTF N/A4 20 ISO/IEC 14496-20/Cor.2 (Profile Removal) N9381 07/10
ShenzhenFDAM ITTF N/A
4 20 ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support) N9384 07/10 Shenzhen
FDAM ITTF N/A
4 22 ISO/IEC 14496-22 (Open Font Format) N8395 06/07 Klagenfurt
FDAM Editor Final Text Editing
TBP
7 1 ISO/IEC 15938-1 (MPEG-7 Systems) N4285 01/07 Sydney Published 2002/07 Done7 1 ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions) N6326 04/03 Munich FDAM ITTF FDAM 04/11/28 N/A7 1 ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum) N6328 04/03 Munich COR Editor N/A7 1 ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum) N7490 05/07 Poznan COR ITTF N/A7 1 ISO/IEC 15938-1/Amd.2 (BiM extension) N7532 05/10 Nice FDAM ITTF N/A7 2 ISO/IEC 15938-2 (MPEG-7 DDL) N4288 01/07 Sydney Published 2002/02 Done7 7 ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance) N8672 06/10
HangzhouFDAM ITTF N/A
21 9 ISO/IEC 21000-9 (MPEG-21 File Format) N6975 05/01 HongKong
FDIS ITTF FDIS 05/01/21 Done
21 16 ISO/IEC 21000-16 (MPEG-21 Binary Format) N7247 05/04 Busan FDIS ITTF FDIS 05/04/22 TBP21 5 ISO/IEC 21000-5 (Open Release Content Profile) N9687 08/01 Antalya FDAM ITTF TBPA 1 ISO/IEC 23000-4 (Musical Slide Show MAF) N9037 07/04 San Jose FDIS ITTF TBPA 1 ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF) N9397 07/10
ShenzhenFDIS ITTF TBP
A 1 ISO/IEC 23000-7 (Open Access MAF) N9698 08/01 Antalya FDIS ITTF TBPB 1 ISO/IEC 23001-1 (XML Binary Format) N7597 05/10 Nice FDIS ITTF TBP
84
B 1 ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical clar.)
N8680 06/10 Hangzhou
COR ITTF N/A
B 1 ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical clar.)
N9049 07/04 San Jose COR ITTF N/A
B 1 ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.) N8886 07/01 Marrakech
FDAM ITTF N/A
B 1 ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild cards)
N9296 07/07 Lausanne
PDAM ITTF to be published N/A
B 2 ISO/IEC 23001-1 (Fragment Request Unit) N9051 07/04 San Jose FDIS ITTF TBPB 3 ISO/IEC 23001-3 (IPMP XML Messages) N9416 07/04 San Jose FDIS ITTF TBPE 1 ISO/IEC 23008-1 Architecture N8892 07/01
MarrakechFDAM ITTF N/A
E 2 ISO/IEC 23008-2 Multimedia API N8893 07/01 Marrakech
FDAM ITTF N/A
E 3 ISO/IEC 23008-3 Component Model N8894 07/01 Marrakech
FDAM ITTF N/A
E 4 ISO/IEC 23008-4 Ressource & Quality Management N8895 07/01 Marrakech
FDAM ITTF N/A
E 5 ISO/IEC 23008-5 Component Download N9053 07/04 San Jose FDAM ITTF N/AE 6 ISO/IEC 23008-6 Fault Management N9054 07/04 San Jose FDAM ITTF N/AE 7 ISO/IEC 23008-7 System Integrity Management N9055 07/04 San Jose FDAM ITTF N/A
29116 1 ISO/IEC 29116 Media Streaming MAF Protocols N9420 07/10 Shenzhen
FDAM ITTF N/A
85
17 Resolutions of Systems
Cf. WG11 resolution.
18 Contributions Reviewed by the Systems Subgroup
N° Title Authorsm15038
Ad Hoc Group on MPEG File Formats David SingerVisharam Mohammed
m15059
Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 16 [SC 29 N 8926]
SC 29 Secretariat
m15060
Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 17 [SC 29 N 8927]
SC 29 Secretariat
m15063
Liaison Statement from SC 29/WG 1 [SC 29 N 8956] SC 29 Secretariat
m15064
Liaison Statement from SC 29/WG 1 [SC 29 N 8957] SC 29 Secretariat
m15066
Summary of Voting on ISO/IEC 14496-15:2004/FPDAM 2 [SC 29 N 8961]
SC 29 Secretariat
m15077
Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N 9035]
SC 29 Secretariat
m15079
Editor Study on ISO/IEC 14496-5:2001/FPDAM 16 Symbolic Music Representation reference software
Pierfrancesco BelliniPaolo NesiGiorgio ZoiaMaurizio Campanai
m15081
USNB Contribution: Proposed amendment to ISO/IEC 14496-22
Andy Tescher for the USNB
m15082
The proposal for amendment of ISO/IEC 14496-22 (in support of USNB comment m15081)
Simon DanielsMichelle HillVladimir Levantovsky
m15083
Requirements on RoSE Framework Sanghyun JooBumsuk ChoiMunchurl Kim
m15091
Requirements on Framework for RoSE Jean GelissenMark Verberkt
m15093
Proposed re-structured ISO Base Media File Format Per FröjdhDavid Singer
m15093
Proposed re-structured ISO Base Media File Format Per FröjdhDavid Singer
m15105
MPEG-M under the MPEG 21 Reflector Julie LoftonJeff Steele
m15105
MPEG-M under the MPEG 21 Reflector Julie LoftonJeff Steele
m1510 Proposal of Reference Software for MPQF. Validation of Ruben Tous
86
N° Title Authors9 embedded XQuery expressions. Jaime Delgadom15115
Early UKNB comments on the Study of CD for the Video Surveillance Application Format
James Annesley
m15124
Use cases for content protection in Musical slide show Application Format 2nd Edition
Houari SabirinMunchurl Kim
m15126
Proposed Editorial Update for ISO/IEC 23000-6 WD 1.0 HendryMunchurl Kim
m15127
Editor's study text of ISO/IEC 23000-4/PDAM1 Musical slide show application format
Hyouk-Jean ChaTae Hyeon KimJisoo Hong
m15128
Proposal for Pre-Processing Tool Location Reference in Professional Archival Application Format
HendryMunchurl Kim
m15129
Set of MPEG-7 Tools for Professional Archival Applications Format
HendryHouari SabirinMunchurl Kim
m15133
Proposed corrections to ALC/FLUTE server file format Jani PeltotaloMiska M. Hannuksela
m15134
Proposed additions to ALC/FLUTE server file format Jani PeltotaloMiska M. Hannuksela
m15135
MPEG-21 schema assets update Christian Timmerer
m15136
Study Text of ISO/IEC FCD 23000-7 Open access application format
Florian Schreiner
m15138
Multiple MPEG-21 DIA AdaptationQoS Descriptions within a Digital Item
Ingo KoflerChristian TimmererHermann Hellwagner
m15141
Editor's study text of ISO/IEC 23000-8/FCD Portable video application format
Hyouk-Jean ChaTae Hyeon KimHerbert Thoma
m15142
Considerations on Integrating LASeR and DID Technologies for WIM TV
Jihun ChaInjae LeeYoung-Kwon LimKyungAe MoonJinwoo Hong
m15146
MPEG2-TS and RTP reception hint tracks Stefan DöhlaMiska M. Hannuksela
m15147
Extended sample grouping mechanism for the ISO Base Media File Format
Stefan DöhlaMiska M. Hannuksela
m15148
Proposed conformance files for ALC/FLUTE server file format
Jani PeltotaloMiska M. Hannuksela
m15152
Study Text of ISO/IEC 23000-10/CD Video Surveillance Application Format
Gero Bäse
m15157
Video Surveillance Application Format: Reference Software
James Annesley
m15168
Open Access Application Format: Reference Software Florian Schreiner
87
N° Title Authorsm15171
GENB comments on the Study of the MPEG-21 REL Open Access Profile FPDAM
Florian Schreiner
m15173
Editors' Input to ISO/IEC 14496-15/FPDAM 2 (SVC File Format)
Dave SingerYe-Kui WangThomas Rathgen
m15176
Paging function in MPEG Query Format Masanori SanoHideki SumiyoshiNobuyuki Yagi
m15177
Interpretation Consistency for SpatialQuery and TemporalQuery
Masanori SanoHideki SumiyoshiNobuyuki Yagi
m15178
Codec-independent color information in part 12 files David Singer
m15179
Backwards-compatibility for alternate groups DW Singer
m15182
Updated requirements on Professional Archival Application Format
Noboru HaradaTakehiro MoriyaYutaka Kamamoto
m15184
Proposed workplan for Portable video application format conformance
Hyouk-Jean Cha
m15187
Proposed text of ISO/IEC 23000-9/PDAM1 DMB AF: Conformance and Reference software
Hui Yong KimHouari SabirinMunchurl Kim
m15188
Proposed text of ISO/IEC 23000-9/DCOR1 DMB AF: timescale of TS
Hui Yong KimMyungSeok KiGun Bang
m15189
Proposed WD on 14496-12 ISO-FF Amendment: MPEG-2 TS storage
Hui Yong KimGun BangMyungSeok KiHan-Kyu LeeYong Han Kim
m15195
Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1
Xiaozhong XuXilin ChenTiejun Huang
m15203
Updated WD 23000-11 for Stereoscopic Video Application Format
Kyuheon Kim
m15203
Updated WD 23000-11 for Stereoscopic Video Application Format
Kyuheon Kim
m15205
Proposed Working Draft of ISO/IEC 23000-5 2nd Edition
Filippo Chiariglione
m15206
WIM TV Trial at Beijing Olympics L. ChiariglionePhilip MerrillLuntian MouOlivier AvaroXin Wang
m15207
Requirements for Digital Item Presentation L. ChiariglioneOlivier Avaro
88
N° Title Authorsm15208
Requirements for MPEG eXtensible Middleware (MXM) L. Chiariglione
m15210
China NB Comments on Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1
China National Body (CNNB)
m15211
Liaison Statement from DVB [SC 29 N 9045] DVB via SC 29 Secretariat
m15212
KNB Comments on ISO/IEC 23000-4 2nd Edition FCD KNB
m15216
Working draft for proposed MPEG-M Production Deliverables standard
Julie LoftonJeff Steele
m15216
Working draft for proposed MPEG-M Production Deliverables standard
Julie LoftonJeff Steele
m15220
Late GNB on ISO/IEC 13818-1:2007/FDAM3 Thomas Shierl
89
Annex G – Video report
Source: Jens Ohm and Gary Sullivan, Chairs
1 MPEG-2 Support for 1080/50p/60p
More industry support was brought regarding a new level for MPEG-2 video, which would enable compatibility of decoders for 1080p 50 and 60 fps formats. It was decided to place this new level on top of the previous existing levels in Main profile, but prohibiting the use of interlace-oriented tools in bitstreams of the new level (although decoders remain required to be capable of decoding bitstreams of lower levels, which may use these tools).
PDAM texts related to the video standard and to the conformance standard were produced.Work related to support for larger formats in the 4:2:2 profile is reported to be under further study. This will not affect the timeline of the ongoing amendment work, but if necessary be handled in a later work item.
Documents reviewed:m15100 Proposal of new level to support
1080@50p/60p for MPEG-2 videoTeruhiko Suzuki, Ajay Luthra, Yi-Jen Chiu
Documents approved:No. Title TBP Available9563 Request for 13818-2:2000/Amd.3 N 08/01/189564 Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080@50/60p N 08/01/289583 Request for 13818-4:2004/Amd.3 N 08/01/189618 Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p
Conformance TestingN 08/01/28
2 MPEG-4 Simple Studio Profile Levels 5 & 6
The amendment work to support larger formats (beyond 1920x1080) with MPEG-4 simple studio profile was started by the 82nd meeting. Some necessary modifications to the PDAM text were reported in M15097. Furthermore, work on conformance (both for the newly proposed levels, and for improvements for already existing levels) has progressed (M15099). It was decided to issue Study texts for the PDAMs based on these contributions.
Documents reviewed:m15061 Summary of Voting on ISO/IEC
14496-5:2001/Amd.1:2002/DCOR 1 [SC 29 N 8938]
SC 29 Secretariat
m15097 Proposal for MPEG-4 visual studio profile level 5 and 6
Teruhiko Suzuki, Nick Saunders, John Stone, Paul GardinerFound some problems in the extension after the last meeting. First issue is macroblock number and total number of macroblocks in slice header. Another issue is wrong syntax for intra_DC in case of RGB. Put first issue in Study of PDAM, and second issue in Study of DCOR.
m15099 Proposal for MPEG-4 visual studio profile conformance testing
Teruhiko SuzukiAdd new conformance streams for levels 2-4 into Amd.35 (Study of PDAM). Replace the entire table for StuP conformance streams, including marks that the basic functional testing streams also apply to simple StuP.
90
Documents approved:No. Title TBP Available9565 Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio
Profile Levels 5 and 6N 08/01/18
9566 Study Text of ISO/IEC 14496-2:2004/DCOR3 N 08/01/189567 Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio
Profile Levels 5 and 6 Conformance TestingN 08/01/18
9570 Disposition of Comments on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR 1
N 08/01/18
9571 Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1 N 08/02/01
3 Development of AVC
The video subgroup jointly approved the ISO standard related output documents that were produced during the 26th JVT meeting which was held in parallel. Important work items in this context were as follows
– SVC verification tests– Approval of MVC PDAM– Preparation of software and conformance FPDAM for Scalable Video Coding
The report of the SVC verification tests was finalized, using conditions suitable for a range of possible application scenarios for progressive video, including
Video-conferencing with quality scalability for the Common Intermediate Format (CIF, 352x288 pixels) at 30 frames per second (fps) video, and spatial scalability for 640x352 pixels at 60 fps video with an enhancement substream for 1280x704 pixels at 60 fps;
Mobile TV with quality scalability for the Quarter Video Graphics Array (QVGA, 320x240 pixels) format at 25 fps video, and spatial scalability for QVGA at 12.5 fps with an enhancement substream for VGA (640x480 pixels) at 25 fps enhancement;
HD TV with spatial scalability for 720p (1280x720 pixels) at 50 fps with 1080p (1280x1080 pixels) at 50 fps enhancement; and
Movie production with spatial scalability for 1080p at 25 fps being the highest resolution, with two lower resolutions provided for scalability.
For the performance evaluations, SVC was compared against AVC single layer coding by means of subjective testing. Subjective tests were performed following relevant international recommendations using a controlled environment and a high number of test subjects.
The results of these tests indicate that these various types of scalability for these applications can be achieved with a bit rate overhead typically equal to or less than 10% when compared to AVC single layer coding using only the highest resolution in the test case. In the HDTV and movie cases, comparable quality was achieved with no apparent need to increase bit rate at all. The bit rate savings obtained by SVC compared to AVC simulcast transmission depend on the particular test case, and were found to be between 17% and 40% of the simulcast bit rate. These bit rate savings relative to simulcast are particularly important for applications in which video must be provided with different spatial resolutions, for which simulcast would previously have been the only available AVC-based standardized solution.
All results and more detailed description of the test setup are included in the public test report (N9577).
91
MVC has reached the level of FPDAM as amendment 1 of the new edition of AVC. The specification does not include any new coding tools at the macroblock level or below (see JVT report).
A first contribution related to MVC profiling was discussed jointly with JVT and MPEG Requirements SG. The current idea is defining only on “multiview high profile”, no interlaced coding, constraint set flag could be used to perform switching between main and high. Level definitions are preliminary and will need more careful investigation about buffer sizes, restrictions in inter-view prediction etc. In general, it should be avoided to define an entirely new set of levels beyond the existing ones. One solution could be made e.g. by starting from maximum number of macroblocks per second, and derive therefrom useful values e.g. for maximum number of views etc. within a given level.
Further study will be necessary taking into account requirements of certain applications (such as stereo, n-view), and investigate for specific levels, whether e.g. the numbers of possible reference pictures are sufficient for the multiview application. Ways to enable parallel processing should also be considered.
Documents reviewed:m15193 Summary of Voting on ISO/IEC
14496-4:2004/PDAM 31SC 29 Secretariat
m15194 Summary of Voting on ISO/IEC 14496-5:2001/PDAM 19
SC 29 Secretariat
m15108 Subjective results for the SVC Verification Test
Tobias Oelbaum ([email protected])
m15132 Verification of new SVC Verification Test Streams
Mathias Wien
m15196 Proposal on Profiles for MVC (Multi-view Video Coding)
Hideaki Kimata, Hiroya Nakamura, Takashi Itoh
Documents approved:No. Title TBP Available9568 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 31 N 08/01/189569 Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing
for Scalable Video CodingN 08/02/29
9572 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 19 N 08/01/189573 Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for
Scalable Video CodingN 08/03/20
9574 Text of ISO/IEC 14496-10:200X/DCOR 1 N 08/04/119575 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 N 08/01/189576 Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video
CodingN 08/02/15
9577 Report on SVC Verification Tests Y 08/01/189578 Joint Multiview Video Model (JMVM) 7 N 08/02/159579 JMVM 7 Software N 08/02/229580 Overview of Multiview Video Coding (MVC) Y 08/01/18
4 MPEG-7 Visual
4.1 MPEG-7 Visual related work in AntalyaThe MPEG-7 breakout group was active during the whole week. Input documents related to the Visual part in 15938-3 are listed in the table below. All these documents were reviewed and discussed.
92
m15103 CE Report of VCE-5 Sangyoun Leem15104 Text/Logo Mask Image Generation
Software for VCE-7Kota Iwamoto, Ryoma Oami
m15106 Contribution of video test material for MPEG-7 video signature CE
Weon Genu Oh, Daeil Yoon, Jie Jia, Hae Kwang Kim
m15114 Proposal on Frame-Reduction video clip format
Ju-Kyong Jin, Weon-Geun Oh, Dong-Jin Seo, Sang-il Na, Jae-Hyun Huh, Dong-Seok Jeong
m15122 Errors in MPEG-7 reference software
James Annesley
m15130 Experiment Results of Image Signature for Complex Conditions
Weon-Geun Oh, Ayoung Cho, Won-Keun Yang, Ik-Hwan Cho, Ju-Kyong Jin, Jun-Woo Lee, Dong-Seok Jeong
m15131 The Extra Experiment Result to Verify Performance Measure Method of MPEG-7 VCE-6
Weon-Geun Oh, Won-Keun Yang, Ayoung Cho, Dong-Seok Jeong
m15137 Cross verification result for ETRI VCE-6 proposal
Min-Jeong Lee, Heung-Kyu Lee
m15139 Video Signature based on Inter-frame Correlation Coefficients
[email protected] Zheng Huang, [email protected] Tiejun Huang, yhtian@ @pku.edu.cn Yonghong Tian
m15140 Visual Signature based on Waston Perceptual Model
[email protected], [email protected], [email protected]
m15169 Correction to Image Signature XM Software
Paul Brasnett, Miroslaw Bober
m15170 Performance Evaluation of Image Signature on Extended Database
Paul Brasnett, Miroslaw Bober
m15172 Extending the Trace Transform Image Signature to Complex Conditions
Paul Brasnett, Miroslaw Bober
m15181 Cross verification result of Image Signature (VCE-6)
Karol Wnukowicz
m15217 Updated Results on Extended Trace Transform Image Signature
Paul Brasnett, Miroslaw Bober
On major work item has been the further review of image signature descriptors as investigated in VCE-6. A Dataset of approximately 130.000 images was used. Independence was tested on 8.45 billion image pairs. Robustness was tested on 250.000 images (24+1 different modifications). The following findings were made, comparing 15130 vs. 15217:– Average performance above 90% at 10ppm for both methods– 15130 (“concentric circles” method) performs significantly better for cropping– 15217 (extension of method in XM/WD, “trace transform” method) performs
slightly/sometimes significantly better for most other cases– Cropping result is very specific, because 15130 method would fail in case of non-center
crops, but 15217 has also very poor result for case of heavy croppingIn general, it can be concluded that both methods require more development for the cases of the more difficult modifications. In particular, further investigations appear necessary for cases of more localized signatures (which were extracted similarly in both methods, using areas around feature points). The global descriptors appear useful only for simple conditions, but could be also used as first step in quick database search, sorting out the clearly dissimilar images.
The following decision was made:- The technology of current WD (global signature) is promoted for PDAM- For complex operation, most probably good localized descriptors would be required, which
must not necessarily be derivates from the current global descriptor. Further investigations will be made in the upcoming CE, also taking into account combinations of complex conditions, such as combination of translation and cropping (no-center case), and also with scaling. necessary
- The method of 15217 (global WD/XM descriptor with some more localized feature extraction) will not be put into a new version of the XM, because it may turn out in CE that other localized descriptors perform better.
For Video Signatures, it is estimated that 208 hours of video content are required, of which ~100 hours have already been collected or committed to be submitted in near future. Varied content is
93
required such as: sports, news, film, soap, variety of others. The set should consist of approx. 4.000 longer clips and 24.000 shorter clips; 50 million comparisons must be performed for the envisaged range of quality (successful hits vs. false alarm rate).
Furthermore, work on software to create video modifications automatically has been done in VCE-7. Commitment for providing the remaining data material is expected for the time between Antalya and Archamps meetings. It is therefore decided to delay the CfP by one meeting to finalize the testing database work.
Technically, two new proposals (ideas) were received and reviewed, related to video signatures. Their testing awaits completion of the full database.
The following timeline is planned for the ongoing work on video signatures: Final CfP: 2008/04 with responses 2008/07 PDAM: 2008/07 FPDAM: 2009/01 FDAM: 2009/07
Three Core Experiments will continue:– Face Recognition in IR images– Image Signature for Complex conditions– Video Signature (collection of test material and software tools for CfP preparation)
A bug report concerning visual XM was reviewed. The bug reported and solutions proposed will be verified by the XM maintenance team at Warsaw University.
The development of the reference software for ISO/IEC 23000-3 was continued. The BIM incompatibility issue was solved, new SDL’s were provided to generate bitstreams. The Software is being rewritten to reflect new SDL’s, expected completion is within 3 weeks after the Antalya meeting.
4.2 Output documents related to MPEG-7 Visual
No. Title TBP Available15938-3 Visual
9581 Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools N 08/01/289582 Description of Core Experiments for MPEG-7 New Visual
ExtensionsN 08/01/18
5 23002 MPEG-C Video Technologies
5.1 23001-4 and 23002-4 Reconfigurable Video Coding (RVC)
5.1.1 Allocation of input contributions
MPEG-B related CE (Monday Afternoon January 14, 2008 2:00PM or Tuesday Morning)Doc. No. Authors Title
m15113 Sinwook LeeJaebum JunByeongjun KimChungku YieEuee S. Jang
The results of RVC CE 1.2Review comments: compression results of CDDL are provided but there is still no comparison with BSDL description compressed with BiM. There are hints that suggests that a CDDL representation can be converted back to
94
BSDL schema.
Recommendations: continue the investigation for comparing BiM compression and to prove that conversion to BSDL schema is possible.
m15125 Hyungyu KimSikyung KimMyungjoong LeeChungku YieEuee S. Jang
The results of RVC CE 1.1Review comments: the contribution presents results of compressing XML-based RVC DDL. The compression performances are higher for CDDL, however there is no proof that applying a XML schema to BiM would not achieve better results.
Recommendations: to include into the Study document and solicit contributions showing that CDDL can perform better than BiM when using a DDL schema.
m15117 Byeongjun KimJaebum JunHyungyu KimChungku YieEuee S. Jang
Study of Application Requirements Related to RVCReview comments: doubts on addressing such requirements at RVC level while they should be addressed at system level that support RVC. The group acknowledges that such requirements should be taken into account at both RVC and System level.
Recommendations: start some studies for RVC on IP and on MPEG-2 TS, including application scenarios.
m15159 Christophe LucarzDandan DingJianjun LiMarco Mattavelli
BSDL Description of MPEG-4 SP and AVC BP Bitstream Syntax for RVC FrameworkReview comments: the contribution describes how BSDL can be used to describe a low-level bitstream.
Recommendations: add the BSDL description to the “study of CD” and proceed with validation of the schema. Include extensions needed to describe the low-level segments of the bitstreams (VLD) into the study document.
m15163 Christophe LucarzJianjun LiMarco MattavelliDandan Ding
Auto-generation of RVC Parser from BSDL Syntax Description: Variable Length DecodingReview comments: the contribution proposes a systematic procedure for generating VLD decoding FUs from VLD tables
Recommendations: to include the technology as informative procedure for the generation of bitstream parsers from BSDL descriptions.
m15166 Dandan DingChristophe LucarzMarco MattavelliLu Yu
Function Units for Conversion from Syntax to Sequence of Tokens: BTYPEReview comments: the contribution presents FUs for Btype generation and Motion Vectors generation for the instantiation of parser from a BSDL description.
Recommendations: to complete the implementation work for MPEG-4 SP, AVC and MPEG-2 for possible inclusion in the MPEG toolbox.
m15199 Hyungyu KimSinwook LeeByeongjun KimChungky YieEuee S. Jang
Proposed text of CCR CD: A chapter for DD transmissionReview comments: the contribution presents the text for the CD for inclusion of CDDL in the CD.
Recommendations: keep this co ntribution for possible inclusion of the CDDL technology in the FCD.
m15200 Dandan DingLu YuHonggang QiTiejun HuangWen Gao
BSDL Description of AVS Bitstream Syntax for RVC FrameworkReview comments: the contribution presented the extensions needed to
95
fully describe AVS bitstream syntax using BSDL.
Recommendations: the group recommends to check and define necessary extensions and compare them to the one of AVC for next meeting.
MPEG-C related CE (Tuesday Morning January 15, 2008 9:00AM)Doc. No. Authors Title
m15080 Gwo Giun LeeHe-Yuan LinMing-Jiun Wang
Functional units of AVC inter-prediction for adaptive interlace codingReview comments: the contribution identifies new FUs necessary to implement interlaced adaptive coding for AVC.
Recommendations: continue the work, develop textual description and CAL SW and respect the naming convention for inclusion in FCD at next meeting.
m15107 Kenji OtoiYoshihisa YamadaKohtaro Asai
Proposed text of the RVC FUs for MPEG-2Review comments: the contribution updates the textual description of FUs for MPEG-2
Recommendations: include update in the MPEG-C CD
m15156 Dandan DingMarco MattavelliChristophe LucarzLu Yu
Update of Classification of Tokens for FUs of MPEG-4 SP and MPEG-4/AVC in RVC FrameworkReview comments: the contributions updates token classification of FUs included in the RSM.
Recommendations: update classification for newly submitted FU (MPEG-2) and interlaced AVC coding modes to include them in the FCD.
m15164 Christophe LucarzJianjun LiMarco MattavelliDandan Ding
Functional Units for RVC Toolbox: Variable Length DecodingReview comments: the contribution presents new FUs for VLD decoding to be included in the RVC toolbox
Recommendations: to include the new FUs in the toolbox and provide the textual descriptions for the MPEG-C study of CD.
EE related (to be discussed on)Doc. No. Authors Title
m15202 Honggang QiTiejun HuangWen GaoDandan DingLu Yu
Text Description for Bitstream Parser FU of AVSReview comments:
Recommendations:
General (Tuesday Afternoon January 15, 2008 at 2:00PM)Doc. No. Authors Title
m15167 M. RauletG. RoquierM. WipliezJF. NezanO. Deforges
Update of CAL2C code generationReview comments:
Recommendations:
96
5.1.2 Action points after contribution reviewDecide naming convention for VLD decoding FUs (All). Done, include definition in the Study document.
Upload BSDL schemas and corresponding bitstreams on the CVS:o MPEG-4 SP (Dandan, Christophe) 05-02-08o AVC (Mickael) 12-02-08o AVS (Dandan) 12-02-08
Generate MPEG-2 BSDL description (Dandan) 28-02-08Upload on CVS BSDL and bitstreams for MPEG-2 28-02-08
SW for generation of VLD FUs o Upload the SW for generation of CAL FUs on the CVS (Dandan, Christophe) 25-01-2008
New VLD decoding FU in the toolbox:o Upload new VLD decoding FUs on the CVS with correct naming convention (Dandan,
Christophe) for MPEG-4 SP 25-01-2008o and provide textual description for study of CD part C 25-01-2008o Generate VLD decoding FUs for MPEG-2 (Yamada, Dandan) 25-01-2008o and provide textual descriptions for the study document of part C 02-02-08o Generate VLD decoding FUs for AVC (Dandan, He-Yuan/Chris) 02-03-08o and provide textual descriptions for the study document of part C 02-02-08
Non normative FUs:o Upload AVS parser on the CVS (Dandan) 20-01-08o and provide textual description.o Generate AVS VLD FUs and upload on the CVS (Dandan) 02-02-08o Provide textual description 02-02-08
Update token classification with all new uploaded FUs (VLD FUs, AVS parser, MPEG-2, AVC interlaced) for the study document of CD part C 15-03-8
Provide text explaining the procedure for BSDL schema validation to be included in the study of CD for part B as informative annex. (Mickael) 07-02-08
Include Copyright disclaimer to all CAL FUs in the CVS (Christophe) 15-02-08
5.1.3 RVC - Systems Joint meeting on Systems RVC support
After revision of the requirements for RVC systems support it was concluded that:1. Most of the mechanisms needed by RVC are available in Systems technology (MPEG-2
TS, MP4 file format), but not in all transport formats (i.e. switch of a configuration at a given time in a stream is available only in MPEG-4 transport format)
2. Some other mechanisms are not directly available (i.e. change of systems parameters such as buffer size, bitrate, etc etc, …)
3. Activity on systems level support for RVC can start from next meeting with input contributions that address one or more of the Systems requirements approved at this meeting.
97
4. Commitments for input contributions to the Systems Group for next April meeting have been agreed and are reported in the workplan document.
5.1.4 Output document processing
CD Part B (Editor: Gwo Giun) Revision completed in Wednesday session
Study of CD Part B
Section 5.2 BSDL extensions for RVC + examples of BSDL schema for MPEG-4 SP, AVC, MPEG-2 (Dandan, Christophe, Mickael, Marco) 28-02-08
Annex D Non normative procedure for:– BSDL schema validation (Mickael)– the instantiation of parsers fro the ADM (Dandan, Christophe, Mickael, Marco)
New section for compressed decoder description (Euee, Hyungyu) Annex D non normative procedure for instantiation of a ADM from a compressed
description.
CD part C (Editor: (Gwo Giun) Yishin) – finalize
Study of CD part C (Editor: (Gwo Giun) Yishin) Include all new FUs Update Token classification
Conformance WD 4.0 (Editor: Gwo Giun) New version approved at this meeting (two weeks editing period)
Output Documents:
No. Title TBP Available23001-4 Codec Configuration Representation
9584 Study Text of ISO/IEC CD 23001-4 Codec Configuration Representation
N 08/03/17
23002-4 Video Tool Library9585 Reconfigurable Video Coding Requirements V 4.0 N 08/01/189586 Overview of Reconfigurable Video Coding (RVC) Y 08/02/029587 Study Text of CD ISO/IEC 23002-4 Video Tool Library N 08/03/179588 Extensions of Video Tool Library under consideration N 08/02/049589 Description of Core Experiments in RVC N 08/02/049590 RVC Simulation Model (RSM) V7.0 N 08/02/049591 RVC Work Plan and FU Development Status N 08/01/189592 RVC Conformance Testing Working Draft V4.0 N 08/02/049593 Description of Exploration Experiments in RVC N 08/01/189594 Methodologies for Video Toolbox Extension V2.0 N 08/03/24
6 Explorations – Free Viewpoint Video/Television
98
The exploratory work on free-viewpoint video has its roots in the “3DAV” exploration, which was originally started in December 2001, and later led in a first CfP on multiview video compression technology (current MVC development in JVT). As discussed in the previous meeting, FTV can be defined as a compressed representation and associated technologies which enable generating a large number of different views from a sparse view set. This most probably (from technologies currently known) requires implementation of depth/disparity map estimation (non-normative), definition of depth/disparity map representation/compression and interpolation/rendering method (not clear yet whether the latter should be non-normative or normative). All of these elements rely on each other, such that proper technology selection will most probably not be simple. Furthermore, higher distortion may be expected than for MVC (or at least quality may not be measurable in terms of pixel fidelity, geometric distortions may appear that might only be noticeable under certain observation conditions). The amount of distortion most probably would also depend on compactness (density of views) and complexity of the methods. Depending on concrete application, the view number to be generated may range from two for simple stereoscopic up to "many" for almost-free walk-through a scene.
In Antalya, more clarification was achieved about the focus of the next phase of the FTV work. Realistic market needs appear to be existing in supporting standardized formats for upcoming 3D (M-view) displays where the number of views M as locally generated influences the quality of visual perception. While currently numbers of approximately M=9 are used in prototypes, while for the future M of up to 40-50 could be expected. Even then, the view angle support will be relatively narrow (max. 20 degrees left-to-right), which is a clear (and implementation-wise realistic) limitation as compared to the “really free” FTV scenario. One additional advantage could be that with such narrow view alteration ranges, co-planar camera setups could still be useful. After extensive discussion, the group agreed that the name of "3D Video" is very well reflecting this subset scenario of FTV (namely, enabling technology for 3D video displays).
Related to the last meeting’s call for test sequences and depth/disparity estimation algorithms the following input contributions were brought:- 2 contributions announcing generation of new test materials (15089, 15102), both will use
dense camera arrays. First proposal for 80 cameras with 5 cm baseline (convergent); the other 15 cameras with 7 cm baseline (linear/co-planar).
- 4 contributions on depth estimation (15090, 15119, 15175, 15191)- 2 contributions on view generation (15090, 15120)In addition, M15101 reports corrected camera parameters for an existing sequence, M15047 and M15088 relate to more generic applications and requirements of depth map technology and FTV (no need detected to revise the apps & reqs document based on these contributions). M15088 indicated possibilities to perform skipped view encoding when higher-quality depth information is available.
Following the focus as described above, it was concluded that co-planar camera setup would be the optimum case for this kind of 3D Video applications, and test sequences should be captured according to this. Nevertheless, slight rectification would most likely be necessary even in the parallel setup, due to slight variation in camera properties and the impossibility for perfect mechanical adjustment. Nevertheless, the original shots should be as close to co-planar as possible to keep rectification artifacts to a minimum. Sequences should be provided in rectified, illumination- and color-compensated version.Following this, a new call for test sequences was produced, which also includes a high-level description of 3DTV and FTV to make the purposes clear for which the new materials should be useful. Again, it is called to provide depth maps, depth estimation and interpolation software packages. To get more evidence about the possible elements of the 3D video chain, an Exploration Experiment was started to find out how depth estimation and interpolation inter-relate, based on the proposals that were brought (and for which software must be made available
99
in this context). The results of this should bring evidence by the next meeting about how we can find out about
– whether sufficiently good depth estimation algorithms are available– which level of quality can be achieved in view synthesis, and whether e.g. PSNR
comparison against original views is useful– suitability of test sequences we have (and will have after next meeting) for purposes of
upcoming CfP
Documents reviewed m15047 Consideration of Depth Format Taka Senoh, [email protected], [email protected],
[email protected], [email protected] Reports about various versions of depth, e.g. absolute z-depth, disparity. Various versions of normalization of depth and defining depth ranges. Proponents should check relationship with definitions in 23002-3.
m15088 Available Technologies for FTV Masayuki Tanimoto, Toshiaki Fujii, Kazuyoshi SuzukiPresented.
m15089 Contribution of Nagoya University on FTV Test Material
Masayuki Tanimoto, Toshiaki Fujii, Kazuyoshi Suzuki, Norishige Fukushima
m15090 Improvement of Depth Map Estimation and View Synthesis
Masayuki Tanimoto, Toshiaki Fujii, Kazuyoshi Suzuki
m15098 Inter-View Skip Mode for FTV using Depth Information
Gang Zhu, Xiaozhong Xu, Ping Yang, Yun He
m15101 Corrected Camera Parameters for N9468; Call for Contributions on FTV Test Material?
Aljoscha Smolic, Heribert Brust, Karsten Mueller, Marcus Mueller, Thomas Wiegand
m15102 Progress Report on 3DTV Video Acquisition
Ingo Feldmann, Marcus Mueller, Frederik Zilly, Ralf Tanger, Karsten Mueller, Aljoscha Smolic, Peter Kauff, Thomas Wiegand
m15119 Segment-based Multi-view Depth Map Estimation for FTV
Sang-Beom Lee, Kwan-Jung Oh
m15120 Virtual View Synthesis for FTV Sang-Tae Na, Kwan-Jung Ohm15175 Depth Map Estimation Software Olgierd Stankiewicz, Krzysztof Wegner.m15191 Segment-based Disparity Estimation
using Foreground SeprationGi-Miun Um, Taeone Kim, Namho Hur, Jinwoong Kim
Output documents:No. Title TBP Available
Exploration – Free Viewpoint TV Coding9595 Call for Contributions on 3D Video Test Material (Update) Y 08/01/189596 Description of Exploration Experiments in 3D Video N 08/01/18
100
Annex H – JVT report
Source JVT Management Team (Gary J. Sullivan, Jens-Rainer Ohm, Thomas Wiegand, and Ajay Luthra)
AbstractThe Joint Video Team (JVT) of ITU-T Q.6/16 and ISO/IEC JTC 1/SC 29/WG 11 held its 26th meeting during 13-18 January, 2008 at the Divan Hotel in Antalya, Turkey. The JVT meeting was held under the chairmanship of Dr. Gary Sullivan (Microsoft/USA) and Dr. Jens-Rainer Ohm (RWTH Aachen/Germany), and under the associate chairmanship of Dr. Thomas Wiegand (Fraunhofer HHI/Germany) and Dr. Ajay Luthra (Motorola/USA). The JVT meetings opened at approximately 2:30 p.m. on Sunday 13 January 2008 and closed at approximately 11:45 a.m. on Friday 18 January 2008. Approximately 124 people attended the JVT meetings and approximately 40 input documents were discussed. The meetings took place in a co-located fashion with a meeting of ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent bodies of the JVT. The subject matter of the JVT meeting activities consisted of work on video coding.
1 Documents of the JVT meeting
1.1 Input documents
1.1.1 Administrative input contributionsJVT-Z000 (Admin) List of documents of Antalya meetingJVT-Z001-M (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj
mgmt and errataJVT-Z002 (Admin) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG
Report: JM text, ref soft, bitstream, confJVT-Z003 (Admin) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau]
AHG Report: JSVM text, S/W, confJVT-Z004 (Admin) [A. Segall, T. Wiegand] AHG Report: SVC bit depth and chroma formatJVT-Z005 (Admin) [J. Ridge, M. Karczewicz] AHG Report: FGS applications and design
simplificationJVT-Z006 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level syntax & buffer
managementJVT-Z007 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD
& JMVM text & softwareJVT-Z008 (Admin) [P. Pandit, H. Kimata, S. Cho, K. Muller] AHG Report: MVC RRU and
mixed-resolution view codingJVT-Z009 (Admin) [P. Pandit, H. S. Koo] AHG Report: MVC JMVM coding tools
1.1.2 Input liaison statements and parent-body inputsThe following WG 11 parent-body input contributions were noted:
M14863 JNB comment on 1080p50/60 MPEG-2/H.262M14869 Technical proposal on 1080p50/60 MPEG-2/H.262M15108: Subjective results for the SVC verification testM15132: Verification of new SVC verification test streams
101
M15209: Liaison response from SMPTE to sc29n8883 Liaison from JVT on potential extension of SVCM15215: Liaison response from DVD Forum regarding progress of video coding work
1.1.3 Non-administrative input contributionsJVT-Z020 ( Prop 2.2/3.1) [P. L. Lai (USC), P. Pandit, P. Yin, C. Gomila (Thomson)] CE2:
Adaptive reference filtering for MVCJVT-Z021 ( Prop 2.2) [H. Yang, Y. Chang, J. Huo (Xidian Univ.), S. Lin, S. Gao, L. Xiong
(Huawei)] CE1: Fine motion matching for motion skip mode in MVCJVT-Z022 / M15185 ( Prop 2.2/3.1) [S. Sekiguchi, K.Otoi, Y. Yamada, K. Asai, T. Murakami
(MEI)] 4:4:4 video coding perf with adaptive MV codingJVT-Z023 ( Prop 2.2) [S. Cho, N. Hur, J. Kim, S.-I. Lee (ETRI)] Coding eff of stereoscopic
video coding using residual downsamplingJVT-Z024 ( Info) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic
(HHI), Y.-K. Wang (Nokia), C. Ying (Tech. U. Tampere)] MVC decoding process and HRD design
JVT-Z025 ( Errata 2.0/3.1) [Y.-K. Wang, M. M. Hannuksela (Nokia)] SVC corrigendum itemsJVT-Z026 ( Prop 2.2.1/3.1) [Y. Chen (TUT), Y.-K. Wang (USTC), S. Liu, M. M. Hannuksela
(Nokia), H. Li (Nokia)] On asymmetric MVCJVT-Z027 ( Prop 2.2/3.1) [H. Nakamura, M. Ueda (JVC)] Comments on SPS MVC extensionJVT-Z028 ( Prop Profiles) [B.-M. Jeon (LG), W. S. Shim (Samsung), S. Cho (ETRI), G. H.
Park (Kyung Hee U.), P. Pandit (Thomson), Y.-L. Lee (Sejong U.)] About MVC coding tools
JVT-Z029 ( Prop 2.2/3.1) [G. Zhu, X. Xu, P. Yang, Y. He (Tsinghua U.), J. Zheng, X. Zheng (Hisilicon)] MVC inter-view skip mode using depth information
JVT-Z030 ( Prop 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee (GIST)] Regional disparity derivation for MVC motion skip mode
JVT-Z031 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] MVC motion skip mode with residual pred
JVT-Z032 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] Clarification of motion_skip_enable_flagJVT-Z033 ( Info) [Y. Chen (TUT), Y.-K. Wang (Nokia), S. Liu (USTC), M. M. Hannuksela,
H. Li (Nokia)] CE1: Information on motion skip and CE 1JVT-Z034 ( Prop 2.2) [S. Cho, B. Lee, N. Hur, J. Kim, S.-I. Lee (ETRI)] Prelim subjective test
results for mixed resolution stereo video codingJVT-Z035 / M15102 ( Info) [I. Feldmann, M. Mueller, F. Zilly, R. Tanger, K. Mueller, A.
Smolic, P. Kauff, T. Wiegand (HHI)] Progress report on 3DTV video acquisitionJVT-Z036 ( Prop Reqs) [A. Segall (Sharp)] On the requirements for bit-depth and chroma
format scalabilityJVT-Z037-V ( Info) [Y. Su, A. Segall (Sharp)] Verif of JVT-Z021: Fine motion matching for
motion skip mode in MVC (CE1)JVT-Z038 ( SEI Prop 2.0/3.1) [S. Yea, A. Vetro (MERL), A. Smolic, H. Brust (HHI)] Revised
syntax for SEI message on multiview acquisition informationJVT-Z039 ( Info) [S. Liu, A. Vetro (MERL)] Requirements for bit-depth scalable coding JVT-Z040 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to convey
suitable splice points in the bitstreamJVT-Z041 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to control
DPB output in non-seamless spliced bitstreams with end_of_streamJVT-Z042 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to forewarn
location of end_of_streamJVT-Z043 ( Errata) [H. Schwarz (HHI)] SVC errata
102
1.1.4 Late-registered input contributions, BoG reports, etc.
JVT-Z044-L (Late Errata 2.0/3.1) [V. Bottreau (Thomson)] On level limits common to scalable profiles – constraint "l"
JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter
JVT-Z046-QV (Late Verif) [J.-Z. Xu (Microsoft)] Verif JVT-Z029JVT-Z047-Q / M15196 (Late Prop 2.0/3.1) Proposal on Profiles for MVC (Multi-view Video
Coding)JVT-Z048-QV (Late Verif) [H. Yang (Xidian Univ.)] Verif JVT-Z029JVT-Z049-B (BoG Report) [A. Vetro (MERL)] BoG report on MVC profiles
1.2 Late document availabilityNon-administrative documents with document numbers suffixed in this report with "-L", "-Q", or "-M" were classified as late. Such documents will only be considered as information documents only (unless agreed otherwise by the group) if time permits, and consideration of them may be shifted to the end of the meeting as determined appropriate by the group.
For some time now, the JVT has agreed that no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution would be presented without having a minimum of 4 JVT participants (from different other than that of the primary contribution author) recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Such support to allow a presentation is to be understood to not necessarily imply support of the adoption of the content of the late contribution, but only as a positive expression that the document should be allowed to be presented. Additionally, the provider of a presented late contribution shall send an email apology to the JVT email reflector. This rule does not apply to material requested by the JVT at the meeting (e.g., reports of JVT-authorized side activities).
JVT decision: Agreed.
A check mark () indicates a contribution considered to be available on time.
The suffixes for contributions not marked as “” are explained below:– "-L" indicates a non-administrative contribution that was somewhat late but was available by
the second meeting day (JVT-Z044 was in this category at this meeting).– "-Q" were more late than that (JVT-Z045 through JVT-Z048 were in this category at this
meeting – two of which were verification documents).– "-M" were still missing at the time of preparation of this report.– "-B" were break-out group discussion reports and other input requested during the meeting
Further suffixing by “V” indicates a contribution that contains a cross-verification of a proposal.
Three contributions were subject to lateness penalties as follows:JVT-Z044-L (Late Prop/Errata 2.0/3.1) [V. Bottreau (Thomson)] On level limits common to scalable profiles – constraint "l"JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a
prediction filterJVT-Z047-Q / M15196 (Late Prop 2.0/3.1) [H. Kimata (NTT), H. Nakamura (JVC), T. Itoh (Fujitsu), T. Nomura (Sharp)] Proposal on Profiles for MVC (Multi-view Video Coding)
103
Notes on the apologies and named participant support for these contributions are included in the sections of this report that discuss each of these documents.
There were no objections to presentations of late documents at this meeting.
It was noted that the situation surrounding the need for on-time availability of contributions has substantially improved since our lateness penalty rules were adopted.
1.3 Withdrawn document registrationsNone.
1.4 Major output documentsMajor output documents submitted to parent-body review included the following. (Dates listed are planned dates of availability.)
1.4.1.1.1 JVT-Z200 Meeting report of the 26th JVT meeting (this document)
1.4.1.1.2 JVT-Z205 -M (WG 11 N9569) Draft conformance testing for SVC (V. Bottreau) [2008-02-29]
(Conveyed to WG 11 as "Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for Scalable Video Coding".)
1.4.1.1.3 JVT-Z207 (WG 11 N9578 Joint multi-view video model (JMVM) 7 text [2008-02-15]
1.4.1.1.4 JVT-Z208 (WG 11 N9579) JMVM 7 software [2008-02-22]
1.4.1.1.5 JVT-Z209 (WG 11 N9576) Joint draft multi-view video coding (MVC) [2008-02-15]
(Conveyed to WG 11 as "Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding".)
1.4.1.1.6 JVT-Z210 -M (WG 11 N9574) ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding defect report (G. Sullivan) [2008-04-11]
(Conveyed to WG 11 as "Text of ISO/IEC 14496-10:200X/DCOR 1".)
1.4.1.1.7 JVT-Y211 -M (WG 11 N9573) Draft reference software for SVC [2008-03-20](Conveyed to WG 11 as "Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding".)
1.5 JVT internal output documentsJVT internal output documents included the following. (Dates listed are planned dates of availability.)
1.5.1.1.1 JVT-Z202 -M Joint scalable video model (JSVM) text
1.5.1.1.2 JVT-Z203 -M JSVM software
104
2 JVT administrative and liaison topics
2.1 IPR policy reminder and updateParticipants were reminded of the IPR policy established by the parent organizations of the JVT and were referred to the parent body web sites for further information. The IPR policy was summarized for the participants.
Participants were particularly reminded of the need to supply a completed JVT IPR status reporting form in all technical proposals for normative standardization. Participants were also reminded of the need to formally report patent rights to the top-level parent bodies (using the common reporting form found on the database listed below) and to make verbal and/or document IPR reports within the JVT as necessary in the event that they are aware of unreported patents that are essential to implementation of a standard or of a draft standard under development.
The JVT chair noted that the top-level parent bodies have agreed upon a common patent policy for ITU-T, ITU-R, ISO, and IEC.
Some relevant links for organizational and IPR policy information are provided below:– http://www.itu.int/ITU-T/ipr/index.html (new common patent policy for ITU-T, ITU-R, ISO,
IEC and guidelines and forms for formal reporting to the parent bodies)– http://ftp3.itu.int/av-arch/jvt-site (JVT contribution template for each meeting)– http://www.itu.int/ITU-T/studygroups/com16/jvt/index.html (JVT founding charter)– http://www.itu.int/ITU-T/dbase/patent/index.html (ITU-T IPR database)– http://www.itscj.ipsj.or.jp/sc29/29w7proc.htm (SC29 Procedures)
The JVT chair noted that the ITU TSB director's AHG on IPR had recently issued a clarification of the IPR reporting process for ITU-T standards, as follows (and as previously sent to the JVT email reflector), per SG 16 TD 327 (GEN/16):
“TSB has reported to the TSB Director’s IPR Ad Hoc Group that they are receiving Patent Statement and Licensing Declaration forms regarding technology submitted in Contributions that may not yet be incorporated in a draft new or revised Recommendation. The IPR Ad Hoc Group observes that, while disclosure of patent information is strongly encouraged as early as possible, the premature submission of Patent Statement and Licensing Declaration forms is not an appropriate tool for such purpose.
In cases where a contributor wishes to disclose patents related to technology in Contributions, this can be done in the Contributions themselves, or informed verbally or otherwise in written form to the technical group (e.g. a Rapporteur’s group), disclosure which should then be duly noted in the meeting report for future reference and record keeping.
It should be noted that the TSB may not be able to meaningfully classify Patent Statement and Licensing Declaration forms for technology in Contributions, since sometimes there are no means to identify the exact work item to which the disclosure applies, or there is no way to ascertain whether the proposal in a Contribution would be adopted into a draft Recommendation.
Therefore, patent holders should submit the Patent Statement and Licensing Declaration form at the time the patent holder believes that the patent is essential to the implementation of a draft or approved Recommendation.”
105
The JVT chair noted (as also previously remarked on the JVT email reflector) that since we are at the completion of the MVC amendment project, it was suggested that if anyone needs to report IPR on that topic and has not yet done so, now would be a good time to file formal notices to the parent bodies for any patent rights that are believed to be essential to the implementation of the MVC extensions (not to mention any notices not previously filed relating to the new SVC profiles, AVC professional profiles, or other previous projects).
It is suggested that, to enable proper interpretation of such formal notices, the MVC amendment should be clearly identified in such formal notices. For example, as “ITU-T Rec. H.264 and ISO/IEC 14496-10 Advanced video coding (2007 Ed.) Amendment 1 (2008): Multiview video coding”. Notices pertaining to other efforts should be made with a similar degree of clarity of identification of the specific standardization work item to which the declaration pertains.
The chair invited participants to make any necessary verbal reports of previously-unreported IPR in draft standards under preparation and opened the floor for such reports: No such verbal reports were made.
2.2 Meeting opening and remarks by the chairmenThe meeting was opened at approximately 2:30 p.m. on Sunday 13 January 2008.
At the opening session of the meeting, the JVT chairs reminded participants of the relevant IPR policy as described above, and reviewed the status and plans for the major projects under way in the JVT. The largest area of activity consisted of multi-view video coding (MVC) extensions of the ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding (AVC) standard. SVC work was categorized as "phase 1" or "phase 2", depending on whether the work related to the recently-designed initial SVC amendment or to a potential future further SVC extension.
Documents were made available for download at http://ftp3.itu.int/av-arch/jvt-site/2008_01_Antalya.
The deadline was Tuesday January 8th 2008 for registrations and uploads.
Initially-missing non-administrative documents included the late-registered document JVT-Z045, which was registered verbally during the opening of the meeting. . Document JVT-Z044 had also been registered and uploaded late, but was available approximately 2 days prior to the opening of the meeting. Documents with numbers higher than that of JVT-Z045 were registered after the opening session of the meeting.
Meeting information could be found at http://www.sunflowerconferences.com/mpeg83/.
A document template had been made available at http://ftp3.itu.int/av-arch/jvt-site/JVT-Zxxx.dot. It contained important instructions and policy information. Participants had been encouraged to read it and use it as the basis of their contributions.
Opening remarks:– IPR policy reminder– Professional profiles – follow up work on reference software and conformance– Scalable video coding (SVC) phase I – follow up work on reference software and
conformance and collaboration with MPEG was needed on verification testing– SVC phase II – work areas included investigation of bit depth, color gamut, and chroma
format scalability and fine-granularity scalability
106
– Multiview video coding (MVC) was a major project underway, and constituted the topic of most contributions to the meeting
– Corrigendum work is needed, and was a major priority for this meeting
Further work and additional needs on the development, standardization, and maintenance of the base specification and the recently-completed SVC and professional profiles, and of associated reference software and conformance specifications was noted. Needs for verification testing to be conducted by the WG 11 parent body were noted and discussed.
The incoming status of work on errata aspects of the AVC specification was as found in JVT-Y210, which was delivered (during the Antalya meeting) as an output from the previous meeting. Other inputs on errata consisted of JVT-Z025, JVT-Z043, and JVT-Z044.
The chair remarked that there were few late document uploads this time, and that the submitted documents seem to be generally adhering to the JVT guidelines in terms of formatting, filenames, etc., which is a good development, although further improvement (particularly in the formatting conventions) is still needed. The JVT operating rules on that subject have helped.
2.3 JVT communication practicesJVT documents were available at http://ftp3.itu.int/av-arch/jvt-site.
These can also be accessed via ftp with the site name ftp3.itu.int, user ID avguest and password Avguest. Upon login, documents will then be found in the directory "jvt-site". Uploading of contributions is done by upload via ftp protocol to the "jvt-site/dropbox" directory using this account ID and password.
JVT email lists are managed through the site http://mailman.rwth-aachen.de/mailman/options/jvt-xyz, and to send email to one of these reflectors, the email address is "[email protected]", where "xyz" corresponds to– "experts" for general experts group discussions– "bitstream" for bitstream exchange activities– "svc" for SVC work– "mvc" for MVC work
2.4 Scheduling and logistics notesSome parallel sessions were held during the meeting, particularly including some parallel review of MVC and SVC contributions. Some “break-out group” (BoG) side activities and informal study efforts were also conducted. Documents produced by break-out group activities (if any) are listed in this report with the abbreviation “BoG” and are suffixed with "-B".
A contribution template JVT-Zxxx.dot for the JVT meeting was made available on the JVT ftp site: http://ftp3.itu.int/av-arch/jvt-site/2008_01_Antalya. It contained essential information for JVT participants. Participants had been instructed to read it carefully, particularly if they planned to be submitting contributions to the meeting.
The document registration and upload deadline was Tuesday 8 January 2008 (the Tuesday preceding the meeting).
Note that the JVT has agreed that no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution will be presented without having a minimum of 4 non-affiliated JVT
107
participants from different organizations recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Additionally, the provider of a presented late contribution must send an email apology to the JVT email reflector.
2.5 Administrative documents
2.5.1.1.1 JVT-Z000 (Admin) List of documents of Antalya meetingAs listed herein.
2.5.1.1.2 JVT-Z001 -M (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errata
General project status was reported verbally as described above (see opening remarks).
The latest version of the meeting report of the Shenzhen meeting (marked as draft 5) had been made available on 4 December 2007 (approximately 6 weeks prior to the meeting).
On errata: The JVT-Y210 output of the previous meeting was produced and made available during the current meeting. Other inputs on errata consisted of JVT-Z025, JVT-Z043, and JVT-Z044.
2.5.1.1.3 JVT-Z002 (Admin) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG Report: JM text, ref soft, bitstream, conformance
This document described the activities of the JM text, reference software and bitstream conformance ad hoc group since the last JVT meeting.
On JM Reference Text: There was no activity to report on the JM reference text.
On JM reference software: The professional profile integration has been finished. Several bugs have been found and been fixed. The bug fixing activity is ongoing.
Software releases JM 13.1 and JM 13.2 have been issued.
The following issues were reported to be the most important (volunteers needed):
As the official H.264/AVC reference software, the JM should be a correct source for checking implementations. This means the decoder should be able to decode all valid H.264/AVC bitstreams and the encoder should never create invalid bitstreams. This is currently not the case.
Depending on the configuration the JM encoder can create invalid bitstreams:– Level constraints are not properly checked– The 16-bit transform requirement is not checked– In Baseline/Main/Extended profile the restriction of CAVLC syntax elements needs proper
handling
The software coordinators would like to encourage all H.264/AVC experts to volunteer for fixing these issues.
108
Known Issues / Reporting bugs: A web based bug tracking system has been set up for keeping track of known issues and missing features. The system is publicly accessible but requires registration for entering bug reports.
The system is located at http://ipbt.hhi.de.
This internet site contains also some usage instructions.
Please note that the bug tracking system is using encrypted/secure http (https) for protecting the user’s login. The used certificate is self signed and has to be imported into the user’s web browser. The SHA-1 fingerprint of the certificate is 69:21:86:d9:3e:72:da:3f:e8:30:df:a8:dd:fa:a5:4c:ed:85:b5:09.
A list of known issues and their state can be found at: https://ipbt.hhi.de/mantis/view_all_bug_page.php.
As an annex to the AHG report, a list of the 34 issues identified as active in the bug tracker system (as of 2008-01-14) was provided.
It was requested that certain rules should be followed before reporting any new bugs:
– The database should be searched on whether the same issue was previously reported. If the problem was reported before, but there is additional information, then this information should be added to the original report.
– It should be specified if the problem is related to the encoder, decoder or both.
– The version of the software used should be specified.
– Description of the problem should be as precise as possible.
– The necessary steps to reproduce the problem should be described in detail.
– If available, the configuration files or/and command line syntax used to run the software should be provided.
– The language of the standard should be used when referencing the text description.
– After filing the report, the user should check if he/she is requested to provide additional or other information relating to this issue.
Bitstream Exchange Activities: Communications related to bitstream exchange activity have taken place on the bitstream exchange reflector ([email protected]). However this topic was not so active since the last JVT meeting.
The FTP area for downloading bitstream files is on the main JVT Experts FTP site:ftp://ftp3.itu.int/jvt-site/bitstream_exchange/
The bitstreams can alternatively be accessed from the following http site.http://ftp3.itu.int/av-arch/jvt-site/bitstream_exchange/
To volunteer a bitstream for testing, contributors are requested to include it in a zip archive along with related files (trace files, configuration, reconstructed frames) in a zip archive and upload it to the dropbox:ftp://ftp3.itu.int/jvt-site/dropbox using the user ID "avguest" and password "Avguest".
109
In general, the following naming convention is being followed for the bitstreams in the exchange:FeatureCode_Source_VersionLetter
Please refer to the spreadsheet and files on the FTP site for examples.
Once a bitstream is uploaded to the dropbox, e-mail should be sent to [email protected], and/or the bitstream exchange reflector and it will be made available in the bitstream_exchange directory.
To sign up for the bitstream exchange reflector, use the web address given below. http://mailman.rwth-aachen.de/mailman/listinfo/jvt-bitstream.
No new bitstreams for non-"professional" profiles had been exchanged since the last meeting.
Conformance bitstreams for professional profiles: New conformance bitstreams for High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra, CAVLC 4:4:4 Intra, and High 4:4:4 profile were generated and available at JVT ftp site. Additional bitstreams are available for lossless coding.
However, it was found that the JM software (JM13.0 or later) crashed when decoding some conformance bitstreams. The volunteers investigated the problem and found it seems to be a problem in the JM software. The following problems were found:
1) 4:2:0 10-bit intra-only: no encoder/decoder match both for luma and chroma2) 4:2:2 10-bit intra-only: decoder crashes3) DC quantization for 4:2:2
It was confirmed that problem 1 above was fixed in JM13.1, and that problem 3 should be fixed by the next JM release.
Regarding the conformance bitstreams for 4:2:0 8 bit profiles: There was a report that the conformance stream cama1_vtc_c seems to be missing zero_bytes. However this report was not checked yet. This should be investigated further.
The AHG recommended to continue to collect more conformance bitstreams
2.5.1.1.4 JVT-Z003 (Admin) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau] AHG Report: JSVM text, S/W, conf
This document presented the report of the AhG on JSVM text, JSVM software, and SVC conformance.
The text of Joint Scalable Video Model (JSVM-12) was submitted as JVT-Y202. The text of the Joint Scalable Video Model wasn't modified relative to JVT-X202 (JSVM-11).
The JSVM software was submitted as JVT-Y203 and JVT-Y211. It corresponds to CVS tag "JSVM_9_10_DEVEL2". The reported status is summarized in the following.
The following changes had been implemented relative to JVT-X203:– correction of position calculation for inter-layer intra and residual prediction (including recent
changes according to JVT-X201)– correction of subset SPS (syntax and usage)– correction of prefix NAL unit syntax– correction of slice header syntax– correction of IDR support
110
– correction of SVC profile identifications– correction of SEI message identifiers– several bug fixes
Furthermore, an effort had been started to improve the decoder implementation (major rewrite) and remove unnecessary code (e.g. tools that are not supported in the standard) from the decoder implementation.– removal of RCDO– removal of 4-tap upsampling filters– removal of FGS (remaining code parts)– removal of fragmented NAL units– removal of additional "base layer decoder"– clean up of processing order in decoder– clean processing of access units in decoder– general improvement, simplifications of main decoder classes
The clean-up of the decoder implementation had not been finished yet. Some of the tools that were implemented in prior versions of the JSVM software were reported to not be supported:– Temporal direct mode: Currently not supported, so that most of the AVC conformance
bitstreams cannot be decoded (in SVC, the temporal direct mode is not supported).– Error detection and error concealment: The error detection and error concealment code
had been temporarily removed, so that all validation scripts that simulate packet losses fail. – The previously implemented code assumed a fixed GOP structure (by analyzing the parameters of the first two access units) and only worked for 2 layers. It was reported that it should be tried to implement the error detection and concealment in a more general way.
– Support of interlaced coding tools: Bugs related to interlaced coding tools need to be fixed.– Support of multiple slice groups: Bugs related to multiple slice groups need to be fixed (the
bugs already existed in last version of the JSVM software).
The following tools were reported to need to be implemented or fixed in order to align the software to the text (JVT-X201):– fixing implementation of multiple slice groups (and IROI)– fixing implementation of interlaced tools– implementation of new loop filter modes (two filter passes, JVT-W063r1)– correction of position calculation for inter-layer prediction of coding modes and motion
vectors– double check SEI syntax– order of redundant pictures in bitstream– re-implementing temporal direct mode (low priority – not required for SVC bit streams)– re-implementing error concealment and detection (lower priority – long term issue)– base layer rate control (JVT-W043, non-normative)
Additional fixes of which the software coordinators are currently not aware reportedly might also be required to align the JSVM software to the text.
In order to keep track of the changes in software development and to always provide an up-to-date version of the JSVM software, a CVS server for the JSVM software has been set up at the Rheinisch-Westfälische Technische Hochschule (RWTH) Aachen. The CVS server can be accessed using WinCVS or any other CVS client. The server is configured to allow read access only using the parameters specified below. Write access to the JSVM software server is restricted to the JSVM software coordinators group.
111
– authentication: pserver– host address: garcon.ient.rwth-aachen.de– path: /cvs/jvt– user name: jvtuser– password: jvt.Amd.2– module name: jsvm or jsvm_red
The following example shows how the JSVM software can be accessed by using a command line CVS client.
cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt logincvs –d :pserver:[email protected]:/cvs/jvt checkout jsvm
In the following example, it is shown how a specific JSVM software version – specified by a tag (JSVM_9_8 in the last example above) – can be obtained using a command line CVS client. Note that "co" represents an abbreviation for the command checkout, which was used in the example above.
cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt logincvs –d :pserver:[email protected]:/cvs/jvt co –r JSVM_9_8 jsvm
It is also possible to check out only a reduced JSVM software package by using the module name jsvm_red instead of jsvm. In this case, the directories JSVM0-config-sample and MVC-Configs are ommitted in the checkout, as shown below.
cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt logincvs –d :pserver:[email protected]:/cvs/jvt co jsvm_red
The CVS repository includes a JSVM software manual, which provides further information on the JSVM software.
The text of the conformance test specification document "Draft conformance testing for SVC" had been submitted as JVT-Y205.
A first SVC related errata list had been submitted as JVT-Z043.
The editors and software coordinators were thanked for their excellent and diligent work.
2.5.1.1.5 JVT-Z004 (Admin) [A. Segall, T. Wiegand] AHG Report: SVC bit depth and chroma format
The AhG was established at the Shenzhen meeting to study bit-depth and chroma format scalability. The mandates of the AhG were:– Identify applications– Work out suggestions for detailed needs– Find/create test material– Study bit-depth reduction techniques, e.g., tone-mapping tools– Study color space and/or gamma conversion requirements– Study combined spatial and bit depth scalability– Define experiments and test conditions– Investigate software and text modification needs– Identify complexity issues
The AhG sent a kick-off message to the JVT main reflector ([email protected]) on December 7, 2007. The message contained [BDS] in the subject line.
112
Related contributionsJVT-Z036 [A. Segall (Sharp)] On the requirements for bit-depth and chroma format scalabilityThis document discusses the requirements for bit-depth scalability within the context of consumer applications. Current trends in display technology are the focus, and it is asserted that these trends motivate the need for higher bit-depth formats within consumer devices. Thus, it is proposed that development of any bit-depth scalable system should consider these applications.
JVT-Z039 [S. Liu, A. Vetro (MERL)] Requirements for bit-depth scalabilityThis document considers a new application scenario for bit-depth scalable coding in which receiver-side editing of a high dynamic range video is desired. Requirements for bit-depth scalable coding are described and preliminary results that aim to demonstrate the benefits of higher-bit depth video at the receiver are shown.
JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter (registered after the AHG report was written)This document describes research work on bit depth SVC. By applying a filter to the reconstructed image from the lower layer, an average of 4.4% BDBR, or an average of 0.15 dB BDPSNR, can reportedly be achieved at the 10 bit top layer for "Viper" sequences. Higher gain is reportedly seen on input sequences with normal lighting conditions. (One sequence had a reported benefit exceeding 10%.)
Two liaison statements related to bit-depth and chroma format scalability had been sent to our MPEG parent body: M15209 from SMPTE and M15215 from DVD Forum.
The AhG recommended– To review related contributions during the meeting– To continue the study of bit-depth and chroma format scalability– To continue evaluating test material
2.5.1.1.6 JVT-Z005 (Admin) [J. Ridge, M. Karczewicz] AHG Report: FGS applications and design simplification
At the Shenzhen meeting, the JVT established the FGS applications and design simplification AHG activity with the following mandates:– Identify applications that may require FGS functionality and their characteristics.– Determine to what extent new coding tools are needed to achieve the functionality.– Define experiments and test conditions relating to FGS technology.– Coordinate with JSVM software effort to align JSVM software with current design.– Explore simplification of FGS tool design.
No contributions were submitted to this meeting on the subject matter of this AHG.
The main questions asked during the last couple of meetings did not relate to AR-FGS design but rather to its applicability, specifically regarding:– Importance of conversational applications for mobile devices.– Influence of high variations of bits per frame on delay in OFDMA based networks.
The AHG report suggested that the best forums to answer these questions are 3GPP and 3GPP2. Hence it was suggested to consider sending liaison letters to these forums requesting their comments. Alternatively since some of JVT participants are also active members of 3GPP/PP2 and it was believed that SVC issues would be discussed in the upcoming 3GPP and 3GPP2
113
meetings, such that we may use the results of those discussions as input to decide on needs for future work in this area.
2.5.1.1.7 JVT-Z006 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level syntax & buffer management
At the Shenzhen meeting, the JVT established the AhG on MVC high-level syntax & buffering, with the following mandates:– Discuss high-level syntax for MVC including NAL unit type, NAL unit header extension,
SPS extensions, slice layer, and integration with SVC syntax.– Discuss reference picture management to enable simultaneous picture output of different
views and to facilitate parallel processing.– Discuss issues related to HRD.– Propose refined syntax and decoding processes for JMVM.
The editors were reported to have made several improvements to the JD text related to high level syntax.
A contribution related to the decoding process and HRD for MVC can be found inJVT-Z024 [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y.-K. Wang (Nokia), C. Ying (Tech. U. Tampere)] MVC decoding process and HRD design
The AhG on MVC high-level syntax & buffering recommended discussing the issues related to HRD and making any necessary revisions to the MVC text.
2.5.1.1.8 JVT-Z007 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD & JMVM text & software
At the Shenzhen meeting, the JVT established the AhG on JMVM and JD text editing and software, with the following mandates:– Collect comments on the draft, perform necessary editing, and upload the final document by
the deadline.– Maintain JMVM and JD document and collect comments on the text until the next meeting.– Coordinate JMVM software integration– Coordinate the bug-fixing process for the JMVM software– Maintain the JMVM software manual
The JMVM6 and JD5 were submitted to the JVT as JVT-Y207 and JVT-Y209, respectively. The JD text included a minor syntax change regarding view dependency as described in JVT-Y061. The JMVM included the following updates:– JVT-Y033: IC bug fix – JVT-Y058: motion skip bug fixes – JVT-Y042/Y053: single loop decoding
Several other editorial improvements and clarifications were reported to have also been made to the JD and JMVM text, including:– Better alignment with the latest SVC specification– Clean up the decoding process with regard to IC (intensity compensation – including the
adopted fix in JVT-Y033)
114
Some minor editorial revisions to the JMVM were reported to have been received regarding the IC tool since the final document was uploaded and were requested to be considered as editor’s input to the meeting (provided as an attachment to the ad hoc report).
The JMVM 6 software was delivered to the group on November 30th, 2007. This release contained the addition of a motion skip flag in slice header, simplification B- and P-Skip modes on illumination compensation, sending multiple GDVs (global disparity vectors) in the case of multiple inter-view references, SPS (sequence parameter set) simplification, single loop decoding, and some software improvements.
Some software issues that still need to be addressed were reported as:– Fix software for compile errors for gcc 3.4+ version.– Remove all compilation warnings.– Output order of views is not sequential or parallel (it is on an as ready basis).– Prepare validation scripts (work in progress).– Provide support GOPsize=1 (with motion skip & IC).– Trace file support for arbitrary view_id assignments.
The AhG on JMVM and JD text editing recommended:– To consider the editor’s input (provided as an attachment to the ad hoc report) in preparing
future versions of the JMVM and JD.– To discuss the issues in the current version of the software as mentioned above.– To improve the manual created for the JMVM software.– To follow the same software integration guidelines present in JSVM (repeated below).
In order to improve the whole software integration process, the software integration guidelines and rules are as follows:– The integrated software shall compile without warnings when using the provided VC6 and,
VS .NET workspaces, as well as Linux makefiles.– Do not use variable declarations inside the header of for-loops (as the scope for for-loops is
not correctly supported with all compilers).– Follow the coding style of the JMVM software. Use 2 (two) spaces for indentation, no tabs.– Re-use code and integrate functionality as possible. Try to avoid redundant code.– Do not change the meaning of existing input parameters, but rather define new ones if
necessary (and applicable).– Make sure that new parameters have meaningful default values. Tools should not be
switched on by default (if not decided different by the JVT).– Do not re-structure the output of the compiled binaries (if not decided different by the JVT).– Please change the JMVM version number macro (i.e. "_JMVM_VERSION_":) located in the
file "CommonDefs.h" to be inline with your integration tag.
Reference to CVS repository[CVS] host address: garcon.ient.rwth-aachen.de user name: jvtuser password: jvt.Amd.2authentication: pserver path: /cvs/jvt module name: jmvm or jmvm_red
jmvm_red does not check out certain old folders related to SVC.
2.5.1.1.9 JVT-Z008 (Admin) [P. Pandit, H. Kimata, S. Cho, K. Müller] AHG Report: MVC RRU and mixed-resolution view coding
Mandates115
– Investigate approaches for enhancing MVC coding efficiency using spatial downsampling– Evaluate the complexity of such methods– Investigate the relationship between downsampling approaches and view interpolation– Investigate low-complexity methods for mobile stereoscopic 3DTV applications
No emails had been exchanged on this topic on the reflector.
There were three contributions to this meeting that were noted to relate to this AhG, as follows:JVT-Z023 [S. Cho, N. Hur, J. Kim, S.-I. Lee (ETRI)] Coding eff of stereoscopic video coding using residual downsamplingJVT-Z026 [Y. Chen, Y.-K. Wang, S. Liu, M. M. Hannuksela, H. Li (Nokia)] On asymmetric MVCJVT-Z034 [S. Cho, B. Lee, N. Hur, J. Kim, S.-I. Lee (ETRI)] Prelim subjective test results for mixed resolution stereo video coding
The AHG recommended to review the related contributions during the meeting.
2.5.1.1.10 JVT-Z009 (Admin) [P. Pandit, H. S. Koo] AHG Report: MVC JMVM coding tools
The JMVM coding tools AhG had been established with the following mandates:– Investigate simplification and improvement of current JMVM coding tools (IC and motion
skip)– Investigate techniques for single loop decoding to reduce complexity starting with motion
skip
No relevant email had been exchanged on the reflector during the interim period since the last meeting.
The following contributions were noted to relate to the AhG:JVT-Z021 [S. Lin, S. Gao, L. Xiong (Huawei), H. Yang, Y. Chang, J. Huo (Xidian University)] CE1: Fine Motion Matching for Motion Skip Mode in MVCJVT-Z029 [G. Zhu, X. Xu, P. Yang and Y. He (Tsinghua U.)] MVC inter-view skip mode using depth information JVT-Z030 [Y. S. Ho, K. J. Oh, C. Lee (GIST)] Regional disparity derivation for MVC motion skip modeJVT-Z031 [J. H. Park, B.H. Choi (KETI)] MVC motion skip mode with residual predJVT-Z032 [J. H. Park, B.H. Choi (KETI)] Clarification of motion_skip_enable_flag
The AhG recommended to discuss the related contributions at the meeting.
2.6 Closing session notesIn the closing session there were no requests to reopen discussions of preceding agenda topics and side activities recorded elsewhere in this report.
The JVT thanked its WG 11 parent body for hosting the 26th JVT meeting, and Sunflower Conference Services for arrangement of meeting logistics.
The meeting was closed at 11:45 a.m. on Friday 18 January 2008.
116
2.7 JVT liaison communications and parent-body communications
The JVT did not receive liaison communications at this meeting. However two WG 11 parent body input liaison statements were noted as discussed below.
No liaison statements were sent by the JVT from the meeting. However, outgoing liaison statements were sent by WG 11 regarding SVC verification test results as discussed below.
2.7.1.1.1 M15209: Response from SMPTE to sc29n8883 Liaison from JVT on potential extension of SVC
SMPTE appears interested in bit depth and will be meeting in March – no detailed input was reported to be available prior to that.
2.7.1.1.2 M15215: Liaison response from DVD Forum regarding progress of video coding work
The DVD Forum WG-1 thanked the Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC 1/SC 29/WG 11 and ITU-T SG 16 Q.6) for informing them about the recent progress on enhancements of the ITU-T Rec. H.264 & ISO/IEC 14996-10 Advanced Video Coding standard.
The DVD Forum indicated that it is studying the benefit for consumers, manufacturers and content providers to adopt these enhancements into their specifications. Although their study has not been concluded, they consider it essential to retain backward compatibility with the existing products to avoid market confusion. In addition, they would appreciate it if JVT could provide evidence of improvement by the enhancements so that their study becomes more practical.
The further information requested by the DVD Forum was provided by the outgoing WG 11 liaison statement N9617 discussed below.
2.7.1.1.3 N9617: Liaison Statement template for various organizations re SVC verification testing report
The WG 11 parent body sent liaison letters conveying the verification test report for SVC to a variety of organizations – specifically to: ARIB, ATSC, BDA, DLNA, DVB, DVD Forum, EBU, FLO Forum, IEC TC 100, IETF AVT, ISMA, ITU-R SG 6, ITU-T SG 9, ITU-T SG 12, OMA, SCTE, SMPTE, TTA-DMB, WorldDMB, 3GPP, and 3GPP2. See also item 1 of section 10 of this report.
3 AVC base specification, errata, and related topicsThe latest prior errata reporting status was provided in the JVT-Y210 output document of the previous meeting. Three additional documents related to errata issues were submitted for consideration at this meeting: JVT-Z025, JVT-Z043, and JVT-Z044.
3.1.1.1.1 JVT-Z025 ( Errata 2.0/3.1) [Y.-K. Wang, M. M. Hannuksela (Nokia)] SVC corrigendum items
This document reported three possible corrigendum items for the SVC specification.
117
The first item is related to a semantics constraint on sub-picture scalable layer SEI message. A fix is proposed.
JVT Decision: Adopted in spirit – exact phrasing to be determined (esp. relating to possibility of presence in different NAL units).
The second item is related to a constraint on the co-existence of “old” H.264/AVC SEI message and “new” SVC SEI message” in a same SEI NAL unit. A fix is proposed.
JVT Decision: Adopted in spirit – exact phrasing to be determined (esp. relating to changing an "and" to "that is").
The third one is on the definition of “decoded picture”. It was proposed to add a definition of “decoded picture” in Annex G to explicitly exclude a “reference base picture” being a “decoded picture”. Remark: Perhaps adding a NOTE (e.g. near semantics of use_ref_base_pic_flag) would be better than adding a new definition.
JVT Decision: Adopted in spirit – exact phrasing to be determined.
3.1.1.1.2 JVT-Z043 ( Errata) [H. Schwarz (HHI)] SVC errataThis document contained an SVC related errata list for eventual incorporation into a future amendment or corrigendum to the ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced Video Coding standard.
This document reportedly started with the JVT output document JVT-X201 as its basis. Changes were relative to that document. The document reportedly included all necessary issues of which the editors were aware prior to the 26th meeting.
Most issues were reported to have been identified by Danny Hong of Vidyo.
"r1" version also provided and presented.
JVT Disposition: Approved ("r1").
3.1.1.1.3 JVT-Z044 -L (Late Errata 2.0/3.1) [V. Bottreau (Thomson)] On level limits common to scalable profiles – constraint "l"
JVT members supporting presentation:– S. Pateux– H. Schwarz– S. Gao– Segall
Apology? Has been sent.
According to JVT-X201, constraint "l" sets limits to the number of reference layer macroblocks that can be encoded with mbType equal to I_PCM, I_16x16, I_8x8, I_4x4, or I_BL according to the number of enhancement layer macroblocks encoded with mbType equal to I_BL. It is understood that the primary intention of such a constraint was to limit the required decoder complexity. However, it was asserted that the impact of such a constraint may not have been sufficiently investigated. The contribution proposes to discuss the impact of this constraint from
118
an encoder perspective and highlights some use cases that such a constraint reportedly may preclude.
Reportedly, Equation G-370 may preclude some potential use cases. In addition, it was asserted that such a constraint imposes too strict and/or complex encoding rules from an encoder perspective. It was asserted that Equation G-370 sets encoding rules either on the reference layer or the enhancement layer by enforcing either a specific reference layer macroblock pattern to be encoded with mbType equal to I_PCM, I_16x16, I_8x8, I_4x4, or I_BL or an enhancement layer macroblock pattern to be encoded with mbType equal to I_BL. The contribution proposes that:– Equation G-370 be reformulated in order to better take into account the number of
enhancement layer macroblocks eligible to be encoded with mbType equal to I_BL, i.e. for instance only the macroblocks lying within the cropping window;
– And/or Equation G-370 be reformulated in order to minimize its impact on encoding mode selection for reference layer macroblocks, for instance by changing the 1.5 arbitrary factor;
– Or that constraint "l" be removed.
JVT decision: This appears to be a valid report of an actual problem in the standard. The intent was to establish a constraint that constrains (only) the macroblocks in the base layer that are actually used in the decoding process of the enhancement layer. Under some conditions (e.g. involving cropping) the text does not seem to express that intended constraint. Further study may be needed to draft the final necessary correction.
4 Scalable video coding (SVC)
4.1 SVC bit depth and chroma format scalability
4.1.1.1.1 JVT-Z036 ( Prop Reqs) [A. Segall (Sharp)] On the requirements for bit-depth and chroma format scalability
This document discusses the requirements for bit-depth scalability within the context of consumer applications. Current trends in display technology are the focus, and it is asserted that these trends motivate the need for higher bit-depth formats within consumer devices. Thus, it is proposed that development of any bit-depth scalable system should consider these applications.
The contribution contiains an emphasis on larger visual dynamic range, as opposed to increased precision representation of the same visual dynamic range (adding least significant bits).
The contribution suggests that 10 bit sample depth support is necessary for near term; 12 bits for longer term.
An approximate 1014:1 human visual dynamic range was reported; 104:1 in a short time interval. CRTs can do that, but with ambient light this is reduced to 50:1 or 100:1.
Multiple displays are on the market – getting brighter and extending their dynamic range, with very wide contrast ratios emerging. Example companies producing such technology: Sharp, LG, Brightside/Dolby.
An HDR image was demonstrated.
119
Remark: Justifying high bit depth and justifying scalable support for it are somewhat different subjects. Our previous work has already included increased bit depth support (except in the scalability context).
JVT conclusion: It seems generally agreed that support of a large visual dynamic range is an important capability to deliver in our work.
4.1.1.1.2 JVT-Z039 ( Info) [S. Liu, A. Vetro (MERL)] Requirements for bit-depth scalable coding
This contribution considers a new application scenario for bit-depth scalable coding in which receiver-side editing of a high dynamic range video is desired. Requirements for bit-depth scalable coding are described, and preliminary results that aim to demonstrate the benefits of higher-bit depth video at the receiver are shown.
The contribution contains an emphasis on editing, tone mapping, etc., with "default" representation in the base layer and having an enhancement layer to provide greater quality.
Professional, "pro-sumer", and high-end consumer applications were emphasized.
The contribution suggests to consider high bit depth scalability support and to study the benefits of post-capture editing of HDR video.
Remark: Again, there's a distinction between the need for high quality / high bit depth, and the need for scalability, which is something else.
Response: There is the argument for an "easy to access" default representation supplemented by extra enhancement information – consumer quality plus extra data for high-quality subsequent processing.
4.1.1.1.3 JVT-Z045 -Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter
Supporting presentation of the late contribution:– P. Topiwala– A. Vetro– Y. Chiu– A. Segall
Apology? Has been sent.
This contribution describes research work on bit depth SVC. By applying a filter to the reconstructed image from the lower layer, an average of 4.4% BDBR improvement, or an average of 0.15 dB BDPSNR improvement, can reportedly be achieved at the 10 bit top layer for "Viper" sequences. Higher gain is reportedly seen on input sequences with normal lighting conditions. (One sequence had a reported benefit exceeding 10%.)
The contribution describes the use of bit-depth scalability with tone mapping as in the current JSVM.
The contribution proposes modification of tone mapping operation.
Had some problem with BD measurements.120
A presentation deck and "r1" version of the document were uploaded later.
Question: Has application of the filter after adding the enhancement layer been tried? Response: No. Remark: That might provide equivalent gain (without adding mandatory complexity to the decoding process).
Remark: Better gains in daytime scenes – good lighting conditions. Response: That's right. Remark: The sequences were intentionally chosen to provide a mixture of daytime and night-time content.
Further study is encouraged, including consideration of the post-processing comparison question.
4.2 SVC Conformance and verification
4.2.1 SVC conformance specificationSee JVT-Z003 and JVT-Z205. Proceeding to FPDAM status in the ISO/IEC approval process.
4.2.2 SVC verification testingThe following two parent-body input contributions were noted in relation to SVC verification testing as discussed below. The SVC verification test report was produced as WG 11 N9577 (discussed below) and was conveyed to various organizations in liaison communications as found in WG 11 N9617 (discussed above).
4.2.2.1.1 M15108: Subjective results for the SVC verification testSubjective result plots were shown toward development of the SVC verification test report. Tested scenarios included Scalable Baseline (conversational and mobile TV applications); Scalable High (3-layer dyadic and 1080p on 720p applications); and Scalable High Intra (production application).
4.2.2.1.2 M15132: Verification of new SVC verification test streamsReports that the verification test bitstreams were appropriate for the test.
4.2.2.1.3 N9577 Report on SVC Verification TestsThe verification test was conducted using conditions suitable for a range of possible application scenarios for progressive video, including:– Video-conferencing with quality scalability for CIF@30fps video, and spatial scalability for
640x352@60fps video with 1280x704@60fps enhancement– Mobile TV with quality scalability for QVGA@25fps video, and spatial scalability for
[email protected] with VGA@25fps enhancement– HD TV with spatial scalability for 720p@50fps with 1080p@50fps enhancement– Movie production with spatial scalability for 1080p@25fps being the highest resolution, with
two lower resolutions
121
For the performance evaluations, SVC was compared against AVC single layer coding by means of subjective testing. Subjective tests were performed following relevant international recommendations using a controlled environment and a high number of test subjects.
The results of these tests indicate that these various types of scalability for these applications can be achieved with a bit rate overhead typically equal to or less than 10% compared to AVC single layer coding using only the highest resolution in the test case. The bit rate savings obtained by SVC compared to AVC simulcast transmission depend on the particular test case, and were found to be between 17% and 40% of the simulcast bit rate. These bit rate savings relative to simulcast are particularly important for applications in which video must be provided with different spatial resolutions, for which simulcast would previously have been the only available AVC-based standardized solution.
Full detail is available in the WG 11 N9577 parent-body document, which was made a public document by WG 11. The drafting of the verification test report was coordinated by Tobias Oelbaum.
The JVT was pleased to see the good results from the verification test, which appear to validate the merit of the SVC design.
5 Multi-view coding (MVC)
5.1 Core experiment #1 & related docs: Fine motion matching for motion skip mode in MVC
5.1.1.1.1 JVT-Z021 ( Prop 2.2) [H. Yang, Y. Chang, J. Huo (Xidian University), S. Lin, S. Gao, L. Xiong (Huawei)] CE1: Fine motion matching for motion skip mode in MVC
At the Shenzhen meeting, a fine motion matching technique was first proposed in JVT-Y037. Some issues about the technique were raised that needed clarification, and a core experiment was set up. This contribution reportedly continues with JVT-Y037, fulfills the mandates of the core experiment plan, and provides results for the proposed technique. The experimental results reportedly show that, compared with the performance of the current MVC JD and JMVM, BD gains of 0.169 dB / 4.11% and 0.083 dB / 2.11% can be obtained as an average over all views; with BD gains of 0.379 dB / 9.19% and 0.181 dB / 4.64% respectively, on average of the views that employs the proposed technique.
Clarification of reported percentage improvement relationships:– 2.11% average gain over all views (for 8 sequences) relative to current motion skip design of
JMVM.– 4.64% average gain over the views that can use motion skip (for 8 sequences) relative to
current motion skip design of JMVM.– 4.11% average gain over all views (for 8 sequences) relative to current JD, which does not
include motion skip feature.– 9.19% average gain over the views that can use motion skip (for 8 sequences) relative to
current JD, which does not include motion skip feature.
Current design of motion skip in MVC uses a 16-sample GDV (global disparity vector) resolution. Entire GOP uses same GDV value. Contribution proposes to change this to an 8-
122
sample resolution, with a block offset sent to determine the final 16x16 region in the reference picture to use for motion inference. Also send a flag to indicate whether to use a list 0 or list 1 reference picture in this process.
Supports single loop decoding.
Decoder complexity increase: Seems negligible – some extra syntax and minor calculations on MB level.Encoder complexity increase: Performs, e.g., 9*9*2 = 162 SSE calculations which were not otherwise needed (fast search is also obviously possible).
No change relative to what was proposed in JVT-Y037 (although some simplification of encoding process).
Proposal does not include CAVLC support.
Remark: That seems like a problem (perhaps for motion skip and intensity compensation in general, if CAVLC operation hasn't been tested for them).
Proposal does not include draft text. Considering that the book-keeping that is needed to infer the motion from the new non-macroblock-aligned positioning, the amount of necessary draft text might be substantial.
Remark: Gain seems small.
Remark: Might be good for support of single-loop decoding.
Remark: These results are for multi-loop decoding. How much do you get for single-loop decoding? After some discussion, rough estimates were 19% for multi-loop without new coding tools and 11% for single-loop.
Question: What would be the performance with 8-sample GDV but not adding the refinement? Response: Approximately no improvement from doing that alone – it is the refinement that provides the gain.
Remark: It seems clear that coding tools will not be in MVC phase 1.
Discussed again after further consideration of that issue.
Remark: Phase 1 seems sufficient serve the need of enabling the relevant applications that are ready to be deployed today. With or without low-level decoding tools, these tools still end up with a bit rate that is roughly proportional to the number of views. The market may not be ready yet for further profile definitions without a more major difference than this, e.g., 20% better compression than phase 1 MVC.
Later in the meeting, CAVLC operation was developed by the proponent, with roughly similar relative gain reported. Draft text was also drafted by the proponent.
JVT decision: Adopted into JMVM.
It was further agreed to set up an AHG to investigate MVC enhancements. But it was also agreed that we are in no rush to create more new MVC profiles.
123
The JMVM software is (already) structured in a way that makes it easy to remove experimental stuff from it. We need to make sure that this stays true.
5.1.1.1.2 JVT-Z033 ( Info) [Y. Chen (TUT), Y.-K. Wang (Nokia), S. Liu (USTC), M. M. Hannuksela, H. Li (Nokia)] CE1: Information on motion skip and CE 1
In this contribution, information is given on the performance of the current motion skip in JMVM and the performance and the functionalities of potential tools related to the CE 1. In multiple-loop decoding, the original motion skip reportedly contributes an average bit-saving of around 2%. The method proposed by Huawei in JVT-Z021 reportedly doubles the bit-savings to around 4%.
5.1.1.1.3 JVT-Z037 -V ( Info) [Y. Su, A. Segall (Sharp)] Verif of JVT-Z021: Fine motion matching for motion skip mode in MVC (CE1)
This document provides a verification report of Huawei's response to CE1 on motion skip mode in MVC. Huawei provided Sharp the software for JVT-Z037. Sharp reportedly carefully inspected the software, compiled the software, and generated results as specified in the CE. All results reportedly matched. JVT-Z021 was thus reported to have been verified.
5.2 MVC motion skip mode (without depth information) and related documents
5.2.1.1.1 JVT-Z030 ( Prop 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee (GIST)] Regional disparity derivation for MVC motion skip mode
This document described a method of regional disparity derivation for motion skip mode. The current motion skip mode in JMVM utilizes a global disparity vector of 16-sample precision to find the position of the corresponding macroblock for the current macroblock. However, since the multi-view scene consists of several objects and each object has its own disparity value, it was asserted that the global disparity is not enough to cover the disparity of the whole image. It was proposed to use regional disparities, instead of the global disparity, for the motion skip mode. The proposed scheme generates the disparity map for each anchor frame considering its motion vectors and then derives disparity maps for non-anchor frames using both forward and backward disparity maps. The temporal movement is also considered. Compared to JMVM 6.0, the proposed scheme reportedly achieved a similar coding gain with the previous scheme.
Results were reported for 5 sequences, 2 GOPs. Relative to the current motion skip mode, approximately the same coding performance was reported.
The proponent suggested combining with residual prediction and suggested that better results might be obtained that way.
Contribution noted.
5.2.1.1.2 JVT-Z031 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] MVC motion skip mode with residual pred
This document proposes a prediction structure for MVC which is reportedly a combination of motion skip mode (per JVT-Y058) and residual prediction. In terms of residual prediction, it was claimed that the proposed method is very similar to the residual prediction technique of SVC. The proposed method uses an integer precision global disparity vector and derives a disparity vector of motion skip mode from the global disparity vector by a shift operation. For such use, it
124
was asserted that a smoothing filter would be needed to reduce boundary artifacts, but investigation of such a filter reportedly could not be finalized due to lack of time. The proposed method reportedly showed some gain without a smoothing filter for "dense" sequences. The number of test sequences was limited – 1.4% average gain on set of 5 sequences. The contribution recommended the creation of a CE on this topic.
Remark: Adds substantial complexity (e.g. searching in encoder for integer-precision GDV value, residual handling and storage in decoder).
Contribution noted.
5.2.1.1.3 JVT-Z032 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] Clarification of motion_skip_enable_flag
This document requested a change of the conditions relating to the slice header syntax element motion_skip_enable_flag, which was newly introduced from JVT-Y207. This contribution suggested to change the conditions on the presence of the motion_skip_enable_flag so that it is not sent when it is not used, and to otherwise structure the syntax in a more logical fashion in relation to that syntax element.
JVT Decision: Adopted (conditioned on whether we actually will use the feature that this is refining the design of).
5.3 MVC motion skip mode with depth information
5.3.1.1.1 JVT-Z029 ( Prop 2.2/3.1) [G. Zhu, X. Xu, P. Yang, Y. He (Tsinghua U.), J. Zheng, X. Zheng (Hisilicon)] MVC inter-view skip mode using depth information
This contribution proposed an inter-view skip mode using depth information. A motion vector (disparity vector) of this mode is derived from the corresponding camera parameters and depth map. Experiments on the one sequence with depth ("Breakdancers") reportedly showed an average benefit of 0.288 dB / 11.37% for the P frame portion of the bitsteam.
No temporal prediction used at all in this test.
Question: What would be the percentage savings of the total bitstream?
Remark: Similar prior investigations have been done.
Remark: Perhaps depth bit rate should be accounted for.
Only tested on P slices. Only applies if (high quality) depth map is available.
Investigation of this feature is reported to only be preliminary.
Contribution noted. This may be a useful area for investigation toward some future project that may include depth map support, but does not seem to be within the scope of the current JVT MVC project.
125
5.3.1.1.2 JVT-Z046 -QV (Late Verif) [J.-Z. Xu (Microsoft)] Verif JVT-Z029This contribution was provided to verify the results reported in JVT-Z029. Encoding and decoding results were reportedly verified.
The authors of JVT-Z029 provided their modified source code to the contributor. The encoder and the decoder were compiled to get the results of the proposed scheme of JVT-Z029. And the encoder of anchor was run to get the results of anchor.
The results were reported to be exactly the same as those listed in Table I of JVT-Z029.
5.3.1.1.3 JVT-Z048 -QV (Late Verif) [H. Yang (Xidian Univ.)] Verif JVT-Z029This document was provided to verify the results reported in JVT-Z029 from Tsinghua Univ. The software was provided to the contributor for verification, and was reportedly confirmed to match the method described in the proposal. An executable file was generated by compiling the provided software, and was used to decode the provided bitstream. The provided bitstreams were reported to have been successfully decoded, and all results reportedly matched. JVT-Z029 was thus reported to have been verified.
5.4 Core experiment #2: Adaptive reference filtering for MVC
5.4.1.1.1 JVT-Z020 ( Prop 2.2/3.1) [P. L. Lai (USC), P. Pandit, P. Yin, C. Gomila (Thomson)] CE2: Adaptive reference filtering for MVC
This contribution reported work and results for CE2 (JVT-X302) to evaluate coding gain of adaptive reference filtering (ARF) for MVC. To avoid needing a two-pass encoding procedure (initial estimation, filter design, then encoding with filtered references), a fixed set of filters was reportedly designed by clustering the ARF filters with a method described in previous JVT documents (JVT-W065, JVT-X060, JVT-Y041). They were 3x3 filters with symmetric constraints as proposed in JVT-X060. The results provided in this document were for anchor-only sequence coding with P and B frames. The average bit rate savings for anchor-only sequence coding was asserted to be 5.69%, or equivalently 0.26 dB. The gains for anchor-only sequence coding were asserted to range from 0.07 dB to 0.45 dB. The asserted gain was larger for sequences with stronger focus mismatches such as Race1 and Rena.
Question: Are there applications that would use anchor-only coding? Response: Professional applications such as editing – note that there is a profile proposal for such a profile.
Question: What would the gain be when not using anchor-only coding? Seems very small.
Question: Is the gain additive to IC? Response: Not tested.
Question: Complexity impact? Extra filtering to be done during inter prediction.
Remark: Was combining the two filters considered.
Question: How is the reference filtering adaptive? A particular filter is assigned to each position in the reference picture list (no ability to change which filter is applied to which position). The only adaptivity is the dependence of the filtering on the reference picture list index.
JVT Disposition: Further study was encouraged.
126
5.5 Residual downsampling for MVC
5.5.1.1.1 JVT-Z023 ( Prop 2.2) [S. Cho, N. Hur, J. Kim, S.-I. Lee (ETRI)] Coding eff of stereoscopic video coding using residual downsampling
This contribution reported an analysis of experimental results of stereoscopic video coding using the residual-downsampling algorithm which was contributed in JVT-Y052 at the last meeting. It also compared the coding efficiency of the residual-downsampling algorithm in JVT-Y052 with that of the JMVM including 8x8 transforms for only the non-base view.
The basic concept is analogous to RRU of H.263 Annex Q / MPEG-4 pt. 2, plus also supporting horizontal-only and vertical-only downsampling.
Tested with CAVLC, IPPP coding structure, no 8x8 transform in non-base view.
Tested on three non-common-conditions test sequences, one of which was animation content.
Significant coding gain was reported at lower bit rates (with or without 8x8 transform in non-base view).
With the 8x8 transform in both the reference and modified design, the contribution reports significant gain for the use of RRU: About 8% on two natural video sequences, and 16% on one animation sequence, as a percentage of the bit rate for the second view.
Remark: But this is the percentage savings of just the bits for one of the two views – not of the total bit rate. Response: Yes, as a percentage of the total, it would be about 3% for the natural sequences and perhaps 5% for the animation.
Remark: Does this make sense? How would it perform on regular single-view video?
Remark: Common conditions sequences are not stereoscopic sequence.
Remark: Is there really anything about this proposal that makes it specific to the stereoscopic context? Perhaps not.
Remark: JMVM with and without 8x8 transform – no significant difference in performance was reported between these two. Response: Probably because this is using QVGA resolution sequences.
Not verified, not using common conditions, just a few (low-res) test sequences.
Remark: Should test using MVC common conditions.
JVT Disposition: Further study was encouraged.
5.6 Mixed-resolution MVC
127
5.6.1.1.1 JVT-Z026 ( Prop 2.2.1/3.1) [Y. Chen (TUT), Y.-K. Wang (USTC), S. Liu, M. M. Hannuksela (Nokia), H. Li (Nokia)] On asymmetric MVC
This is a follow-up proposal to JVT-Y054. "Asymmetric coding" involves the coding of two views of a stereoscopic video with different resolutions. In JVT-Y054, it was asserted that the proposed scheme has low complexity and requires a smaller decoded picture buffer size compared to downsampled inter-view prediction. Simulation results under common test condition were provided. In this proposal, results on stereoscopic video (two views) were provided. It was claimed that the proposed method has almost the same efficiency as downsampled inter-view prediction.
Rather than downsampling, uses 1/2 sample interpolation as 1/4 sample positions and uses odd integer samples as 1/2 sample positions, and even integer samples as integer-sample positions.
Results were compared to simulcast and to the downsampled reference technique. About the same performance, relative to downsampled reference technique, was reported (a small gain), and about 14% gain relative to simulcast-resolution use of MVC design.
Software? Can be made available.
Draft text? Not yet.
JVT Disposition: Further study was encouraged.
5.6.1.1.2 JVT-Z034 ( Prop 2.2) [S. Cho, B. Lee, N. Hur, J. Kim, S.-I. Lee (ETRI)] Prelim subjective test results for mixed resolution stereo video coding
This document reported preliminary results of subjective tests for mixed resolution stereo video coding. Such coding methods were envisaged to be used for 3D DMB services and systems in the future. The presented results were partly achieved in collaboration with Fraunhofer HHI. They were reported to be considered to be preliminary and to report work in progress. Test conditions were reported to be "not yet perfect" and the results were reported to be not consistent and complete. So far the results reportedly do not allow drawing reliable conclusions. However, there was reported to be some indication that mixed resolution stereo coding may outperform MVC at least in some cases, under some conditions.
Shown on 3.5 inch display, each view having 320x480 resolution, 30 fps.
3 test sequences.
CAVLC, no 8x8 transform.
Results were characterized as preliminary.
Conclusions difficult – not consistent The author indicated an interest in providing improved measurements in the future.
Disposition: Further study was encouraged.
128
5.7 MVC high-level syntax and HRD
5.7.1.1.1 JVT-Z024 ( Info) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y.-K. Wang (Nokia), C. Ying (Tech. U. Tampere)] MVC decoding process and HRD design
This informational contribution discussed some issues relating to the description of the decoding process – whether the decoding process specifies the output of only one view (i.e. repeating the decoding process when output of multiple views is needed) or the output of any number of views to be output. It was asserted that the decision on the issues may deeply affect the MVC specification text editorially. Furthermore, it was asserted that the design of the HRD for MVC may also be affected technically. Two methods with their pros and cons were discussed (single-view output or output of all view).
The goal of this contribution was reportedly to outline the issues for discussion and collect opinions from other JVT experts. It was recommended that the group carefully discuss these issues since it was asserted that they affect several important aspects of the MVC specification.
After discussion – an approach similar to that used for SVC and for separate_colour_plane_flag equal to 1 was suggested. The suggestion was to focus on the decoding of one view throughout time. Other views on which that view depends will be generated during the decoding process as needed to decode the target view. Mark the pictures of the other views as "not used for reference" after completing the decoding process for the target view (or consider them as marked as inter-view-only reference pictures and all such pictures are removed from this classification upon completion of the access unit decoding process for the target view). Pictures of other views that are not the target view are always marked as "not needed for output". Make a distinction between marking for temporal referencing and for inter-view referencing.
A BoG discussion group was formed (coordinated by A. Vetro) to further discuss the editing aspects – the exact phrasing was left to the editors.
5.7.1.1.2 JVT-Z027 ( Prop 2.2/3.1) [H. Nakamura, M. Ueda (JVC)] Comments on SPS MVC extension
This contribution proposed two changes relating to view dependency information in the sequence parameter set MVC extension. Both proposals are for redundancy reduction. These were asserted to be independent proposals.
This contribution suggested modifying the sequence parameter set MVC extension syntax.
Aspect #1: Signaling for applying inter-view prediction
Analogous to simulcast AVC coding.
The contribution notes that various aspects of simulcast coding may need some (e.g., externally specified) application-level support external to the video elementary bitstream data.
The contribution proposed a flag to indicate a lack of any inter-view dependency (rather than using existing dependency info syntax).
129
This case seems easy to detect with existing syntax, and the savings in this case is very minor – the JVT concluded that there was no real need to treat this case as special. No action was therefore taken by the JVT in this regard.
Aspect #2: anchor_ref_l0[i][j] inference from num_anchor_refs[i] for i equal to 1
The contribution proposed eliminating syntax element presence when a value can be easily inferred.
The proposal was concluded to seem to be excessive fine-tuning with negligible real effect. No action was therefore taken by the JVT in this regard.
5.7.1.1.3 JVT-Z038 ( SEI Prop 2.0/3.1) [S. Yea, A. Vetro (MERL), A. Smolic, H. Brust (HHI)] Revised syntax for SEI message on multiview acquisition information
This contribution proposed alternative syntax options for the SEI message on multiview acquisition information (camera parameters). In particular, two forms of syntax were considered: one being a floating-point representation with a variable-length mantissa, while the other also has a variable-length mantissa and follows the IEEE 754 format. The proposed options were asserted to enable a wider range of numerical values and precisions, which would reportedly overcome some shortcomings and problems with the existing syntax. Matlab scripts were provided for verifying the proposed formats.
Remark: Is there an ISO, IEC, or ITU-T spec for IEEE 754? Apparent answer: IEC 60559.
Remark: Is it OK for the SEI message to not be parsable without first parsing the SPS?
JVT decision: Adopted.
5.8 MVC profiles
5.8.1.1.1 JVT-Z028 ( Prop Profiles) [B.-M. Jeon (LG), W. S. Shim (Samsung), S. Cho (ETRI), G. H. Park (Kyung Hee U.), P. Pandit (Thomson), Y.-L. Lee (Sejong U.)] About MVC coding tools
The JVT has put two new coding tools into the JMVM: illumination compensation (including loop filtering) and motion skip to improve the coding efficiency in MVC. However, the current JD does not contain either of these new coding tools that are found in the JMVM.
Compared with simulcast H.264/AVC, JMVM 5.0 without the two new coding tools reportedly achieves about a 19% bit rate savings on average, while JMVM 5.0 with two new coding tools reportedly achieves about a 27% bit rate savings on average, when JMVM common test conditions are used for all test sequences except the "Uli" sequence. This contribution recommended that the JVT adopt the two JMVM tools into the JD (and thus presumably into intended profile plans) at this meeting.
A description of some planned MVC service deployments in Korea was presented, including mobile services in particular.
130
Question: Are the described deployments planning to use CABAC or CAVLC? Response: CAVLC? Follow-up question: But these coding tools have never been tested with CAVLC, so how could we consider specifying to use this untested configuration?
Question: How many views? 3 (autostereoscopic).
Remark: The gain for these tools, when using only 3 views, will be less than what has been measured in our common conditions tests.
Remark: The best thing for us to do to enable the application at this stage is to establish a standard which is as easy to implement as possible, based on existing implementation designs for AVC. Once the application becomes established, it might then begin to make sense to consider a more customized design.
We seem to only be able to specify profiles at this stage (for completion of standardization by July) without MB-level coding tools, considering the lack of testing of CAVLC performance for such coding tools. Agreed.
Remark: It seems worthwhile to still consider longer-term work that could include new coding tools.
The JVT decided to plan AHG activity to include testing of CAVLC performance making that a mandate of the phase 2 MVC AHG.
5.8.1.1.2 JVT-Z047 -Q / M15196 (Late Prop 2.0/3.1) [H. Kimata (NTT), H. Nakamura (JVC), T. Itoh (Fujitsu), T. Nomura (Sharp)] Proposal on Profiles for MVC (Multi-view Video Coding)
Supporting allowing presentation of this late contribution:– A. Vetro– J. Ridge– F. Istiaq– P. Purvin
Apology? Was reportedly sent.
Proposes 4 profiles for MVC.– Profile A: With no inter-view prediction (basically a simulcast profile), up to 16 views– Profile B: No temporal prediction
Remark: We ordinarily only specify aspects that have a decoder complexity impact.– Profile C: With inter-view and temporal prediction, possibly with temporal and SNR
scalability – inter-view prediction only for anchor pictursRemark: Restricting inter-view prediction to anchor pictures seems questionable.
– Profile D: Withdrawn.
The contribution advocated having some constraints on dependency structures. We should study this.
Suggestion: Define one profile with SPS-level switches of features.
Suggestion: Profile should be based on High profile.
131
Suggestion: Add a third dimension to capability specification: Not just profile & level, but profile & level & number of views.
Remark: Regarding the inclusion of SVC features, it seems like we haven't thought about how MVC and SVC can work together – does the high-level syntax work for such a scenario?
A. Vetro and H. Kimata considered these as break-out discussion subjects and reported back to the JVT. This was further discussed jointly with MPEG as reported elsewhere in this report. See JVT-Z049 and section 7.2 of this report.
5.8.1.1.3 JVT-Z049 -B (BoG Report) BoG report on MVC profilesThis document provided a report that summarized the results of break-out group activity discussions on MVC profiles during the Antalya meeting.
JVT decision: It was decided at the meeting to target the definition of a single Multiview High profile. Also, some dimensions of the level definition were outlined and discussed. The current draft text that would specify this profile and the level definition were provided.
5.9 MVC test sequences
5.9.1.1.1 JVT-Z035 / M15102 ( Info) [I. Feldmann, M. Mueller, F. Zilly, R. Tanger, K. Mueller, A. Smolic, P. Kauff, T. Wiegand (HHI)] Progress report on 3DTV video acquisition
This contribution contained a progress report about work on content creation for 3DTV. The document described a multiview camera arrangement, details of the camera hardware, planned content, disparity range, and potential use of the data.
This contribution had been reviewed by MPEG as M15102 and was not presented in detail to the JVT.
6 New AVC Proposals
6.1 Adaptive MV precision
6.1.1.1.1 JVT-Z022 / M15185 ( Prop 2.2/3.1) [S. Sekiguchi, K.Otoi, Y. Yamada, K. Asai, T. Murakami (MEI)] 4:4:4 video coding perf with adaptive MV coding
This contribution reports a potential compression performance improvement of 4:4:4 video coding with adaptive coded representation of motion vector information depending on the magnitude of the motion vector. The objective of this study was to investigate the possibility of further compression performance improvement for a 4:4:4 profile in order to apply it to consumer-level applications requiring much higher compression ratios. In the case of high-compression conditions, it was suggested to be necessary to consider further reduction of the coded bits for motion information. The studied approach here is to perform an adaptive motion vector search that limits fractional sample accuracy depending on the magnitude of motion vector, and to derive a coded representation of motion vectors assuming the adaptive accuracy motion search. Experimental results reportedly showed an advantage to the studied approach in the case of high-compression conditions.
132
This was a contribution toward a hypothetical future 4:4:4 profile oriented toward consumer applications.
The contribution suggested to use a relatively small search range with 1/4-sample search, a larger range with 1/2-sample search, and a yet larger integer search range.
On one test sequence ("Shimoda"), compared to the current JM search and syntax technology, an experiment yielded a reported 5.6% delta bit rate improvement over the 38-42 dB fidelity range.
What does this have to do with 4:4:4? Couldn't it apply equally well to other chroma formats? Response: Yes, it can.
A 50-63% bit rate savings was reported for the motion vector data portion of the bitstream with IPPP coding.
The contribution asserts a savings in memory bandwidth also.
Remark: The entropy coding aspect seems difficult to comprehend, since motion vectors are sent as deltas from predictors, whereas the precision depends on the actual MV value result rather than the delta – how would we determine the interpretation of the MVD? Response: A scheme is used which seems to work; it involves maintaining quarter-sample precision for the predictor – other details were considered difficult to explain, but were reported to work.
Question: What happens if you change the Lambda for the anchor? Response: Don't know.
Question: Was this using CAVLC or CABAC? Answer: CAVLC.
Presentation? Uploaded later.
JVT disposition: Further investigation was encouraged, including testing on more data and for other chroma formats.
6.2 SEI messages
6.2.1.1.1 JVT-Z040 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to convey suitable splice points in the bitstream
This contribution proposed a new SEI message that would convey information about a potential splice point in the bitstream located N access units subsequent to (in bitstream order) the location of the SEI or the current access unit.
A "Suitable splicing point" of order M was defined as a point in the bitstream at which some specific number of pictures M is present in the DPB that are ready for output at consecutive clock ticks, prior to any gap in output times. For example, if the DPB contains F = 5 pictures, and if the bitstream is cut at that point, and it is a suitable splicing point with M = 2, there would be 2 pictures with consecutive output times and each of the other three pictures would have one of the following two properties:– There is a "time gap" after the output times of the M pictures and before the output time of
the other picture arrives, or– The other picture has an output time that precedes the output times of the M pictures.
133
The proposed SEI message identifies a "suitable splice point" and conveys the value of M, N, and F.
Question: Fixed frame rate assumption? Response: Yes.
Question: What is the need for providing advance notice in the bitstream of these properties that are to be fulfilled at a later point in the bitstream? Response: Motivation is to reduce complexity and delay in the splicer, to enable "pre-conditioning" of the other stream that is to be spliced in, etc. Remark: Don't really understand that explanation.
Question: Why is an SEI message needed? Why can't the decoder/splicer just scan the bitstream and watch the picture properties to identify for itself the points in the bitstream that have these properties?
Remark: Need to consider the additive nature of the buffering in relation to the buffer capacity.
The contribution proposes to add flags/indicators whether the indicated "suitable splice point" immediately precedes an I or IDR.
Question: How is that aspect useful?
Question: Is there an assumption that the new data that would be concatenated after the "suitable splice point" would begin with an IDR picture? Response: Perhaps an all-Intra picture rather than an IDR picture.
Question: What to do with the other pictures that are waiting in the DPB if not followed by an IDR?
Response: 1) Send an MMCO to mark them "not used for reference", and 2) if they have not yet been output, perhaps this should not be indicated as a "suitable splice point".
Proposed syntax and semantics were shown. There were some other syntax elements in the proposal.
The proposal also includes signaling of some CPB properties.
Remark: As shown, the contribution may reflect a misconception about the meaning of PicOrderCnt (although the concept may be expressible in another way).
Remark: These proposals create "promises" that must be fulfilled later in the bitstream – but some of the "not yet fulfilled promises" may be broken by a splicing operation.
Remark: This is especially true for (e.g. older-generation) splicing equipment that has been designed without awareness of this proposed SEI message.
Response: We can specify that the promises go away upon encountering an IDR picture or end_of_stream NAL unit. Or we can specify not to make overlapping promises.
Remark: The proposal does not seem like it provides a complete solution to this "broken promises" problem.
134
Response: But we risk having other specifications developed that establish harmful constraints on bitstream or application behavior – resulting in reduced interoperability and loss of potential capability.
Remark: Managing CPB aspects is another aspect that needs study in this context.
Remark: Whenever the "M" is mismatched between the old stream and the new stream, the only thing the splicer can probably to is set no_output_of_prior_pics_flag to 1. A splice operation must manage both DPB output times and CPB output times (and decoding times, etc.). Is it really possible to produce seamless behaviour and/or maintain conforming behavior under some of these circumstances?
See further notes below.
6.2.1.1.2 JVT-Z041 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to control DPB output in non-seamless spliced bitstreams with end_of_stream
This contribution proposed a new SEI message that would convey information related to the output of DPB pictures at the splice point of non-seamless concatenated bitstreams. It was asserted that the proposed SEI message could serve as a tool to aid splicing devices, along with the end_of_stream NAL unit and no_output_of _prior_pics_flag. The proposed SEI message would be provided in the bitstream prior to the end_of_stream NAL unit to identify its location and specify the output behavior of non-previously output pictures in the DPB subsequent to the end_of_stream NAL unit.
Information that specifies the output behavior of each non-previously output DPB picture reportedly allows for outputting a picture, not outputting, or outputting the picture for a number of consecutive times prior to outputting the subsequent picture from the first bitstream.
It was asserted that the proposed SEI message could be signaled ahead with information that points to the location of the end_of_stream
Discusses gaps – non-consecutive (fixed frame rate) output times for pictures in the DPB.
There is currently no equivalent of MMCO for output marking – just no_output_of_prior_pics_flag. This is basically what is suggested in this contribution.
Remark: We need to consider the conformance implications – we have a standard that establishes requirements for conformance that we can't change (at least not easily/substantially).
Remark: Consider the situation where a "left-over" picture from prior to the splice point ends up with the same output time as a picture from the new spliced-in coded video sequence.
See further notes below.
6.2.1.1.3 JVT-Z042 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to forewarn location of end_of_stream
This contribution proposed a new SEI message that would identify the location of an end_of_stream NAL unit in the bitstream (some number of access units prior to its placement).
135
The end_of_stream NAL unit is the last NAL unit in the access unit that ends a bitstream. In some system environments, a new bitstream may immediately follow the access unit that ended the bitstream. The proposed SEI message would indicate that an end of stream NAL unit will appear in the bitstream at a position N access units after the location of the SEI message.
See further notes below.
6.2.1.1.4 Discussion of JVT-Z040, JVT-Z041, and JVT-Z042 togetherJVT Disposition: These contributions seem to open an topic likely to require action. But further study is needed to determine exactly what to do.
Related activities may be under way in ITU-T SG 9 (J.181, incoming LS, on cue messages and codec changes) and SCTE (SCTE 35, which is a corresponding specification, and other activities on "conditioning" – how to manage the operation) with some interest also found in DVB.
The JVT planned to conduct further study of this subject, and to establish an AHG in which to perform such investigations.
6.3 Deployment issuesIn meeting discussions, it was noted that many products (esp. portable video players) have implemented the "toolbox" subset corresponding to profile_idc = 66 with constraint_set1_flag and that some application specifications have specified to use this subset, which is currently not a defined conformance point of the AVC standard. It was suggested to provide a new profile definition corresponding to those settings.
The suggested name for this new profile that arose in these discussions was the "Common Profile".
JVT Decision: The JVT suggests that its parent bodies and participants study this suggestion and provide their opinions about the desirability of this potential future action. A resolution on this topic was conveyed to the WG 11 parent body for inclusion in the parent body meeting resolutions. See item 4 of section 10 of this report.
7 Joint discussion with MPEG requirements
7.1 SVC bit depth, gamut, and chroma format scalabilityRemarks made in this discussion include the following:– How ready is the market for such a thing?– Display technology development is not mature.– Need ability to demonstrate benefits clearly.– Need good testing conditions for experiments, etc.– Need display capability for standard development.– SVC would be just an efficiency improvement of something that can be done already in
another way.
7.2 MVC profiles and levels
136
JVT-Z049-B was reviewed and the following topics were further discussed with decisions as recorded below:
Use a constraint set flag with Main/High profile values to indicate compatibility with MVC. JVT decision: Agreed.
How do we specify the memory capacity? Use a fixed multiplier of buffering capacity and maximum ref_idx value, associated with a nominal number of views expected in the profile. The fixed multiplier should be 2. JVT decision: Agreed.
View random access friendliness constraint / parallel-processing friendliness constraint: Editors to put their best effort into the draft – others can review and comment as things move forward. JVT decision: Agreed.
Should we have a slice size constraint like we have in the "professional" profiles? Yes. JVT decision: Agreed.
7.3 3DV / FVVThere was a presentation and discussion of MPEG's exploration work on 3D video / free-viewpoint video.
Two applications were discussed:– The application known as "3D video" is video for 3D display viewing.– The application known as "free viewpoint video" is video with support of extensive
navigation capability within a 3D environment.
The current focus is on the first of these two application domains.
7.4 1080p50/60 MPEG-2The JVT was informed of some MPEG-2 contributions and discussions within MPEG.
7.4.1.1.1 M14863 JNB comment on 1080p50/60 MPEG-2/H.262This contribution to MPEG was provided in support of the M14869 technical proposal discussed below.
7.4.1.1.2 M14869 Technical proposal on 1080p50/60 MPEG-2/H.262This contribution to MPEG contained a technical proposal for support of 1080p50/60 in MPEG-2/H.262.
Max bit rate 80 Mbps = same as High level, buffer size = same as High level.
4:2:2 profile? And an Intra-only 4:2:2 profile? Leaving open for further study.
No interlace support? Maintain nesting decoding capability, but prohibit interlace in top level.
MPEG indicated that it would list 5 NBs supporting this proposal.
137
MPEG's plan was indicated to be to go to the PDAM stage of the ISO/IEC approval process at this meeting. Activities toward development of an associated conformance specification and reference software were planned.
8 JVT internal operating rulesJVT decision: The following clarifications/adjustments of JVT operating rules have been adopted.
The JVT decided that participants shall to refrain from long (=more than 4 Minutes) presentations of their proposal, if the results of their coding efficiency experiments have provided less than 2% bit-rate on average (or equivalently 0.1 dB gain on average).
Presentations should also not use "cherry picking" of results for summary reporting in abstracts and presentations. Summary reports must be true summaries – not highlights of best results while ignoring worst results.
Regarding late contributions: Due to our difficulties with a large quantity of late-submitted contributions at previous meetings, the JVT has agreed that for its next meeting, no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution will be presented without having a minimum of 4 JVT participants (working for separate organizations other than that of the primary contribution author) recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Such support to allow a presentation is to be understood to not necessarily imply support of the adoption of the content of the late contribution, but only as a positive expression that the document should be allowed to be presented. Additionally, the provider of such a presented late contribution shall send an email apology to the JVT email reflector. This rule does not apply to material requested by the JVT at the meeting (e.g., reports of JVT-authorized "break out group" side activities).
For all contributions that have presentation material that is used to present them to the group (e.g., PowerPoint presentations), the presentation material should be provided along with the written contribution (within the same zip container file). PDF is preferred over PPT for presentations when the PPT filesize is large and there is no need for the slide deck to be editable by others.
All submissions must be made in JVT-Zxxx.zip format with the word docs, excel sheets and other information being in the zip container. The document must contain an abstract and be accompanied with an e-mail notification containing title, authors and abstract (identical to the one in the doc) which is no longer than 200 words and no shorter than 25 words and is written in 3rd person language in a manner that does not express endorsement of the content of the document.
On filenames inside of .zip containers – use a filename so that if someone takes the files out of the zip container, they would still know what contribution they came from. Every file (or directory) in the .zip container for document JVT-Zxxx should start with JVT-Zxxx. Example: JVT-Zxxx.doc (main document), JVT-Zxxx_presentation.pdf, JVT-Zxxx_results1.xls, etc.
When providing additional or revised files, do not include copies of files that were already included in the prior .zip archive for the same contribution and do not re-use the same filenames without adding revision numbers (r1, r2, etc.) – this saves us needing to worry about whether the files someone obtains with the same filenames are the same or different.
Independent verification (necessary for adoption of a proposal) is provided either through
138
a) independent implementation by 1 or more organization different than that of the proponent based on the textual description (after adoption, both decoder source code versions must be made publicly available along with one encoder version), or
b) providing source code to all CE participants prior to the meeting (CEs can only be joined at the meeting, when the CE is created. CEs are created at each meeting and last until the next meeting.)
Simply running binary executables provided by a proponent is not ordinarily considered independent verification. Source code should be provided and used, and the verifying party should invest a proper degree of effort to ensure that the “verification” they perform is a meaningful and professional study with significant depth rather than just a perfunctory procedural formality.
For every SEI message and every syntax element that are currently in the SVC/MVC draft, a showcase has to be provided in order to retain it in the JSVM/JMVM/JD. If such a showcase is not provided at the next meeting for an SEI message or parts of it, the SEI message or the respective parts will be removed from the JSVM/JMVM/JD. The source code and executables for the showcase must be made available.
When Core Experiments (CEs) are to be established, a first CE description should be available at the last day of the meeting (or at least within a few days). Changes of the CE description are only allowed until 3 weeks prior to the next meeting. These changes must be of evolutionary characteristic relative to the input documents on which the CE is based and must be agreed by those who contributed the respective input document(s) or be added as an option.
Contributions that are proposals of new technology that was not what was described as being tested in a CE (even if related to the tested technology) should not indicate that they are CE documents in their title and abstract.
9 List of AHGs establishedThe following JVT “ad hoc groups” (AHGs) were established to progress work on identified topics until the next meeting of the JVT.
9.1 JVT project management and errata reportingDiscussion: [email protected]: Gary Sullivan, Jens Rainer Ohm, Ajay Luthra, and Thomas WiegandMandates:– Collect errata reports on standards under management of JVT– Coordinate overall interim JVT progress– Prepare status information for JVT status reporting
9.2 JM Text, reference software, bitstream exchange and conformanceDiscussion: [email protected]: Thomas Wiegand, Karsten Sühring, Alexis Tourapis, Teruhiko Suzuki, Gary SullivanMandates:– Maintain and update JM algorithm description text– Maintain and update JM reference software and its usage manual– Facilitate exchange of test bitstreams to aid interoperability testing– Collect bitstreams for inclusion in (non-SVC) Conformance specifications– Identify and correct problems in Conformance specifications and associated bitstreams
139
9.3 SVC JSVM text, software and conformanceDiscussion: [email protected]: Heiko Schwarz, Jérome Vieron, Thomas Wiegand, Mathias Wien, Alex Eleftheriadis, Vincent BottreauMandates:– Edit and deliver improved JSVM text– Coordinate JSVM software integration– Coordinate bug-fixing process for the JSVM software– Maintain JSVM software manual– Plan, edit, and collect bitstreams for SVC conformance specification
9.4 SVC bit depth, color gamut, and chroma format scalabilityDiscussion: [email protected]: Andrew Segall, Thomas WiegandMandates:– Identify applications– Work out suggestions for detailed needs– Find/create test material– Study bit-depth reduction techniques, e.g., tone-mapping tools– Study color space and/or gamma conversion requirements– Define experiments and test conditions– Investigate software and text modification needs– Identify complexity issues
9.5 SVC FGS applications and design simplificationDiscussion: [email protected]: Justin Ridge, Marta KarczewiczMandates:– Identify applications that may require FGS functionality and their characteristics– Determine to what extent new coding tools are needed to achieve the functionality– Define experiments and test conditions relating to FGS technology– Coordinate with JSVM software effort to align JSVM software with current design– Explore simplification of FGS tool design
9.6 MVC JD and JMVM text and softwareDiscussion: [email protected]: Hideaki Kimata, Aljoscha Smolic, Purvin Pandit, Anthony Vetro, Ying ChenMandates:– Collect comments on draft, perform necessary editing and delivery.– Maintain JMVM and JD document and collect comments on the text.– Coordinate JMVM software integration– Coordinate bug-fixing process for the JMVM software– Maintain JMVM software manual
140
9.7 MVC JMVM coding toolsDiscussion: [email protected]: Ying Chen, Shan Gao, Han-Suh Koo– Investigate simplification and improvement of current JMVM coding tools (IC and motion
skip)– Investigate techniques for single loop decoding to reduce complexity starting with motion
skip– Investigate approaches for enhancing MVC coding efficiency using spatial downsampling– Investigate low-complexity methods for mobile stereoscopic 3DTV applications– Investigate other potential approaches to achieving enhanced MVC capability– Coordinate software, test material, and experiment conditions for these techniques– Evaluate performance of enhanced MVC proposals (including CAVLC operation in
particular)
9.8 Splicing operationDiscussion: [email protected]: Gary Sullivan, Arturo Rodriguez, Sam NarasimhanMandates:– Study the use of bitstream splicing in applications– Investigate potential needs for SEI data to aid in splicing operations, including consideration
of JVT-Z040, JVT-Z041, and JVT-Z042 and the issues raised in their discussion– Study the implications of ITU-T Rec. J.181 and the draft new ITU-T Rec. J.h-dpi– Gather information about activities of other relevant organization regarding the development
of specifications relating to bitstream splicing
10 Resolutions reported to WG 11 parent bodyIn addition to requesting approval of the texts described above in section 1.4 (and associated dispositions of WG 11 NB comments and expressions of thanks to WG 11 NBs for their input) and informing WG 11 of the AHGs established as described above in section 9, the following JVT resolutions were reported to the WG 11 parent body:
1. The the JVT and the video subgroup of WG 11 recommended approval of the WG 11 N9577 Report of the WG 11 SVC verification tests and the N9617 liaison statement template for conveying these results with WG 11 liaison letters.
2. The JVT and the video subgroup of WG 11 thanked Technische Universität München for use of its facilities and thanked the following participants for their substantial contributions of effort in the work on the SVC verification test: Vincent Bottreau (Thomson), Christian Keimel (Technische Universität München), Tobias Oelbaum (Technische Universität München), Heiko Schwarz (Fraunhofer HHI), and Mathias Wien (RWTH Aachen University).
3. The JVT and the video subgroup of WG 11 thanked the following companies for their financial support of the SVC verification test: Fraunhofer HHI, Microsoft, Orange, ST Microelectronics, and Vidyo.
4. The JVT and the video subgroup of WG 11, considering the apparent deployment of a significant number of products that support only the coding tool features that are in common between the Baseline, Main and High profiles of ISO/IEC 14496-10 (ITU-T Rec. H.264) Advanced Video Coding, requested NBs to provide comments regarding the potential need for specification of a new AVC "Common profile" consisting of the tool
141
constraints expressed by the syntax element combination of profile_idc equal to 66 with constraint_set1_flag equal to 1. It was suggested that comments should arrive prior to the April 2008 JVT and WG11 meetings.
5. The JVT chairmen proposed to hold the 27th JVT meeting during 23-29 April 2008 under the auspices of the meeting of ITU-T SG 16 in Geneva, CH. Further meetings are expected to be held during 20-25 July 2008 under WG 11 auspices in Hannover, DE; 12-17 October 2008 under WG 11 auspices in Busan, KR; and 27 January – 3 February 2009 under ITU-T SG 16 auspices in Geneva, CH.
Post-meeting note: The plans for the Geneva meeting were subsequently modified to start the JVT meeting on Thursday 24 April 2008 after lunch at 2:30 p.m. and end by lunchtime on Tuesday 29 April 2008, as announced by email to the JVT reflector on 18 February 2008.
11 AttendancePersons registered to attend the meeting, as recorded by a sign-in sheet circulated during the meeting, were the following (124 listed participants):
1. Amon, Peter (Siemens AG)2. Andersson, Kenneth (Ericsson)3. Asai, Kohtaro (Mitsubishi)4. Bandoh, Yukihiro (NTT)5. Bjøntegaard, Gisle (Tandberg)6. Bottreau, Vincent (Thomson R&D France)7. Bruls, Fons (Philips)8. Budagavi, Madhukar (Texas Inst.)9. Chen, Weizhong (Huawei Tech.)10. Chen, Ying (Tampere Univ. Tech.)11. Chiu, Yi-Jen (Intel)12. Cho, Sukhee (ETRI)13. Choe, Yoonsik (Yonsei Univ.)14. Choi, Hae-Chul (ETRI)15. Choi, Woongil (Samsung AIT)16. Chujoh, Takeshi (Toshiba)17. Cieplinski, Leszek (Mitsubishi Electric)18. Cock, Jan De (Ghent Univ.)19. de Casanove (Actimagine)20. de Haan, Wiebe (Philips)21. Divorra, Òscar (Thomson)22. Domański, Marek (Poznań Univ. Tech.)23. Eleftheriadis, Alex (Layered Media)24. Fröjdh, Per (Ericsson)25. Fujii, Toshiaki (Nagoya Univ.)26. Fuldseth, Arild (Tandberg)27. Gao, Shan28. Gomila, Cristina (Thomson)29. Guleryuz, Onur (Docomo USA Labs)30. Han, Dong-hoon (Sejong Univ.)31. Han, Ki Hun (Sejong Univ.)32. Hannuksela, Miska (Nokia)33. Horowitz, Michael (CoVi Tech. --> Vidyo)34. Hsiang, Shih-Ta (Motorola)
142
35. Hu, Yi (Conexant Systems)36. Husak, Walt (Dolby Labs)37. Ishtiaq, Faisal (Motorola)38. Itoh, Takashi (Fujitsu Labs)39. Jeon, Byeong-Moon (LG Electronics)40. Jeon, Byeungwoo (SKKU)41. Jeon, Yongjoon (LG Electronics)42. Jeong, Jechang (Hanyang Univ.)43. Jeong, Seyoon (ETRI)44. Jia, Jie (Sejong Univ.)45. Jung, Joël (France Telecom R&D)46. Kang, Jung Won (ETRI)47. Karczewicz, Marta (Qualcomm)48. Kim, Dae-Yeon (Sejong Univ.)49. Kimata, Hideaki (NTT)50. Klimansewski, Krynxtox (Poznań Univ. Tech.)51. Kook, Seung Ryong (Kyunghee Univ.)52. Lainema, Jani (Nokia)53. Lee, Yung Ki (Sejong Univ.)54. Lee, Yung-Lyul (Sejong Univ.)55. Lim, Chong Soon (Panasonic)56. Lim, Jung Eun (LG Electronics)57. Lim, Sung Chang (Sejong Univ.)58. Lin, Sixin (Huawei)59. Liu, Yingjia (Huawei)60. Lizcano, Leonardo (Telefonica R&D)61. Luthra, Ajay (Motorola)62. Ma, Siwei (Peking Univ.)63. Masashi, Takahashi (Hitachi)64. McAdoo, Kyle (Conexant Systems)65. Moon, Joo Hee (Sejong Univ.)66. Muczko, Marian (Telekomunikacja Polska)67. Naito, Sei (KDDI)68. Narasimhan, Sam (Motorola)69. Narroschke, Matthias (Panasonic)70. Nilsson, Mike (BT)71. Nishi, Takahiro (Panasonic)72. Oelbaum, Tobias (Tech. Univ. Munich)73. Oh, Kwan-Jung (GIST)74. Ohm, Jens-Rainer (RWTH Aachen Univ.)75. Pandit, Purvin (Thomson)76. Park, Gwang-Hoon (Kyung Hee Univ.)77. Park, Hyoung-Mee (Sejong Univ.)78. Park, Ji Ho (KETI)79. Park, Jong Tae (Kyunghee Univ.)80. Park, Joon-young (LG Electronics)81. Park, Min-Cheol (Sejong Univ.)82. Park, Min-woo (Kyung Hee Univ.)83. Park, Seung-Wook (LG Electronics)84. Pateux, Stephane (Orange - France Telecom)85. Ridge, Justin (Nokia)86. Rodriguez, Arturo (Scientific Atlanta / Cisco)87. Sampedro, Jesus (Polycom)
143
88. Schwarz, Heiko (Fraunhofer HHI)89. Segall, Andrew (Sharp Labs USA)90. Sekiguchi, Shun-ichi (Mitsubishi)91. Senoh, Takanori (Univ. Tokyo)92. Shim, Seung-yong (Sejong Univ.)93. Shim, Woo-Sung (Samsung Electronics)94. Shimizu, Shinya (NTT)95. Siong, Lee Wei (I2R)96. Smolić, Aljoscha (Fraunhofer HHI)97. Suh, Hyungsik (LG Electronics)98. Sullivan, Gary (Microsoft Corp.)99. Sun, Huifang (Mitsubishi)100. Suzuki, Teruhiko (Sony)101. Tan, Thiow Keng (NTT DoCoMo)102. Tanizawa, Akiyuki (Toshiba)103. Tomonobu, Yoshino (KDDI)104. Ugur, Kemal (Nokia)105. Um, Gimun (ETRI)106. Van de Walle, Rik (Ghent Univ.)107. Vartiainen, Juha (SPS)108. Vermeirsch, Kenneth (affiliation ?)109. Vetro, Anthony (Mitsubishi Electric)110. Wang, Xianglin (Nokia)111. Wiegand, Thomas (Fraunhofer HHI)112. Wittmann, Steffen (Panasonic)113. Won, Kwanghyun (SKKU)114. Xiong, Lianhuan (Huawei)115. Yamakage, Tomoo (Toshiba)116. Yamamoto, Tomoyuki (Sharp)117. Yamasaki, Takahiro (Oki Electric Industry)118. Yang, Haitao (Xidian Univ.)119. Yang, Jungyoup (SKKU)120. Yann, Bodo (Joost)121. Yao, Wei (I2R)122. Yoo, Jeong-Ju (ETRI)123. Yoo, Young Joe (Sejong Univ.)124. Zhu, Gang (Tsinghua Univ.)
144
Annex I – Audio report
Source: Schuyler Quackenbush, Chair
1 Opening of the meeting.............................................................................................................32 Administrative matters...............................................................................................................3
2.1 Communications from the Chair 32.2 Approval of agenda and allocation of contributions 32.3 Creation of Task Groups 32.4 Approval of previous meeting report 32.5 Review of AHG reports 32.6 Joint meetings 32.7 Received National Body Comments and Liaison matters 3
3 Record of AhG meetings...........................................................................................................33.1 AhG Meeting SAOC, Unified Speech and Audio Sunday 1000-1700 3
3.1.1 SAOC 1000-13000..........................................................................................................33.1.2 Unified Speech and Audio 1400-1700............................................................................5
4 Task group activities..................................................................................................................64.1 Joint meetings and documents from other groups 64.2 Task Group discussions 6
4.2.1 MPEG-4 audio, conformance, reference software..........................................................64.2.2 MPEG-D MPS.................................................................................................................84.2.3 MPEG-D SAOC..............................................................................................................84.2.4 MPEG-D Unified Speech and Audio..............................................................................8
5 Meeting deliverables..................................................................................................................95.1 Responses to Liaison and NB comments 95.2 Recommendations for final plenary 95.3 Establishment of Ad-hoc Groups 95.4 Approval of output documents 95.5 Press statement 9
6 Future activities.........................................................................................................................96.1 Schedule of future meetings 96.2 Agenda for next meeting 96.3 All other business 96.4 Closing of the meeting 9
Annex A Participants.................................................................................................................10Annex B Audio Contributions and Schedule.............................................................................11Annex C Task Groups...............................................................................................................15Annex D Output Documents......................................................................................................16Annex E Agenda for the 84th MPEG Audio Meeting................................................................18
145
1 Opening of the meetingThe MPEG Audio Subgroup meeting was held during the 82nd meeting of WG11, October 22-26, 2007 in Shenzhen, CN. The list of participants is given in Annex A.
2 Administrative matters2.1 Communications from the ChairThe Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for discussion in Audio plenary.
2.2 Approval of agenda and allocation of contributionsThe agenda and schedule for the meeting was discussed, edited and approved. It shows the documents contributed to this meeting and presented to the Audio Subgroup, either in the task groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems and MDS to the attention of the group. It was revised in the course of the week to reflect the progress of the meeting, and the final version is shown in Annex B.
2.3 Creation of Task GroupsTask groups were convened for the duration of the MPEG meeting, as shown in Annex C. Results of task group activities are reported below.
2.4 Approval of previous meeting reportThe 82nd Audio Subgroup meeting report was registered as a contribution, and was approved.
2.5 Review of AHG reportsThere were no requests to review any of the AHG reports.
2.6 Joint meetingsThere were no joint meetings with Audio over the course of the week.
2.7 Received National Body Comments and Liaison mattersThe NB Comments and Liaison documents for the meeting that require a response are as shown below.No. Title Response byM15072 Liaison Statement from ITU-T SG 9 [SC 29 N 9004] Audio Subgroup
3 Record of AhG meetings3.1 AhG Meeting SAOC, Unified Speech and Audio Sunday 1000-17003.1.1 SAOC 1000-13000
Oliver Hellmuth, FhG, presentedm15123 Information and Verification Results for CE
on Karaoke/solo System Improving Performance of MPEG SAOC RM0
Oliver HellmuthJohannes HilpertAndreas HölzerLeonid TerentievCornelia Falch
This notes that RM0 does not provide a very satisfying level of performance for the difficult problem of muting a foreground object as in the Karaoke application. It reviewed the technology proposed as a CE at the previous MPEG meeting. If the Fore Ground Object (FGO) is stereo, it proposes to cascade TTT-1 boxes and shows that such a cascade can be formulated as a TTN-1
box, where N=3 if FGO is mono and N=4 if FGO is stereo.146
Listening test results were presented, comparing SAOC RM0 and SAOC with the new TTN technology. In global mean performance TTN was better than SAOC RM0 in all tests at the 95% level of significance. Furthermore, for the operating points demonstrated, the SAOC TTN technology achieving scores that were solidly in the “good” region.Heiko Purnhagen, Dolby Labs, presentedm15162 Cross Verification of SAOC CE on Karaoke
enhancementJonas Engdegard
This contribution presents a listening test that provided a cross-check on the FhG Karaoke CE. In all cases, the mean performance of the TNN technology was better than the mean performance of RM0 at the 95% level of significance.Henney Oh, LGE, noted that FhG presented no evidence of performance for energy mode, and that there is no basis for incorporating this operating mode into the SAOC WD. The Chair suggested that this could be provided at the next meeting, perhaps even as a collaboration between FhG and LG.The AhG recommends that the Audio Subgroup accept the TTN prediction mode with residual coding into the SAOC WD.Jeongil Seo, ETRI, presentedm15144 Consideration on enhanced Karaoke
processing for stereo FGOJeongil SeoSeungkwon BeackKwang-ki KimKyeoungok Kang
This contribution notes that the current performance of SAOC RM0 in the karaoke application (i.e. suppression of FGO) has limited quality. ETRI suggest an alternative structure for karaoke/solo modes based on a cascade of OTT boxes in the case of stereo FGO. It further notes that the OTT box required 2 parameters while the TTT box requires 3 parametersETRI feels that the proposed technology can provide lower complexity and lower bitrate. The Chair welcomed ETRI to proceed with the CE, but noted that the proposed technology provided functionality similar to that of the FhG CE, which is recommended to be accepted into the SAOC WD. Hence there must be a significant increase in performance in order to displace the FhG CE technology. The Chair asked ETRI to give specific estimates of what, if any, resources ETRI might seed from the SAOC sometime during the MPEG week.Henney Oh, LG, presentedm15112 Comments on SAOC applications and
architecturesHenney OhYang-Won Jung
The contributions makes three suggestions: Downmix preprocessor – it suggests that mono to mono downmix be supported. Binaural transcoder - it suggests incorporating a separate binaural synthesis engine into
the SAOC decoder. MBO architecture – it suggests that in the case of Multichannel Background Object
(MBO), the downmix should be able to be either mono or stereo.
The Chair noted that the suggested modification for binaural transcoding provides no additional functionality as compared to the SAOC and MPEG Surround combination. Oliver Hellmuth, FhG, noted that in real implementation, one is free to optimize the internals relating to how to combine the SAOC and MPEG Surround functionality.The Chair suggested that it may be good to add an informative section to the SAOC specification on how to “collapse” SAOC and MPEG Surround functionalities in the case of a unified implementation.It was agree that interested parties should continue to discuss this contribution and report to the Audio Subgroup mid-week. Osamu Shimada, NEC, presentedm15110 A core experiment proposal for an additional
SAOC functionality of separating real-Osamu ShimadaToshiyuki Nomura
147
environment signals into multiple objects Akihiko SugiyamaOsamu Hoshuyama
The contribution notes that SAOC does not provide information on the nature or relationship of the multiple objects in the SAOC bitstream such that the decoder can meaningfully decode and place objects in a multi-channel presentation.Oliver Hellmuth, FhG, asked whether the current SAOC architecture with the addition of metadata that indicates that two objects are related (e.g. from the same microphone) could provide the same functionality. The Chair asked if NEC might clarify why the proposed technology (System 4) does not show significant improvement over what can be provided by the existing SAOC architecture (System 3).In conclusion, the Chair suggested that NEC have discussions with interested parties during the first part of the MPEG week and make a mid-week presentation that addresses the issues raised.
3.1.2 Unified Speech and Audio 1400-1700
Kristofer Kjörling, Dolby, presentedm15158 Homework according to the joint speech and
audio workplanKristofer KjörlingHeiko Purnhagen
This contribution reports the information requested in “Workplan for Candidate Test Items.” It did not find permission information on the item from NRSC, but did give information on where to get the DC associated with other items.Schuyler Quackenbush will contact David Layer, NRSC, to ask if MPEG can get access to this item.In addition, it presented a table that recommends the downmix, as L or (L+R)/2 and level adjustment, based on subjective evaluation.Schuyler Quackenbush, Audio Research Labs, presentedm15095 Collected Set of Possible Evaluation
GuidelinesS. Quackenbush
This contribution is merely the collection of text from various audio experts that was available on the Friday of the 82nd MPEG meeting. The presenter highlighted area in which a choice of methods must be made, but asked that discussion be deferred as the remaining contributions will a provide better vehicle for discussion.Werner Oomen, Philips, presentedm15155 Evaluation criteria and test items for unified
speech and audio codingWerner OomenErik Schuijers
This contribution covers four topics Derivation of VC – for each item and each operating point a VC is selected. Candidate test items – remove items that might duplicate the effect of oncatenated test
items. Figure of Merit – system of assigning points. Item Selection – to select a representative subset of the 38 items, as two sets: most critical
items and items that are coded with very good performance
The contribution presented the results of applying the item selection procedure using testing at 32 kb/s. Kristofer Kjörling, Dolby, presentedm15160 Thoughts on evaluation criteria for joint
speech and audio workitemKristofer KjörlingHeiko Purnhagen
This contribution covers five topics Derivation of VC – for each item, each operating point and each test site, a VC is
selected. Figure of Merit – which operating points are evaluated, and how do we pick a winner. Candidate all test items to make a single item to code – this prevents the opportunity of:
148
o Per-item tuningo Bit buffer abuse
Speech to Music transition – such items should be removed from the test, in that grading is difficult in that case that e.g. speech is handled well and music is not.
Dolby endorses the notion of using items such as the “classic” 12 MPEG items for the speech and audio process, as these are difficult and diverse items that span a large space of possible encoder “tunings.” The Speech and Audio test set should be known at the close of the April MPEG meeting.
Johannes Boehm, Thomson, presentedm15145 Thoughts on Speech and Audio Evaluation
GuidelinesOliver WuebboltJohannes Boehm
The contribution shows a method to combine the variances of a given system under test over all test sites. It recommends that the Evaluation Guidelines document
Take care when building a measure of variance or use in determining 95% CI on a global mean performance
Specify in advance what your information might be when you must “consider additional information” in order to choose a best system when the Figure of Merit fails to decide a winner.
Miyoung Kim, Samsung, presentedm15118 Comments on Unified Speech and Audio CfP
Evaluation GuidelinesMiyoung KimEunmi OhJungHoe Kim
The contribution proposes to Determine VC by pooling over all test sites Requirements – at 64 kb/s pool over all signal categories to get a single mean
performance
The Chair noted that pooling over all signal categories will result in a smaller confidence interval for that one score and thus may make the proposed 64 kb/s requirement more difficult to fulfil.Markus Multrus, FhG, presentedm15165 Comments on Speech and Audio Evaluation
GuidelinesRalf GeigerMarkus MultrusBernhard Grill
The contribution raises a number of issues Confidence intervals on the grand mean performance should be used when comparing the
performance of systems under test. The “winner” amongst systems with overlapping confidence intervals should be selected by considering additional information such as:
Operation at higher bitrates, e.g. 128 kb/s That re-use of existing MPEG technology is desirable
Miyoung Kim, Samsung, noted that it is undesirable to delay the selection process by running another listening test to get additional information. Anisse Taleb, Ericsson, stated that we cannot ask for subjective performance information at 128 kb/s because that operating point is not listed in the Call, and the Chair agreed with that statement. Ralf Geiger, FhG, noted that in a deadlocked situation an additional listening test may be the quickest way to resolve the deadlock.Schuyler Quackenbush, Audio Research Labs, presentedm15096 Draft Workplan for Testing of SA Proposals S. QuackenbushThis is a skeleton for the final workplan document. The presenter asked that interested audio experts please read and provide comments on components that are missing or could be improved.The Chair presented the AhG report, which was approved the AhG members present.
149
4 Task group activities4.1 Joint meetings and documents from other groupsThere were not joint meetings.
4.2 Task Group discussions4.2.1 MPEG-4 audio, conformance, reference software
Markus Schnell, FhG, presentedm15151 Update on AAC-ELD Verification Test Markus Schnell
Ralf GeigerThis contribution is a draft of the AAC-ELD Verification Test Workplan. It proposes two tests, the first “application-driven” which asses performance in application-driven operating points for typical material, and the second “technology-driven” which asses performance over a range of operating points for critical material. All operating modes of AAC-ELD are being tested (e.g. block length, sampling rate).For the first test, contribution proposes to use speech items from a wide range of languages from both male and female talkers. This may be “corrupted” using a set of representative office noise signals which were recorded by FhG. There was much valuable discussion on test items and how to construct test items from the signal toolbox. The Chair urged the Audio Subgroup to help in the task of specifying a process to construct the final test items that represent best practice.Tilman Liebchen, LG, presentedm15121 Update of ALS Conformance Tilman LiebchenThis contribution reports
some bugfixes in the current set of conformance data some new conformance data relating to MP4FF OAFI box Update of ASL Conformance data
The contribution proposes to Issue a DCOR on MPEG-4 Conformance to
o remove an equation from the spec and instead reference an equation in the MPEG-4 ALS specification.
o Replace incorrect ALS conformance data (due to a bug in the ASL reference software)
Conformance data for OAFI to be generated by a tool rather than as a pre-stored waveform.
It was the consensus of the Audio Subgroup to issue the DCOR from this meeting and to incorporate the new OAFI conformance data into the AAC-ELD Conformance amendment.Andreas Schneider, Dolby, presentedm15161 Proposed correction to PS conformance and
reference softwareAndreas SchneiderHeiko Purnhagen
This contribution corrects a restriction on how random access points interact with parapeter prediction in the combination of SBR and PS tools. It also corrects a disagreement between the PS specification and the PS reference software, in which the reference software must be corrected. In addition, new conformance bitstreams will be generated that removed the “bug” situation such that the old and new decoder both produces the current reference waveformsIt was the consensus of the Audio Subgroup to issue the.DCOR on conformance at this meeting and a WD on an AAC-ELD Reference Software.Pierfrancesco Bellini, University of Florence, presentedm15078 Editors Study on ISO/IEC
14496-4:2004/FPDAM 29, SMR Pierfrancesco BelliniPaolo Nesi
150
Conformance Giorgio ZoiaMaurizio Capanai
This study presents corrections to the FPDAM text as requested by National Bodies. Ralph Sperschneider, FhG, presentedm15180 WD on Audio part of MPEG-4 Conformance Manuela Schinn
Ralph SperschneiderSince this is a large document, the presenter urged audio experts to review it as homework. Furthermore, he expects that this rollup of audio-related conformance may be complete at the April MPEG meeting.Takehiro Moriya, NTT, presentedm15183 Proposed update of MPEG-4 ALS reference
software for OAFINoboru HaradaTakehiro MoriyaYutaka Kamamoto
This contribution proposes to add OAFI functionality to the ALS MPEG-4 Reference Software, and notes several bugs in the ALS reference software. It was the consensus of the Audio Subgroup to issue a DCOR on MPEG-4 Reference Software that will include the bugfixes and which will bring the Reference Software in line with the MPEG-4 Specification by incorporating the OAFI code.
4.2.2 MPEG-D MPS
Andreas Schneider, Dolby, presentedm15154 Update on MPEG Surround Conformance Andreas SchneiderThis update is summarizes as follows:
Conformance text defines 32 sequences These 32 sequences are combined with AAC and HE-AAAC as core coders, giving a total
of 64 sequences, of which 42 are available and 38 are cross-checked.It was the consensus of the Audio Subgroup to issue this work as FDAM with an editing period, but with only defining the 21 sequences that are available.
4.2.3 MPEG-D SAOC
Yang-Won Jung, LG, presentedm15111 A proposed CE on object parameter
estimation in SAOCYang-Won JungHenney Oh
The contribution proposes to modify OLD estimation such that transmition of DMG is not required, thus achieving bitrate savings.Oliver Hellmuth, FhG, noted that if DMG is not transmitted then it is not possible to recover the level (i.e. gain) of an object as input to the SAOC encoder. The Chair urged interested experts to discuss whether DMG in important for certain application scenarios and therefore should be available at the decoder.Jeongil Seo, ETRI, presentedm15143 CE on efficient decoding of a controllable
object and an MBOJeongil SeoSeungkwon BeackKwang-ki KimKyoungok Kang
This contribution proposes an efficient decoding process for FGO or MBO solo applicationLeonid Terentiev, FhG, asked for more details on the complexity reduction, which were provided on an additional slide. Oliver Hellmuth, FhG, noted that in a real-world implementation the full machinery of MPEG Surround would not be invoked, and hence the complexity figures presented by ETRI are not realistic.The Chair noted that the CE as proposed delivers only “fair” subjective results on the MUSHRA scale but at the same time appears to also deliver lower complexity. Howerver, compared the the
151
Karaoke CE that was reviewed in the AhG meeting, the ETRI proposal appears to deliver lower quality and lower complexity, which is typically not the basis for accepting new technology. The Chair suggested that ETRI discuss these issues with the SAOC proponents and report back to the group.The task group produced a Workplan for progressing the CE work which had the consensus of all CE participants.
4.2.4 MPEG-D Unified Speech and Audio
The task group continued the discussions of the AhG. The Chair proposed two additional pieces of text, one for selection of VC and the other for the Requirements. Subgroup experts gave valuable feedback to correct and clarify the mathematical expressions. The Chair identified remaining open issues with these two excerpts from the Evaluations Guidelines document, and the open issues will be discussed in break out groups.Later in the week, the Chair incorporated the new text into a revised version of the Evaluation Guidelines and added additional text for review. Identified open issues are:
Evaluated operating points (i.e. test results) used in the Requirements calculations Evaluated operating points (i.e. test results) used in the Figure of Merit calculations
The open issues in the Evaluation Guidelines document were discussed in the task group on Thursday, and several new versions of the document were produced. Friday the task group continued morning at 8AM. The group had the previous evening to review the last version of the Evaluation Guidelines document. The Chair presented that version of the document with the following additional changes and explained his motivation for the new text.
Editorial changes e.g to correct nomenclature and clarify the text. Changes to the Requirements and Figure of Merit sections that
o Remove the statistic for a subset of the testso Re-phrased the text so that the Requirements relate to the performance of the work
item.
The Chair noted that Samsung objects to the removal of the computation of the statistic for a subset of the
tests. Ericsson objects to the re-phrasing of the Requirements and Figure of Merit text.
Considering how much time and effort was spend in the Audio Subgroup discussing this document, and how divergent were many of the expert positions, the Chair declared this document to represent the consensus of the Audio Subgroup.
5 Meeting deliverables5.1 Responses to Liaison and NB commentsThe responses to Liaison and NB comments were prepared and approved.
5.2 Recommendations for final plenaryThe Audio recommendations were presented and approved.
5.3 Establishment of Ad-hoc GroupsThe following ad-hoc groups were established by the Audio subgroup:
No. Title Mtg9653 AHG on Audio Standards Maintenance No9654 AHG on Unified Speech and Audio Coding and SAOC Yes
5.4 Approval of output documentsAll output documents, shown in Annex D, were presented in Audio plenary and were approved.
152
5.5 Press statementThere was no Audio contribution to the press statement.
6 Future activities6.1 Schedule of future meetingsAd Hoc group meetings are indicated in Section 5.3. Unless otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting on the weekend preceding that meeting.
6.2 Agenda for next meetingThe agenda for the next MPEG meeting is shown in Annex E.
6.3 All other businessThere was none.
6.4 Closing of the meeting The 83rd Audio Subgroup meeting was adjourned Friday at 12:15, which the Chair noted has to be a record!
153
Annex A Participants
First Name Last NameCountry Affiliation
Pierfrancesco Bellini Italy DSI-UNIFIJohannes Boehm DE ThomsonTi Eu Chan SG I2RRalf Geiger DE Fraunhofer IISBernhard Grill DE Fraunhofer IISOliver Hellmuth DE Fraunhofer IISYang-Won Jung KR LG ElectronicsDong Soo Kim KR LG ElectronicsJunghoe Kim KR Samsung AITMi Young Kim KR SamsungKristofer Kjörling SE DolbyTerentiev Leonid DE Fraunhofer IISTilman Liebchen DE LG ElectronicsTakehiro Moriya JP NTTMarkus Multrus DE Fraunhofer IISToshiyuki Nomura JP NECTakeshi Norimatsu JP PanasonicHenney Oh KR LG Electronics
Werner Oomen NLPhilips Applied Technologies
Pierrick Philippe FRFrance Telecom R&D
Heiko Purnhagen SE DolbySchuyler Quackenbush USA ARL
Andreas Schneider DECoding Technologies
Markus Schnell DE Fraunhofer IISJeongil Seo KR ETRIOsamu Shimada JP NECRalph Sperschneider DE Fraunhofer IISAkihiko Sugiyama JP NECAnisse Taleb SE Ericsson ABYasuhiro Toguri JP Sony
David Virette FRFrance Telecom R&D
Sungyong Yoon KR LG Electronics
154
Annex B Audio Contributions and Schedule
Day / Time Task Group XSunday1000-1300 AhG: SAOCm15123 Information and Verification Results for CE on Karaoke/solo
System Improving Performance of MPEG SAOC RM0Oliver HellmuthJohannes HilpertAndreas HölzerLeonid TerentievCornelia Falch
X
m15162 Cross Verification of SAOC CE on Karaoke enhancement Jonas Engdegard Xm15144 Consideration on enhanced Karaoke processing for stereo
FGOJeongil SeoSeungkwon BeackKwang-ki KimKyeoungok Kang
X
m15112 Comments on SAOC applications and architectures Henney OhYang-Won Jung
X
m15110 A core experiment proposal for an additional SAOC functionality of separating real-environment signals into multiple objects
Osamu ShimadaToshiyuki NomuraAkihiko SugiyamaOsamu Hoshuyama
X
1300-1400 Lunch1400-1800 AhG: Unified Speech and Audio Codingm15158 Homework according to the joint speech and audio workplan Kristofer Kjörling
Heiko PurnhagenX
m15095 Collected Set of Possible Evaluation Guidelines S. Quackenbush Xm15155 Evaluation criteria and test items for unified speech and audio
codingWerner OomenErik Schuijers
X
m15160 Thoughts on evaluation criteria for joint speech and audio workitem
Kristofer KjörlingHeiko Purnhagen
X
m15145 Thoughts on Speech and Audio Evaluation Guidelines Oliver WuebboltJohannes Boehm
X
m15118 Comments on Unified Speech and Audio CfP Evaluation Guidelines
Miyoung KimEunmi OhJungHoe Kim
X
m15165 Comments on Speech and Audio Evaluation Guidelines Ralf GeigerMarkus MultrusBernhard Grill
X
m15096 Draft Workplan for Testing of SA Proposals S. Quackenbush XDiscussionRecommendations and review of AhG Report
1800- Chairs Meeting
155
Monday0900-1130 MPEG Plenary1200-1300 Audio Plenary
Welcome and commentsm15094 82nd MPEG Audio Report S. Quackenbush Xm15043 Ad Hoc Group on Audio Standards Maintenance R. Sperschneider Xm15044 Ad Hoc Group on Unified Speech and Audio Coding and
SAOCS. QuackenbushEunmi Oh
X
NB CommentsLaison
M15072 Liaison Statement from ITU-T SG 9 [SC 29 N 9004]On seamless bitstream splicing.
X
NXXXXGenerate a Liaison statement to ETSI TC DECT to say that - AAC-ELD specification is attached- Verification performance data available in April
X
1300-1400 Lunch1400-1730 MPEG-4m15151 Update on AAC-ELD Verification Test Markus Schnell
Ralf GeigerX
m15121 Update of ALS Conformance Tilman Liebchen Xm15161 Proposed correction to PS conformance and reference
softwareAndreas SchneiderHeiko Purnhagen
X
m15078 Editors Study on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance
Pierfrancesco BelliniPaolo NesiGiorgio ZoiaMaurizio Capanai
X
m15180 WD on Audio part of MPEG-4 Conformance Manuela SchinnRalph Sperschneider
X
m15183 Proposed update of MPEG-4 ALS reference software for OAFI
Noboru HaradaTakehiro MoriyaYutaka Kamamoto
X
1730-1800 MPEG Surroundm15154 Update on MPEG Surround Conformance Andreas Schneider X
Tuesday0900-1000 SAOCm15111 A proposed CE on object parameter estimation in SAOC Yang-Won Jung
Henney OhX
m15143 CE on efficient decoding of a controllable object and an MBO Jeongil SeoSeungkwon BeackKwang-ki KimKyoungok Kang
X
1000-1300 Unified Speech and Audio Coding
156
Task group activities Improve nomenclature of VC definition Discuss Requirements
1300-1400 Lunch1400-1800 Task Group Activities1900- Chairs
Wednesday0900-1100 MPEG Plenary1130-1300 Task Group Activities
Report on Tuesday’s Chairs meetingS+A show of hands on FoM operating pointstatistics discussion on Requirements tests
1300-1400 Lunch1400-1800 Task Group Activities
AAC-ELD Verification TestConstruction of test items from “toolbox”S+AEvaluationWorkplan for test item selectionWorkplan for S+A Evaluation Test
1800-2030 Social
Thursday0900-1300 Task Group Activities
S+AEvaluationWorkplan for test item selectionWorkplan for S+A Evaluation Test
1300-1400 Lunch1400-1800 Task Group ActivitiesM15072 Liaison Statement from ITU-T SG 9 [SC 29 N 9004]
Review Liaison statements1800- Chairs Meeting
Friday0800-0900 Unified Speech and Audio Coding Evaluation Guidelines0900-1300 Audio plenary
Recommendations for final plenary XEstablishment of new Ad-hoc groups XAhG Mandates X
157
Get document numbers X1000 Approve Responses to NB comments X1030 Approval of output documents X
Review of Audio presentation to MPEG plenary XAgenda for next meeting XA.O.B. XClosing of the Audio meeting X
1300-1400 Lunch1400- MPEG Plenary
158
Annex C Task Groups
1. MPEG-D Unified Speech and Audio Coding2. MPEG-D SAOC3. MPEG-D MPS4. MPEG-4 audio, conformance, reference software
159
Annex D Output DocumentsNo. Title TBP Available
14496-3 Audio9619 Workplan for AAC-ELD Verification Test No 08/01/18No. Title TBP Available
14496-4 Conformance testing9620 DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance No 08/01/189621 ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance No 08/03/149622 ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo No 08/01/189623 ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS No 08/01/25
9624ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and additional AAC Conformance
No 08/01/18
9625 DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance No 08/01/189626 ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance No 08/01/189627 MPEG-4 Audio Conformance Rollup No 08/01/18No. Title TBP Available
14496-5 Reference Software 9628 ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS No 08/01/259629 ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw. No 08/01/18
9630Study on ISO/IEC 14496-5:2001/FPDAM 20, Reference Software for MPEG-1/2 Audio in MPEG-4 and BSAC Extensions
No 08/01/18
No. Title TBP Available23003-1 MPEG Surround
9631DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround Conformance
No 08/01/18
9632 ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance No 08/03/149633 Workplan on MPEG Surround Conformance No 08/01/18
9634DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround Reference Sw.
No 08/01/18
9635 ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw. No 08/03/14No. Title TBP Available
23003-2 SAOC9636 Status and Workplan on SAOC Core Experiments No 08/01/189637 WD on SAOC Text and Reference Software No 08/02/15No. Title TBP Available
Exploration – Unified Speech and Audio coding9638 Evaluation Guidelines for Unified Speech and Audio Proposals YES 08/01/189639 Workplan on Speech and Audio Material Selection No 08/01/18
9640Draft Workplan on Subjective Testing of Unified Speech and Audio Coding Proposals
No 08/01/18
No. Title TBP AvailableLiaison Statements
9641 Liaison Statement to ETSI TC DECT No 08/01/189660 Liaison Statement to ITU-T SG 16 No 08/01/18
160
Annex E Agenda for the 84th MPEG Audio Meeting
Agenda Item1. Opening of the meeting2. Administrative matters
2.1. Communications from the Chair2.2. Approval of agenda and allocation of contributions2.3. Review of task groups and mandates2.4. Approval of previous meeting report2.5. Review of AhG reports 2.6. Joint meetings2.7. Received national body comments and liaison matters
3. Plenary issues4. Task group activities
4.1. Spatial Audio Object Coding4.2. Unified Speech and Audio Coding4.3. MPEG Maintenance, including MPEG-1, MPEG-2, MPEG-4, SMR and MPEG
Surround issues5. Discussion of unallocated contributions6. Meeting deliverables
6.1. Responses to Liaison and NB comments6.2. Recommendations for final plenary6.3. Establishment of new Ad-hoc groups6.4. Approval of output documents6.5. Press statement
7. Future activities8. Agenda for next meeting9. A.O.B10. Closing of the meeting
161
Annex J – 3DG report
Source: Marius Preda, Chair
1 Opening of the meeting
1.1 Approval of the agendaThe agenda is approved.
1.2 Goals for the weekThe goals of this week are: Review FAMC results and edit the FPDAM Review 3DGCM related contributions and edit the Study of CD Review on-going AFX experiments Promote the 3DGC profiles Review contributions on reference software and edit the related output documents Review contributions on conformance and edit the related output documents Review Liaisons to MPEG 3DG Review and promote 3DG related demonstrations Investigate future developments of MPEG 3D Graphics
1.3 Standards from 3DGCStd Pt Edit. Project Description CfP WD CD
PDAMDCOR
FCDFPDAM
FDISFDAMCOR
4 4 2004 Amd.32 FAMC conformance 07/04 07/10 08/04 08/104 4 2004 Amd.33 Multiresolution profile
conformance07/04 07/10 08/04 08/10
4 4 2004 Amd.34 3DGC Model Conf. 08/01 08/04 08/104 5 2001 Amd.21 FAMC reference
software07/04 07/10 08/04 08/10
4 5 2001 Amd.22 3DGC Model RefSoft 06/07 08/01 08/04 08/104 16 2006 Amd.1/Cor.1 3DMC ext. corr. 07/10 08/044 16 2006 Amd.2 FAMC 07/07 08/01 08/074 16 2006 Amd.3 3D Multiresolution
profile07/07 07/10 08/04
4 11 2006 Amd.xxx Scene partitioning 07/07 08/04 08/10 09/014 16 200x 3rd Ed. AFX 08/01 08/104 25 200x 3D Graphics
Compression model07/04 07/10 08/04 08/10
4 16 Low complexity mesh compression
162
1.4 Room allocation3DGC: Tombak
163
1.5 Allocation of contributionsN° Title Schedule ActivityD1 Monday D1
MPEG Plenary D1 09:00~11:30
MPEG General
3DG Plenary D1 12:30~13:00 3DG General
Roll call, Agenda, Goals, FAQ, etc., Marius PredaStatus of www.mpeg-3dgc.org/www.mpeg-3dgc.com Patrick Gioia
15042 Report of AHG on 3DGC documents, experiments and software maintenance Jeong-Hwan Ahn, Nikolce Stefanoski
Dissemination Karsten Muller, Marius PredaRefSoft Policy Marius Preda
Lunch Break D1 13:00~14:00
FAMC (AMD2) D1 14:00~15:30
m15149 FAMC decoder conformance Khaled Mamou, Titus Zaharia, Françoise Prêteux
m15150 FAMC integration into the MPEG-4 RefSoft Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux
m15201 GNB comments on ISO/IEC 14496-16:2006/PDAM 2 (FAMC)
Nikolce StefanoskiJörn Ostermann
Coffee Break 15:30~16:00
Core Experiments (Low Complexity Mesh Encoding) D1 16:00~17:00 CE
m15153 Low-complexity approach for static mesh compression Khaled Mamou, Titus Zaharia, Marius
164
N° Title Schedule ActivityPredaFrançoise Prêteux
Joint meeting Scene partitioning Joint with Systems (in Systems) D1
17:00~18:00D2 Tuesday D2
Core Experiments (Low Complexity Mesh Encoding) D2 09:00~09:30
xxx Open discussions all
Joint meeting Metaverse Joint with Req and Systems (in Systems) 09:30 – 10:00
Core Experiments (Low Complexity Mesh Encoding) D2 10:00~10:30
xxx Open discussions
3DGCM D2 10:30~11:00
m15085 Software Implementation for P25 Blagica Jovanova, Marius Preda, Francoise Preteux
m15086 3DGC Conformance dataset for P25 Blagica Jovanova, Marius Preda, Francoise Preteux
Coffee Break 10:30~11:00
Demo D1 11:00~11:30
m15087 3D graphics player for N93 and N95 Ivica Arsov, Marius Preda, Francoise Preteux
Profiles D2 11:30~12:00
Multi-Resolution Profile AMD 3 and Conformance Patrick Gioia
Lunch Break D2 12:00~14:00
165
N° Title Schedule Activity
Repository and benchmarking D2 14:00~15:30
m15084 Online platform for 3D graphics compression benchmarkingBenoit Le BonhommeMarius PredaFrançoise Preteux
m15219 Table of 3D models in the MPEG 3DGC repository Sikyung KimEuee S. Jang
m15198 KNB Comment on 14496-16:2006/AMD1.Corr1 (3D Mesh Coding Extension Correction)
Jeong-Hwan Ahn.Daiyong Kim.Euee S. Jang
Coffee Break 15:30~16:00D2 16:00~16:30
Part 16 AMD2 FAMC Editing allFAMC RefSoft and Conformance Editing all
D3 Wednesday D3
MPEG Plenary D3 09:00~12:00
MPEG General
Lunch Break D3 12:00~14:00
3DGC Plenary (Editing of documents) D3 14:00~18:00
DoC for FAMC allISO/IEC 14496-16 2nd Ed. AMD1 Cor1 Editing all
Coffee Break 15:30~16:00RefSoftware report for AFX tools Francisco Moran, allPart 16 AMD3 Multiresolution Profile Editing allMultiresolution Profile Conformance all3DGCM Editing all3DGCM RefSoft Editing all3DGCM Conformance Editing all
D4 Thursday D4
166
N° Title Schedule Activity
3DGC Editing and other issues D3 09:30~12:00
Joint meeting Scene Partitioning Systems (in Systems Room) 09:30 – 10:30
GNB comment on FAMC Jorn OstermanReview of the CfP for Low complexity mesh encoding Françoise Preteux
Lunch Break D3 12:00~14:00
3DGC Editing and other issues D3 14:00~18:00
Requirements for Low complexity 3D mesh compression all
AFX 3rd Edition allJoint meeting Scene Partitioning Systems (in Systems) 16:00 – 16:30
AFX 3rd Edition allD5 Friday D5
3DG output documents preparation D4 09:00~12:00 3DG General
AhGs and resolutions all
Lunch Break D5 12:00~14:00
MPEG Plenary D5 14:00~ MPEG General
167
1.6 Attendance listName Country CompanyMarius Preda France ITFrançoise Prêteux France ITPatrick Gioia France OrangeLabsFrancisco Morán Burgos Spain UPMKarsten Muller Germany FHG-HHIDaiyong Kim Korea HanyangCorey Manders Singapore IIRFarzam Farbiz Singapore IIRInkwon Kim Korea VarovisonChan-Yang Kim Korea VarovisonDan Cernea Belgium VUB
2 General issues
2.1 General discussion
2.1.1 Reference SoftwareIt is recalled that the source code of both decoder AND encoder should be provided as part of the Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing the complete software for a published technology shall conduct to the removal of the corresponding technical specification from the standard.
2.1.2 Web siteOrangeLabs proposed a new version of the web site, now available at www.mpeg-3dgc.com. The goal of the web site is to disseminate the group activities (documents, software and demonstration), to maintain the FAQ and to be active in providing answers through the use of the Forum. 3DGC contributors are kindly asked to check the web-site and provide comments.
3 AFX (14496-16) related activities
3.1 Experiments
3.1.1 CE1. Mesh Animation CompressionTitle Low-complexity approach for static mesh compression
Authors Khaled Mamou, Titus Zaharia, Marius PredaFrançoise Prêteux
Summary - TFAN: encoding the connectivity based on triangle fan decomposition of a mesh - advantages: low complexity
168
- comparison of the performances with 3DMC, 6% better in compression performances and 50% in decoding time
Resolution
This technology together with the one presented during the 82nd Meeting shows evidences that mesh compression may be performed with lower complexity than current tools. A call for proposal will be issued this meeting. The requirements document will be updated to address low complexity.
3.1.2 Frame-based animation compressionTitle GNB comments on ISO/IEC 14496-16:2006/PDAM 2 (FAMC)
Authors Nikolce Stefanoski, Jörn Ostermann
Summary
The contribution addresses the problem of FAMC compression for temporal scalability. When using the delta prediction mode, it is possible than the variable used for prediction belongs to a frame that is not decoded (due to sub sampling in time). The contribution proposes an identity prediction (as an alternative to existent delta prediction)
Resolution
Resolution:It was identified that prediction always takes place with respect to the static mesh. To clarify this aspect, the specification was updated with en explicative note.
Title FAMC decoder conformanceAuthors Khaled Mamou, Titus Zaharia, Françoise Prêteux
Summary The contribution proposes a set of MP4 formatted files showing FAMC functionalities and describe the testing condition for them.
Resolution Adopt the set of files for conformance
Title FAMC integration into the MPEG-4 RefSoftAuthors Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux
SummaryThe contribution presents the implementation of FAMC in IM1 indicating the supported functionalities. A demonstration of the reference software was shown.
Resolution
Adopt the software provided as the RefSoft for FAMC. It is recommended to upload the software on the SVN when the latter will be ready.Since providing the encoder in source code is one of the conditions of accepting MPEG technologies, it is also requested to FAMC contributors to provide the FAMC encoder.
3.1.3 Scene partitioningSP will be followed as a joint activity between Systems and 3DGC. The technology will be integrated in Part 11.
3.2 Profiles
3.2.1 Proposal for 3D MultiResolution ProfileTitle Multi-Resolution Profile AMD 3 and Conformance
Authors Patrick Gioia
169
SummaryThe contribution shows the status of Conformance for AMD3. Some editorial changes were performed. Some of the old bitstreams (3DMC and BBA) are not working in the last version of the IM1 player
Resolution Once the ReferenceSoftware is available on the SVN, INT and Samsung will verify the broken bitstreams
3.3 Maintenance
3.3.1 3DMC Extension correction for support of multiple attribute per vertex
Title KNB Comment on 14496-16:2006/AMD1.Corr1 (3D Mesh Coding Extension Correction)
Authors Jeong-Hwan Ahn, Daiyong Kim, Euee S. Jang
Summary In order to preserve the backward compatibility, it is proposed to use an existing variable (function_type) and extend its semantics.
Resolution Accepted
3.3.2 AFX 3rd EditionThe document was updated during the week. However, editing is not finished (an editing period of 2 weeks was accepted).
3.4 Dataset and benchmarkingTitle Online platform for 3D graphics compression benchmarking
Authors Benoit Le Bonhomme, Marius Preda, Françoise Preteux
Summary
The contribution introduces an online platform able to integrate encoder and decoder libraries for 3D graphics compression. The advantages of using it will be the use of the same hardware for executing the programs, the access to a large database and the rapidness in obtaining the curves and other quantitative measures.
Resolution Use the platform for benchmarking the tools submitted for standardization.
Title Table of 3D models in the MPEG 3DGC repositoryAuthors Sikyung Kim, Euee S. Jang
Summary The contribution consists in a a table presenting the specificities of each file (attributes per vertex, …) available in the MPEG database.
Resolution Upload the table on the repository web site.
3.5 SoftwareTitle RefSoftware report for AFX tools
Authors Francisco Morán BurgosSummary A document with the status of encoders for all the 3DGC bitstreams Resolution Include the document in the SVN repository, in the Reference Software section
3.6 PromotionsTitle 3D graphics player for N93 and N95
Authors Ivica Arsov, Marius Preda, Francoise Preteux
170
Summary The contribution presents an implementation of the MPEG-4 3D Graphics player for Symbian 9, able to decode and render static and animated objects
Resolution -
3.7 Future
3.7.1 CfPCall for Proposal for Low Complexity 3D Mesh Compression. More information are provided in the output document w9651.
3.7.2 MetaverseA presentation was done in Joint meeting with Systems and Requirements. No resolution yet.
4 3D Graphics Compression Model (14496-25) activities
4.1 Textual specificationThe text was reviewed and a study was issued as the output document.
4.2 Software and conformanceTitle Software Implementation for P25
Authors Blagica Jovanova, Marius Preda, Francoise Preteux
SummaryContribution on an implementation of the P25 for COLLADA, including the encoder (parser, 3DMCE, BBA, JP2K and GZIP encoders, multiplexer) and the decoder.
Resolution Accept the software as the reference software for P25. To upload it on the SVN.
Title 3DGC Conformance dataset for P25Authors Blagica Jovanova, Marius Preda, Francoise Preteux
Summary Contribution on a set of files implementing several functionalities (geometry, texture and animation)
Resolution Accept the files as Conformance Test for P25. To upload on the Conformance directory of the SVN.
5 Output documents and Resolutions of 3DGC
5.1 Part 4 Conformance testing
5.1.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available
14496-4 Conformance testing
171
9642Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC Conformance)
No 08/01/18
9643Study on PDAM of ISO/IEC 14496-4:2004 AMD33 (MultiResolution Profile Conformance)
No 08/01/18
9644 ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance) No 08/01/18
5.1.2 The 3DGC subgroup recommends nominating Mark Callow (HI Corporation) as project editor for ISO/IEC 14496-4:2004/Amd.16.
5.2 Part 5 Reference Software
5.2.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available
14496-5 Reference Software 9645 ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft) No 08/01/18
5.3 Part 16 Animation Framework eXtension (AFX)
5.3.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available
14496-16 Animation Framework eXtension (AFX)9646 Study of ISO/IEC 14496-16:2006/AMD1/DCOR1 No 08/01/18
9647DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated Mesh Compression)
No 08/01/18
9648Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Mesh Compression)
Yes 08/01/18
9649 WD2.0 of AFX 3rd Edition No 08/02/019650 Requirements for low-complexity 3D mesh compression Yes 08/01/189651 CfP for low-complexity 3D mesh compression Yes 08/01/18
5.3.2 The 3DGC subgroup thanks FNB and GNB for their comments on ISO/IEC 14496-16:2006/Amd.2.
5.3.3 The 3DGC subgroup recommends the publication of all Scene Partitioning related technologies in Part 11 of ISO/IEC 14496 and its removal from Part 16, hence conducting the removal of Part 16 Amd.4.
5.3.4 The 3DGC subgroup thanks Samsung AIT for the creation and maintenance of the first version of the MPEG-3DGC web site and also thanks Orange Labs for taking over this project.
5.4 Part 25 3D Graphics Compression Model
5.4.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available
172
14496-25 3D Graphics Compression Model9652 Study of CD of ISO/IEC 14496-25 No 08/01/18
5.5 Establishment of 3DGC Ad-Hoc GroupsN9661 AHG on 3DGC documents and software maintenanceMandate: 1. Coordinate 3DGC related conformance and reference software
2. Maintain and edit 3DGC documents 3. Coordinate editing of the www.mpeg-3dgc.com web-site
Chairmen: Patrick GioiaFrancisco Morán Burgos
Duration: Until 84th MeetingMeetings Sunday before 84th meetingReflector: mpeg-3dgc AT gti. ssr. upm. esSubscribe: http://www.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
6 Closing of the MeetingSee you in Archamps.
173