document1
TRANSCRIPT
INTERNATIONAL ORGANISATION FOR STANDARDISATIONORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC 1/SC 29/WG 11CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC 1/SC 29/WG 11 N10520Maui, HI –April 2009
Source: Leonardo Chiariglione Title: Report of 88th meetingStatus
Report of 88th meeting.................................................................................................................1Annex A – Attendance list........................................................................................................17Annex B – Agenda....................................................................................................................23Annex C – Input contributions..................................................................................................26Annex D – Output documents...................................................................................................44Annex E – Requirements report................................................................................................51Annex F – Systems report.........................................................................................................54Annex G – Video report............................................................................................................93Annex I – Audio report...........................................................................................................105Annex J – 3DG report.............................................................................................................128
Report of 88th meeting
1 OpeningThe 88th MPEG Meeting was held from 20 to 24 April 2009 at The Westin Maui Resort & Spa Maui, HI, US.
2 Roll call of participantsAnnex 1 provides the list of participants
3 Approval of agendaThe agenda adopted is provided by Annex 2
4 Allocation of contributionsAnnex 3 provides the list of input documents
1
5 Communications from ConvenorThere was no specific communication
6 Report of previous meetingThis was approved
7 Processing of NB Position PapersPapers were presented and, where appropriate, responses provided.
8 Work plan management
8.1 Media coding
8.1.1 AAC family of profiles
The following document was approved
10652 WD on AAC family of profiles
8.1.2 HD-AAC Profile
The following document was approved
10651Study on ISO/IEC 14496-3:2009/ FPDAM 1:200x, HD-AAC Profile, MPEG Surround Signalling
8.1.3 AVC Constrained Baseline Profile
The following document was approved
10540 Study Text of ISO/IEC 14496-10:200X/FPDAM 1
8.1.4 AFX 3rd edition
The following document was approved
10530 ISO/IEC 14496-16 3rd Edition
8.1.5 Scalable-complexity 3D mesh compression
The following documents were approved
10528 Study Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh Compression)
10529 Description of AFX CE and explorations
2
8.1.6 Open Font Format extensions
The following documents were approved
10627Request for ISO/IEC 14496-22:200X AMD 1 Support for many-to-one range mapping
10587Text of ISO/IEC 14496-22:200X/PDAM 1 Support for many-to-one range mapping
8.1.7 Video Tool Library
The following document was approved
10551 Description of Core Experiments in RVC
8.1.8 Spatial Audio Object Coding
The following documents were approved
10659 Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding10660 Status and Workplan on SAOC Core Experiments
8.1.9 Unified Speech and Audio Coding
The following documents were approved
10661 WD3 of USAC10662 Workplan for USAC CEs10663 Workplan on MPEG USAC Reference Encoder10664 MPEG Audio CE methodology10669 MPEG Audio Test Material for Core Experiments
8.1.10 Media Context and Control
The following documents were approved
10526 MPEG-V Extended Call for Proposals10616 WD 2.0 of Architecture10617 WD 2.0 of Control Information10618 WD 2.0 of Sensory Information10619 WD 2.0 of Avatar Information10672 WD 1.0 of Reference Software10673 WD 1.0 of Conformance
8.1.11 3D Video Coding
The following documents were approved
10570 Applications and Requirements for 3DV10552 Description of Exploration Experiments in 3D Video Coding
3
10649 Evaluation and Testing of 3D Video Coding
8.1.12 High-Performance Video Coding
The following document was approved
10553 Call for Evidence on High-Performance Video Coding
8.2 Composition coding
8.2.1 Interactive Digital Radio
The following documents were approved
10576 Requirements v3.0 for a new BIFS profile to support Interactive Digital Radio10568 Call for Proposal on additional BIFS technologies for Interactive Services for
Digital Radio
8.2.2 LASeR Adaptation
The following documents were approved
10682 LASeR Requirements10582 Study of ISO/EC 14496-20:2008 LASeR & SAF/FPDAM 2 Adaptation10585 Updated Workplan for service example of LASeR Adaptation & PMSI
8.2.3 Presentation of Structured Information
The following documents were approved
10583 DoC on ISO/IEC 14496-20:2008 LASeR & SAF/PDAM 3 PMSI10584 Text of ISO/IEC 14496-20:2008 LASeR & SAF/FPDAM 3 PMSI10585 Updated Workplan for service example of LASeR Adaptation & PMSI
8.2.4 Advanced User Interaction
The following document was approved
10586 TuC for ISO/IEC 14496-20:2000 LASeR & SAF AMD 4 Advanced User Interaction
8.3 Description coding
8.3.1 Video Signature Descriptors
The following documents were approved
10566 Request for 15938-3:2000/Amd.410542 WD 1.0 of 15938-3/Amd.4 Video Signature Descriptors10543 MPEG-7 Visual XM 35
4
10544 Description of Core Experiments in Video Signature Description development
8.3.2 Extraction and Matching of Image Signature Tools
The following documents were approved
10549 Disposition of Comments on ISO/IEC 15938-8:2002/PDAM 5 10550 Text of ISO/IEC 15938-8:2002/DAM 5 Extraction and Matching of Image Signature
Tools
8.3.3 Query format
The following documents were approved
10525 Enhanced MPEG-7 Query Format Requirements10588 WD 4.0 of ISO/IEC 15938-12:200X AMD 1 MPQF Conf. and Ref. SW10589 WD 1.0 of ISO/IEC 15938-12:200X AMD 2 Semantic Enhancement
8.4 Systems support
8.4.1 MPEG-4 Systems 4th edition
The following document was approved
10574 WD of ISO/IEC 14496-1 4th Edition
8.4.2 Registration Authority and systems extensions
The following document was approved
10575 WD of ISO/IEC 14496-1 PDAM4 Registration Authority and systems extensions
8.5 IPMP
8.5.1 Protection of Presentation Element
The following documents were approved
10592 Request for ISO/IEC 21000-4 AMD 2 Protection of Presentation Element10593 Text of ISO/IEC 21000-4 PDAM 2 Protection of Presentation Element
8.6 Digital Item
8.6.1 Presentation of Digital Item
The following documents were approved
10590 Request for ISO/IEC 21000-2 AMD 1 Presentation of Digital Item10591 Text of ISO/IEC 21000-2 PDAM 1 Presentation of Digital Item
5
8.7 Transport and File formats
8.7.1 Carriage of MVC in MPEG-2 Systems
The following documents were approved
10572 Study of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC10573 WD 2.0 of ISO/IEC 13818-1:2007 DCOR X
8.7.2 Miscellaneous additions to File Format
The following documents were approved
10579 DoC on ISO/IEC 14496-12:2008/FPDAM 1 General Improvements10580 Text of ISO/IEC 14496-12:2008/FDAM 1 General Improvements
8.7.3 AVC File Format extensions for MVC
The following document was approved
10581 Study of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format
8.7.4 MPEG Media Transport
The following document was approved
10571 Workshop on MMT (MPEG Media Transport) – Call for Contributions
8.8 Multimedia architecture
8.8.1 MXM Architecture
The following document was approved
10620 Study of ISO/IEC CD 23006-1 MxM Architecture and Technologies
8.8.2 MXM API
The following documents were approved
10621 Study of ISO/IEC CD 23006-2 MXM APIs
10623First ideas on normative APIs compliant to MXM framework for future MPEG standards
8.8.3 Advanced IPTV Terminal
The following documents were approved
6
10569 Draft Advanced IPTV Terminal (AIT) Requirements10570 Context and Objectives for Advanced IPTV Terminal
8.8.4 Rich Media UI Framework
The following documents were approved
10626 WD MPEG Rich Media UI10690 Proposal for support of MPEG Rich Media UI in WC3 Widget Recommendation10670 MPEG comments on WC3 Widget Recommendation
8.9 Application formats
8.9.1 Musical Slide Show Application Format
The following document was approved
10594 Text of ISO/IEC 23000-4 MSSAF/FDAM2 Conf. & Ref. SW for Protected MSSAF
8.9.2 Media Streaming Application Format
The following document was approved
10595 Text of ISO/IEC CD 23000-5 MSAF 2nd edition
8.9.3 DMB AF Harmonization of MPEG-2 TS storage
The following document was approved
10612Study of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage
8.9.4 Interactive Music Application Format
The following document was approved
10615 Study of ISO/IEC CD 23000-12 Interactive Music AF
8.10 Protocols
8.10.1 MXM Protocols
The following document was approved
10625 Study of ISO/IEC CD 29116-1 2nd edition MXM Protocols
7
8.11 Reference implementation
8.11.1 AAC-ELD Reference Software
The following documents were approved
10653 DoC on ISO/IEC 14496-5:2001/FPDAM 24, MPEG-4 AAC ELD10654 ISO/IEC 14496-5:2001/FDAM 24, MPEG-4 AAC ELD
8.11.2 MVC Reference Software
The following document was approved
10538 Study Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video Coding
8.11.3 Synthesized Texture Reference Software
The following documents were approved
10576 DoC on ISO/IEC 14496-5:2001/PDAM 23 Synthesized Texture Reference SW10577 Text of ISO/IEC 14496-5:2001/FPDAM 23 Synthesized Texture Reference SW
8.11.4 Image Signature Tools Reference Software
The following documents were approved
10545 Disposition of Comments on ISO/IEC 15938-6:2003/PDAM 3 10546 Text of ISO/IEC 15938-6:2003/FPDAM 3 Reference Software for Image Signature
Tools
8.11.5 Professional Archival Application Format Reference Software
The following documents were approved
10596 Study of ISO/IEC 23000-6 PA-AF/PDAM1 Conf. and Ref. SW10597 Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
8.11.6 DMB Application Format Reference Software
The following documents were approved
10598 Study of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.10599 Workplan for ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
8.11.7 Stereoscopic Video Application Format Reference Software
The following documents were approved
10628 Request for ISO/IEC 23000-11 AMD 1 Stereoscopic Video AF Ref. Soft and Conf.
8
10613 Text of ISO/IEC 23000-11 PDAM 1 Stereoscopic Video AF Ref. Soft and Conf.10614 Workplan for Stereoscopic Video Application Format Ref. Soft and Conf.
8.11.8 MXM Reference Software
The following documents were approved
10622 Study of ISO/IEC CD 23006-3 MXM Conf. & Ref. SW10683 MXM Development Roadmap for Engines and Applications10624 MXM Developer’s Day: Call for Participation
8.12 Conformance
8.12.1 MPEG-4 Video bitstreams
The following document was approved
10537 First Ideas on New MPEG-4 Video Bitstream Repository Structure
8.12.2 MVC Conformance
The following document was approved
10535 Study Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview Video Coding Conformance Testing
8.12.3 BSAC Conformance for Broadcasting
The following documents were approved
10655 Request for Amendment, 14496-26:2009/PDAM 210656 ISO/IEC 14496-26:2009/PDAM 2, BSAC Conformance for Broadcasting
8.12.4 Image Signature Tools Conformance
The following documents were approved
10547 Disposition of Comments on ISO/IEC 15938-7:2003/PDAM 5 10548 Text of ISO/IEC 15938-7:2003/FPDAM 5 Conformance Testing for Image
Signature Tools
8.12.5 Professional Archival Application Format Conformance
The following documents were approved
10596 Study of ISO/IEC 23000-6 PA-AF/PDAM1 Conf. and Ref. SW10597 Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
9
8.12.6 DMB Application Format Conformance
The following documents were approved
10598 Study of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.10599 Workplan for ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
8.12.7 Stereoscopic Video Application Format Conformance
The following documents were approved
10628 Request for ISO/IEC 23000-11 AMD 1 Stereoscopic Video AF Ref. Soft and Conf.10613 Text of ISO/IEC 23000-11 PDAM 1 Stereoscopic Video AF Ref. Soft and Conf.10614 Workplan for Stereoscopic Video Application Format Ref. Soft and Conf.
8.12.8 MXM Conformance
The following document was approved
10622 Study of ISO/IEC CD 23006-3 MXM Conf. & Ref. SW
8.13 Maintenance
8.13.1 Systems coding standards
The following document was approved
10578 Text of ISO/IEC 14496-5:2001/AMD14:2009/DCOR 1 OFF Ref. SW
8.13.2 Video coding standards
The following documents were approved
10534 Defect Report on ISO/IEC 14496-2:200410534 Defect Report on ISO/IEC 14496-2:200410541 Defect Report on ISO/IEC 14496-10:200X
8.13.3 Audio coding standards
The following documents were approved
10650 ISO/IEC 14496-3:2009/DCOR 1:200X Byte Alignment10657 Study on ISO/IEC 14496-26:2009/DCOR 1, ALS, SLS and AAC updates10658 Study on ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections
9 Organisation of this meeting
9.1 Tasks for subgroups
10
Requirements Std Pt Amd4 11 -- New BIFS Profile for Digital RadioV HapticsU User Interface framework
Loudness metadataAdvanced IPTV TerminalHVC - CfE3DV – Vision, Applications, RequirementsMMTPervasive AV scene codingNew standard areas Audio for HVCContribution to press release CfE CfP BIFS for DR CfP on Haptics for MPEG-V MMT workshop
Systems Std Pt Amd2 1 4 Carriage of MVC4 RA
4 37 FF conformance23 Synthesised texture RS? SVC FF RS
11 ? New BIFS Profile for Digital Radio12 1 Miscellanea
Cor.2 Usage of brands etc.15 3 MVC File Format20 2 Adaptation technologies for Laser
3 Presentation and Modification of Structured InformationTuC for advanced input interface
22 Open Font Format7 12 Amd1 C and RS21 2 Amd PDI
4 Amd Protection of presentation element19 MVCO
A 4 2 Protected Musical Slide Show MAF RS & C5 2nd Ed Media Streaming MAF6 1 Professional Archival AF RS & C9 Cor 1 DMB MAF9 1 DMB MAF RS & C9 2 DMB MPEG-2 TS storage
10 1 Video Surveillance AF RS & C11 Cor1 Stereoscopic video AF
1 Stereoscopic video AF RS & C12 Interactive music AF
M 1 MXM Architecture2 MXM API3 MXM RS & C
V 1 Architecture
11
2 Control Information3 Sensory information4 Avatar information
U Rich Media WidgetsCommunicationReference Software and Conformance
X Advanced IPTV TerminalUpdate MPEG technology web pageContribution to press release PMSI & PDI
Video4 2 Cor4 10 1 Constrained BL profile & suppl. enhanc. info. + Stereo
High ProfileCor 1 Miscellanea
7 3 4 Video Signature Tools6 Image Signature Tools RS7 Image Signature Tools C8 Image Signature Tools Matching and feature extraction
A 3 2 Photo Player ConformanceC 4 1 Video Tool Library Conformance & RS
2 Video Tool Library extensions3DV/FTVHVC
4 4 38 MVC Conformance15 MVC RS
Update MPEG technology web pageContribution to press release
Audio 4 3 1 SLS profile3 960/1024
4 36 AAC-ELD conformance5 24 AAC-ELD Reference Software
26 ConformanceD 2 Spatial Audio Object Coding
3 USACNew audio issues (HVC)Contribution to press release
3DG4 16 3rd Ed
4 Scalable complexity 3DMC27 3DG Conformance
1 3D Graphics Compression model Conformance5 22 3D Graphics Compression model Reference Software
25 Scene partitioning RS27 Scalability complexity 3DMC RS
3DG vision
12
V Information exchange with virtual worldsReconfigurable Graphics codingContribution to press release
9.2 Joint meetingsThe following joint meetings were held
Groups What Day Time WhereS, R AIT Mon 16:00-18:00 ReqV, R Prof/Lev, 3DV Apps Tue 09:00-10:00 Req3, V, R RGC Tue 14:00-15:00 3DG3, S MPEG-V Tue 15:00-16:00 3DG3, S MXM Tue 16:00-17:00 3DGS, R MMT Wed 14:00-16:00 ReqS, R AIT Wed 16:00-18:00 ReqR, S MPEG-V Thu 09:00-10:00 ReqS, V Carriage of MVC over MP2 Thu 09:00-11:00 SysR, V, A Pervasive AV Thu 10:00-11:00 ReqR, V HVC Thu 15:00-17:00 Vid
10 WG management
10.1 Terms of referenceThe following document was approved
10600 Terms of reference
10.2 EditorsThe following documents were approved
10604 Editors of MPEG standards10694 Editors nominated for an ISO/IEC certificate of appreciation
10.3 Liaisons The following documents were approved
10689 Liaison statement to ITU-T SG 16 Q.6/1610631 Liaison statement to JTC 1/SGSN10632 Liaison statement to ITU-T SG 16 on rights information interoperability10691 Liaison statement to IEC TC 100 on rights information interoperability10633 Liaison statement to IEC TC 100 on Multimedia Gateway in Home Networks10635 Liaison statement to IEC TC 9 10636 Liaison statement to SGDCMP on DRM technologies in MPEG
13
10637Liaison statement to IEC TC 100 on Multimedia home server systems - Conceptual model for domain management
10638Liaison statement to JTC 1/SC 34/WG 2 on Open Font Format reference software
10639Liaison statement to IEC TC 100 on Transmission of time code in the ancillary data space
10640 Liaison statement to ITU-T SG 16 on Advanced IPTV Terminal10641 Liaison statement to JTC 1/SC 34/WG 2 on media types of ISO/IEC 14496-2210642 Liaison statement to W3C on MPEG Rich Media UI 10685 Liaison statement to UPnP on MPEG Rich Media UI 10686 Liaison statement to DLNA on MPEG Rich Media UI 10643 Liaison statement to JTC 1 SWG-ARM on Professional Archival AF10644 Liaison statement to EBU on MPEG URI Assets10645 Liaison statement to W3C on MXM 10646 Liaison statement to ATSC on Carriage of SVC over MPEG-2 TS10647 Liaison statement to SC6 on extended Call for Proposal on MPEG-V10648 Liaison statement to SC25 on extended Call for Proposal on MPEG-V
10634Liaison statement to IEEE TC on Haptics on extended Call for Proposal on MPEG-V
10687 Template Liaison statement on MPEG Media Transport workshop10688 Template Liaison statement on MXM Developer’s Day10554 Liaison statement to ITU-T SG9 re 3D Video Coding10555 Liaison statement to IEC TC100 re IEC DTS 6259210556 Liaison statement to SC37 re Biometric Data Interchange10557 Liaison statement to IEC TC100 re IEC CD 6208710558 Liaison statement to CEA re 3D Video Coding10559 Liaison statement to ITU-T SG16 Q.6/16 re AVC Development10665 Response to IEC TC-100 on IEC CDV 6257110611 List of Organisations with which MPEG entertains liaisons
10.4 Ad hoc groupsThe following document was approved
10519 List of ad hoc groups established at the Maui, HI MPEG meeting
SpecificallyThe following documents were approved
10527 Adhoc on MPEG Modern Transport (MMT)10563 AHG on 3D Video Coding10533 AHG on 3DGC documents, software maintenance and core experiments10679 AHG on Advanced IPTV Terminal10677 AHG on Application Format10667 AHG on Audio Standards Maintenance10565 AHG on AVC Development10678 AHG on Font Format Representation10564 AHG on High-Performance Video Coding10560 AHG on Maintenance of MPEG-4 Visual related Documents, Reference
14
Software and Conformance10676 AHG on MPEG File Formats10562 AHG on MPEG-7 Visual10681 AHG on MPEG-V10680 AHG on MXM10561 AHG on Reconfigurable Video Coding10668 AHG on SAOC, USAC and MetaData10675 AHG on Scene Representation
10.5 Asset managementThe following documents were approved
10605 Schema assets10606 Software assets10607 Conformance assets10608 Content assets10609 URI assets and MIME types
10.6 IPR managementThe following documents were approved
10610 Standards under development for which a call for patent statements is issued10693 Proposal to improve the Software Copyright Disclaimer
10.7 Work planThe following documents were approved
10601 MPEG Standards10602 Table of unpublished FDISs10603 Work plan and time line
11 Administrative matters
11.1 Responses to National BodiesThe following documents were approved
10692 Responses to National Bodies 10666 Response to Swedish NB on 960 and 1024 block lengths
11.2 Schedule of future MPEG meetingsThe following meeting schedule was approved
# City Country yy mm dd-dd88 Maui, HI US 09 04 20-24
15
89 London UK 09 06-07 29-0390 Xian CN 09 10 26-3091 Kyoto JP 10 01 18-2292 ? DE 10 04 19-2393 Geneva CH 10 07 ??-??94 Torino IT 10 10 11-1595 ? ? 11 01 ??-??
12 Promotional activitiesThe following documents were approved
10532 MPEG 3D Graphics FAQ v2210629 MPEG DRM Vision10684 MPEG Technologies for DRM10531 MPEG 3D Graphics Vision10522 Maui, HI press release
13 Resolutions of this meetingThese were approved
14 A.O.BThere was no other business
15 ClosingThe meeting closed at 2009/04/24T20:00
16
Annex A – Attendance list
FirstName LastName Company CountryJeong-Hwan Ahn Samsung Electronics Republic of KoreaKohtaro Asai Mitsubishi Electric Corporation JapanCheung Auyeung Sony Electronics Inc. USAGun Bang ETRI Republic of KoreaVittorio Baroncini Fondazione Ugo Bordoni ItalyGero Bäse Siemens GermanySeung Kwon Beack ETRI Republic of KoreaBruno Bessette VoiceAge Corporation CanadaLazar Bivolarski Droplet Technology, Inc., USAMiroslaw Bober Mitsubishi Electric United KingdomFrank Bossen NTT DOCOMO, Inc. SwitzerlandPaul Brasnett Mitsubishi Electric United KingdomBernard Brower ITT USAFons Bruls Philips The NetherlandsTim Bruylants Vrije Universiteit Brussel BelgiumMadhukar Budagavi Texas Instruments Inc. USAJihun Cha ETRI Republic of KoreaLekha Chaisorn Institute for Infocomm Research SingaporeTi Eu Chan Institute for Infocomm Research SingaporeLulin Chen Omneon Video Networks USAYing Chen Qualcomm Inc. USA
Ka Man, Carmen ChengHong Kong Applied Science and Technology Research Institute Co. Ltd China
Leonardo Chiariglione CEDEO.net ItalyMaeng-Sub Cho ETRI Republic of KoreaBumsuk Choi ETRI Republic of KoreaByeong Ho Choi KETI Republic of KoreaHaechul Choi ETRI Republic of KoreaJin Soo Choi ETRI Republic of KoreaKiho Choi Hanyang University Republic of KoreaMiran Choi ETRI Republic of KoreaYungHo Choi Konkuk University Republic of KoreaKeiichi Chono NEC Corporation JapanTakeshi Chujoh Toshiba Corporation JapanSungmoon Chun ECT Inc., Republic of KoreaLeszek Cieplinski Mitsubishi Electric United KingdomCyril Concolato Telecom Paris Tech FranceGiovanni Cordara Telecom Italia Lab ItalyMarzia Corvaglia CNIT - unit of Brescia ItalyLeon Denis Vrije Universiteit Brussel BelgiumFrançois Devaux intoPIX BelgiumMario Doeller University of Passau Germany
17
Stefan Döhla Fraunhofer IIS GermanyMarek Domanski Poznan University of Technology PoznanYujie Dun Xi'an Jiaotong University ChinaTouradj Ebrahimi EPFL SwitzerlandScott Foshee Adobe Systems USAEdouard Francois Thomson Inc FrancePer Frôjdh Ericsson SwedenToshiaki Fujii Tokyo Institute of Technology JapanTakahiro Fukuhara Sony JapanRalf Geiger Fraunhofer IIS GermanyJean H.A. Gelissen Philips Research The NetherlandsSebastian Gerke Fraunhofer HHI GermanyDaniele Giusto University Cagliari ItalyPhilippe Gournay VoiceAge Corporation CanadaKate Grant Nine Tiles United KingdomBernhard Grill Fraunhofer IIS GermanyMarc GuezVucher SCPP FranceJong-Ki Han Sejong University Republic of KoreaWoo-Jin Han Samsung Electronics Republic of KoreaMiska Hannuksela Nokia FinlandNoboru Harada NTT JapanOliver Hellmuth Fraunhofer IIS GermanyBrian Heng Broadcom USAJürgen Herre Fraunhofer IIS GermanyArianne T. Hinds Ricoh | IBM InfoPrint Solutions Company USAJeff Huang Qualcomm Inc. USATie Jun Huang Peking University ChinaYoung Huh KERI Republic of KoreaSungjin Hur ETRI Republic of KoreaWalt Husak Dolby Laboratories USASeoYoung Hwang Samsung Republic of KoreaFaisal Ishtiaq Motorola USAKota Iwamoto NEC Corporation JapanEuee S Jang Hanyang University Republic of KoreaInseon Jang ETRI Republic of KoreaSung-Kwan Je ETRI Republic of KoreaByeong Moon Jeon LG Electronics Republic of KoreaByeungwoo Jeon SKKU Republic of KoreaDong Seok Jeong Inha University Republic of KoreaHong Jiang Intel Corporation USAJukyong Jin Inha University Republic of KoreaSanghyun Joo ETRI Republic of KoreaHari Kalva Florida Atlantic University USAKyeongok Kang ETRI Republic of KoreaSandeep Kanumuri DOCOMO Communications Laboratories USA, Inc. USAMukta Kar Cable Labs USA
18
Marta Karczewicz Kimihiko Kazui Fujitsu Laboratories Ltd. JapanKei Kikuiri NTT DOCOMO, Inc. JapanDaeyeon Kim Sejong University Republic of KoreaDaiyong Kim Hanyang University Republic of KoreaDongwon Kim Sejong University Republic of KoreaHae Kwang Kim Sejong University KoreaHui Yong Kim ETRI Republic of KoreaHyungyu Kim Hanyang University Republic of KoreaKil Joong Kim Seoul National University Bundang Hospital Republic of KoreaKioh Kim Sejong University Republic of KoreaKyuheon Kim Kyunghee Univ. Republic of KoreaSang-Kyun Kim Myongji University Republic of KoreaSeonghoon Kim Varo Vision Co., Ltd. Republic of KoreaYeongmi Kim Gwangju Institute of Science and Technology Republic of KoreaYoungseop Kim Dankook University Republic of KoreaKristofer Kjoerling Dolby Sweden SwedenTakuyo Kogure Panasonic JapanPanos Kudumakis Queen Mary University of London United KingdomChaker Larabi University of Poitiers FranceJean Le Feuvre Telecom Paris Tech FranceDaniel Lee eBay USAGunhee Lee Kyunghee Univ. Republic of KoreaGwo Giun (Chris) Lee National Cheng Kung University TaiwanHyunkook Lee LG Electronics Republic of KoreaJaejoon Lee Samsung Electronics Republic of KoreaJaeseong Lee Yonsei University Republic of KoreaJangwon Lee Kyunghee Univ. Republic of KoreaJu Ock Lee Sejong University Republic of KoreaKangchan Lee ETRI Republic of KoreaSeung Wook Lee ETRI Republic of KoreaSunyoung Lee Hanyang University Republic of KoreaTaejin Lee ETRI Republic of KoreaWonsuk Lee ETRI Republic of KoreaYung-Lyul Lee Sejong University Republic of KoreaVladimir Levantovsky Monotype Imaging USAShangwen Li Zhejiang University ChinaTilman Liebchen LG Electronics GermanyChongSoon Lim Panasonic Singapore Laboratories SingaporeTaebeom Lim KETI Republic of KoreaYoungkwon Lim net&tv Inc. Republic of KoreaChristophe Lucarz EPFL SwitzerlandAjay Luthra Motorola USAShohei Matsuo Nippon Telegraph and Telephone (NTT) Corporation JapanMarco Mattavelli EPFL SwitzerlandKen McCann ZetaCast, representing Samsung United Kingdom
19
Jim Meany Boeing USAKeiji Mitsubuchi Digital Hollywood Graduate School JapanJooHee Moon Sejong University Republic of KoreaFrancisco Morán Burgos Universidad Politècnica de Madrid SpainTakehiro Moriya NTT JapanKarsten Müller Fraunhofer HHI GermanyMarkus Multrus Fraunhofer IIS GermanyTokumichi Murakami Mitsubishi Electric Corporation JapanSang-il Na ETRI Republic of KoreaNobuhiko Naka NTT DOCOMO, Inc. JapanTakayuki Nakachi NTT JapanSam Narasimhan Motorola USAAmbarish Natu Analog Devices Inc AustraliaMax Neuendorf Fraunhofer IIS GermanyNhut Nguyen Samsung Telecom America USATakahiro Nishi Panasonic JapanToshiyuki Nomura NEC Corporation JapanTakeshi Norimatsu Panasonic Corporation JapanRyoma Oami NEC Corporation JapanShigetaka Ogawa ICT-Link JapanYukiko Ogura IPSJ/ITSCJ JapanEunmi Oh Samsung Electronics Republic of KoreaSeoung-Jun Oh Kwangwoon University Republic of KoreaWeon Geun OH ETRI Republic of KoreaJens-Rainer Ohm RWTH Aachen GermanyWerner Oomen Philips Applied Technologies NetherlandsJoern Ostermann Leibniz Universiüt Hannover GermanyHochong Park Kwangwoon University Republic of KoreaHyoungmee Park Sejong University Republic of KoreaJe-Ho Park Dankook University Republic of KoreaJeongHoon Park Samsung Electronics Republic of KoreaJiho Park KETI Republic of KoreaKyungmo Park Samsung Republic of KoreaMincheol Park Sejong University Republic of KoreaYoungcheol Park Yonsei University Republic of KoreaStephane Pateux Orange Labs FranceWen-Hsiao Peng ITRI/NCTU TaiwanPierrick Philippe Orange Labs FranceMarius Preda Institut TELECOM FranceFrancoise Preteux Institut Telecom FranceHeiko Purnhagen Dolby Sweden SwedenSchuyler Quackenbush Audio Research Labs USAMajid Rabbani Eastman Kodak USAIvana Radulovic Ericsson SwedenMickael Raulet INSA/IETR of Rennes FranceYuriy Reznik Qualcomm Inc. USA
20
Jeha Ryu Gwangju Institute of Science and Technology Republic of KoreaSatoru Sakazume Victor Company of Japan, Limited JapanGen Sasaki MegaChips Corporation JapanJunichi Sato Shikino High-Tech Co.,Ltd. JapanAndreas Schneider Dolby germany GmbH GermanyStephan Schreiner Fraunhofer IIS GermanyShun-ichi Sekiguchi Mitsubishi Electric Corporation JapanTakanori Senoh NICT JapanChan-Won Seo Sejong University Republic of KoreaJeong-Hoon Seo Sejong University Republic of KoreaJeongil Seo ETRI Republic of KoreaShinya Shimizu NTT JapanHwa Seon Shin KETI Republic of KoreaHaiyan Shu Institute for Infocomm Research Singapore David Singer Apple USAIraj Sodagar Microsoft USAJoel Sole Thomson USAKyoung Soo Son Hanyang University Republic of KoreaJaeyeon Song Samsung Republic of Korea
RalphSperschneider Fraunhofer IIS Germany
Dale Stolitzka Analog Devices Inc USAKazuo Sugimoto Mitsubishi Electric Corporation JapanDoug Suh KHU Republic of KoreaJung Suk Suh Samsung Electronics Co., Ltd. KoreaGary Sullivan Microsoft USAHuifang Sun Mitsubishi Electric Research Labs USAJaewon Sung LG Electronics Republic of KoreaTeruhiko Suzuki Sony Corp JapanYoshinori Suzuki NTT DOCOMO, Inc. JapanHerve Taddei Huawei Technologies GermanySeishi Takamura NTT Cyber Space Labs., NTT Corporation JapanTK Tan NTT DOCOMO, Inc. JapanMasayuki Tanimoto Nagoya University JapanAkiyuki Tanizawa Toshiba Corporation JapanFrederik Temmermans Vrije Universiteit Brussel BelgiumAndrew Tescher Microsoft USADong Tian Thomson Inc USAChristian Timmerer Klagenfurt University AustriaYasuaki Tokumo Sharp Corporation JapanYoshihide Tonomura NTT JapanAlexandros Tourapis Dolby Laboratories USACong Thang Truong ETRI Republic of KoreaChun-Jen Tsai ITRI/NCTU TaiwanYi-Shin Tung MStar Semiconductor, Inc TaiwanKemal Ugur Nokia Finland
21
Juha Vartiainen SFS FinlandAnthony Vetro Mitsubishi Electric Research Labs USAXin Wang ContentGuard, Inc. USAYe-Kui Wang Huawei Technologies USAMenno Wildeboer Nagoya University JapanSteffen Wittmann Panasonic GermanyOliver Wuebbolt Thomson Inc GermanyMinjie Xie Huawei Technologies (USA) USAAkio Yamada NEC Corporation USATomoo Yamakage Toshiba Corporation JapanTomoyuki Yamamoto Sharp Corporation JapanJeong Ju Yoo ETRI Republic of KoreaYoung-Joe Yoo Sejong University Republic of KoreaKyoungro Yoon Konkuk University Republic of KoreaTomonobu Yoshino KDDI Corp. JapanHaoping Yu Huawei Technologies USAKugjin Yun ETRI Republic of KoreaAidong Zhang Huawei Technologies ChinaYin Zhao Zhejiang University ChinaHuan Zhou Panasonic Singapore Laboratories SingaporeYongwei Zhu Institute for Infocomm Research Singapore
22
Annex B – Agenda
Item1 Opening 2 Roll call of participants 3 Approval of agenda 4 Allocation of contributions 5 Communications from Convenor 6 Report of previous meeting 7 Processing of NB Position Papers 8 Work plan management 1 Media coding 1 HD-AAC Profile 2 New Profile for ALS 3 Constrained Baseline Profile 4 Multiview Field High Profile 4 5 960 frame length in MPEG-4 AAC 6 AFX 3rd edition 7 Multiresolution profile 8 Scalable-complexity 3D mesh compression 9 Open Font Format extensions 10 Media Value Chain Ontology 11 Codec Configuration Representation 12 Video Tool Library 13 Spatial Audio Object Coding 14 Unified Speech and Audio Coding 15 Interfaces with Virtual Worlds 16 3D Video Coding 17 High-Performance Video Coding 18 New directions in future audio coding 2 Composition coding 1 Interactive Digital Radio 2 LASeR Adaptation 4 Presentation of Structured Information 3 Description coding 1 Video Signature Tools 2 Metadata driven post processing of audio signals 3 Audio description coding standards 4 Extraction and Matching of Image Signature Tools 4 Systems support 5 IPMP 6 Digital Item 7 Transport and File formats 1 Carriage of SVC in MPEG-2 Systems 2 Carriage of MVC in MPEG-2 Systems 3 Miscellaneous additions to File Format 4 AVC File Format extensions for MVC 8 Multimedia architecture 1 Interfaces with virtual worlds 2 MXM Architecture and API 3 MXM API
23
4 Advanced IPTV Terminal 5 Rich Media UI Framework 9 Application formats 1 Media Streaming AF 2 DMB AF Harmonization of MPEG-2 TS storage 3 Interactive Music Application Format 10 Protocols 1 MXM Protocols 11 Reference implementation 1 AAC-ELD Reference Software 2 MVC Reference Software 3 File Format Reference Software 4 Geometry and Shadow Reference Software 5 3D Graphics Compression Model Reference Software 6 Scene Partitioning Reference Software 7 Image Signature Tools Reference Software 8 Protected Musical Slide Show MAF Reference Software 9 Musical Slide Show MAF Reference Software 10 Professional Archival MAF Reference Software 11 Video Surveillance MAF Reference Software 12 MXM Reference Software 12 Conformance 1 MVC Conformance 2 MPEG-4 Audio Conformance 3 AAC-ELD, OAFI and additional AAC Conformance 4 File Format Conformance 5 Scene Partitioning Conformance 6 MultiResolution Profile Conformance 7 3D Graphics Compression Model Conformance 8 Image Signature Tools Conformance 9 Photo Player MAF Conformance 10 Musical Slide Show MAF Conformance 11 Professional Archival MAF Conformance 12 Video Surveillance MAF Conformance 13 Video Tool Library Conformance 14 MXM Conformance 13 Maintenance 1 Systems coding standards 2 Video coding standards 3 Audio coding standards 4 3DG coding standards 5 Visual description coding standards 6 Audio description coding standards 7 MPEG-21 standards 8 MPEG-A standards 14 Work plan and time line9 Organisation of this meeting 1 Tasks for subgroups 2 Joint meetings
10 WG management 1 Terms of reference 2 Officers 3 Editors 4 Liaisons 5 Work item assignment
24
6 Ad hoc groups 7 Asset management 1 Reference software 2 Conformance 3 Test material 4 URI 8 IPR management 9 Work plan
11 Administrative matters 1 Responses to National Bodies 2 Schedule of future MPEG meetings 3 Promotional activities
12 Resolutions of this meeting 13 A.O.B 14 Closing
25
Annex C – Input contributions
No. Source Title
m16237 Webmaster Maui document register
m16238 Francisco Morán Burgos, Patrick GioiaAd Hoc Group on 3DGC documents, software maintenance and core experiments
m16239 Yi-Shin Tung, Teruhiko SuzukiAd Hoc Group on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
m16240Euee S. Jang, Marco Mattavelli, Kazuo Sugimoto
Ad Hoc Group on Reconfigurable Video Coding
m16241Miroslaw Bober, Paul Brasnett, Ryoma Oami
Ad Hoc Group on MPEG-7 Visual
m16242 Karsten Müller, Anthony Vetro Ad Hoc Group on 3D Video Coding
m16243Jens-Rainer Ohm, Jörn Ostermann, Vittorio Baroncini, Ajay Luthra, Jason Suh, T.K. Tan
Ad Hoc Group on High-Performance Video Coding
m16244 Gary Sullivan, Jens-Rainer Ohm Ad Hoc Group on AVC Development
m16245 R. SperschneiderAd Hoc Group on Audio Standards Maintenance
m16246 S. Quackenbush, Pierrick PhilippeAd Hoc Group on SAOC, USAC and MetaData
m16247Young-Kwon Lim, Jaeyeon Song, Cyril Concolato
Ad Hoc Group on Scene Representation
m16248 David Singer Ad Hoc Group on MPEG File Formats
m16249Kyuheon Kim, Hui Yong Kim, Noboru Harada
Ad Hoc Group on Application Format
m16250Jean Gelissen, Sanghyun Joo, Christian Timmerer
Ad Hoc Group on MPEG-V (including previous RoSE activities)
m16251 Vladimir LevantovskyAd Hoc Group on Font Format Representation
m16252 Xin Wang, Young Kwon LimAd Hoc Group on Advanced IPTV Terminal
m16253 Filippo Chiariglione, Christian Timmerer, Ad Hoc Group on MXM
26
Victor Rodriguez, Marius Preda
m16254 ITU-T SG 9 via SC 29 SecretariatLiaison Statement from ITU-T SG 9 [SC 29 N 10125]
m16255 JTC 1/SGSN via SC 29 SecretariatLiaison Statement from JTC 1/SGSN [SC 29 N 10126]
m16256 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 14496-4:2004/FDAM 31 [SC 29 N 10127]
m16257 IEC TC 100 via SC 29 Secretariat IEC DTS 62592 [SC 29 N 10130]
m16258 SC 37 via SC 29 SecretariatISO/IEC FCD 29109-2 [SC 29 N 10152]
m16259 SC 37 via SC 29 SecretariatISO/IEC FCD 29109-4 [SC 29 N 10153]
m16260 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 14496-11:2005/FDAM 6 [SC 29 N 10155]
m16261 IEC TC 100 via SC 29 SecretariatLiaison Statement from IEC TC 100 [SC 29 N 10157]
m16262 SC 29 SecretariatSummary of Voting on ISO/IEC 23000-4:200X/FPDAM 2 [SC 29 N 10158]
m16263 SC 29 SecretariatSummary of Voting on ISO/IEC 14496-12:2008/FPDAM 1
m16264 SC 29 SecretariatSummary of Voting on ISO/IEC FCD 23003-2
m16265 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 14496-4:2004/FDAM 32
m16266 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 14496-5:2001/FDAM 14
m16267 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 23000-3:2007/FDAM 1
m16268 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC FDIS 14496-25
m16269 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 15938-3:2002/FDAM 3
m16270 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 21000-8:2008/FDAM 1
m16271 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 23000-7:2008/FDAM 1
27
m16272 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC FDIS 23004-8
m16273 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 14496-10:2008/FDAM 1
m16274 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 23000-4:200X/FDAM 1
m16275 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC FDIS 23000-6
m16276 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC 14496-5:2001/FDAM 21
m16277 ITTF via SC 29 SecretariatTable of Replies on ISO/IEC FDIS 23000-10
m16278 SC 29 SecretariatSummary of Voting on ISO/IEC 14496-5:2001/FPDAM 24
m16279 SC 29 SecretariatSummary of Voting on ISO/IEC 15938-6:2003/PDAM 3
m16280 SC 29 SecretariatSummary of Voting on ISO/IEC 15938-7:2003/PDAM 5
m16281 SC 29 SecretariatSummary of Voting on ISO/IEC TR 15938-8:2002/PDAM 5
m16282 ITU-T SG 16 via SC 29 SecretariatLiaison Statement from ITU-T SG 16 to IEC TC 100
m16283 Jean Le Feuvre Editor's review of 14496-20 FPDAM2
m16284Madhukar BudagaviMinhua Zhou
MPEG4 Simple Profile, Levels 7, 8, 9 and MPEG4 Advanced Simple Profile, Levels 6, 7, 8, 9
m16285Minhua ZhouMadhukar Budagavi
Proposed text changes for ISO/IEC 14496-2 (MPEG-4 part 2)
m16286 SC 37 via SC29 Secretariat SC 37's CDs
m16287 IEC TC 100 via SC 29 Secretariat IEC CDV 62514 [SC 29N 10185]
m16288
Miran ChoiEuisok ChungMyung-Gil JangYunkeun Lee
Use Cases for Advanced IPTV Terminal
m16289 USNB DELETED
m16290 USNB DELETED
m16291 Andy Tescher for USNB USNB Contribution: Request for new
28
levels
m16292 IEC TC 100 via SC 29 Secretariat IEC CD 62087 [SC 29 N 10210]
m16293 SC 29 SecretariatTermination of Liaison between ISO TC 46 and SC 29
m16294 SC 29 SecretariatRequest for Internal Liaison between SC 29 and IEC TC 9
m16295 Anthony VetroRevisions of Applications & Requirements on 3D Video Coding
m16296Jean-Pierre EvainChristian Timmerer
On Usage of the MPEG URI Assets
m16297 Swedish NB via SC 29 SecretariatSwedish NB comment in response to Resolution 3.1.2 in N10312
m16298 SC29 SecretariatInformation on JTC1 Study Group on Digital Content Management and Protection
m16299 [email protected] Defect report on ISO/IEC14496-26
m16300Markus Waltl Christian Timmerer
Minor Corrections to SEDL and the Usage of Schematron for SEDL Conformance
m16301Markus Waltl Christian Timmerer
An API for Sensory Effect Metadata compliant to the MPEG Extensible Middleware (MXM)
m16302 CEA via SC 29 SecretariatLiaison Statement from CEA [SC 29 N 10248]
m16303 IEC TC 100 via SC 29 Secretariat CD of IEC TS 62579
m16304 SC 34 via SC 29 SecretariatLiaison Statement from JTC 1/SC 34/WG 2 [SC 29 N 10235]
m16305Sergio ArnaldoFrancisco MoránMarcos Avilés
IndexedRegionSet: Efficient Representation of Meshes with Multiple Textures
m16306 Leonardo Chiariglione The MPEG DRM vision
m16307
Christian TimmererMichael EberhardIngo KoflerRobert Kuschnig Michael Ransburg Michael Sablatschan Hermann Hellwagner
On MPEG Modern Transport over Networks
29
m16308Christian TimmererMichael Eberhard
Updated DIA APIs and Implementation for MXM
m16309
Jonas EngdegårdHeiko PurnhagenCornelia FalchLeonid TerentievAndreas HölzerOliver HellmuthJohannes HilpertJeroen Koppens
Report on corrections for the MPEG SAOC FCD text
m16310
Leonid TerentievJürgen HerreCornelia FalchOliver Hellmuth
Clarifications regarding the enhanced Karaoke/Solo processing mode
m16311Heiko PurnhagenKristofer Kjörling
Dolby Listening Test Results for USAC CE on AVQ-based LPC
m16312Heiko PurnhagenKristofer Kjörling
Dolby Listening Test Results for USAC CE on Phase Coding in MPS
m16313
Christian TimmererMark StuartNicola CapovillaFabrizio RovatiJari AholaNjål BorchFranc Kozamernik
Input on the Advanced IPTV Terminal (AIT) Architecture
m16314Kristofer KjörlingMax Neuendorf
Progress report on harmonic transposer CE for the USAC work item
m16315
Andreas SchneiderToshiyuki NomuraHeiko PurnhagenHolger Hoerich
proposed clarification on byte alignments in LOAS streams
m16316
Philippe GournayBruno BessetteRoch LefebvreRedwan Salami
CE Report on LPC Quantization for USAC
m16317Yongzhe WangPhilipp MerkleKarsten Müller
Results of Exploration Experiments in 3D Video Coding for Dog Data Set
m16318Ivana RadulovicPer Fröjdh
3DTV Exploration Experiments on Pantomime data set
m16319 Po-Lin Lai MPEG 3DV EEs on Leaving_Laptop
30
Dong TianPatrick LopezPaul Kerbiriou
m16320
Dong TianPo-Lin LaiFons BrulsLincoln LoboWiebe de Haan
On 2D + Depth SEI Message
m16321Stefan BayerMarkus Multrus
Proposed Additions to and Corrections of the USAC Reference Software
m16322Markus MultrusRalf Geiger
Fraunhofer IIS Listeningtest Results on USAC CE for AVQ-based LPC Quantizer
m16323Max NeuendorfTaejin Lee
Report on Merge of sys2 Technology into RM0: SBR Improvements
m16324Max NeuendorfMarkus MultrusNikolaus Rettelbach
Comments on new USAC reference bitstreams
m16325Philippe GournayRoch Lefebvre
VoiceAge Test Report for USAC CE on Unvoiced Coding
m16326Carmen CHENGYan HUOYu LIU
3DV results on Dog sequence
m16327Doug Young SuhYongju Cho
For realization of MANE
m16328Olgierd StankiewiczKrzysztof WegnerKrzysztof Klimaszewski
Results of Exploration Experiments in 3D Video Coding, described in w10360, for Alt Moabit sequence.
m16329ChongSoon LimSteffen WittmannTakahiro Nishi
Reference Software And Test Results For Multiview Field High Profile
m16330ChongSoon LimSteffen WittmannTakahiro Nishi
Contribution of Stereoscopic Test sequence
m16331
ChongSoon LimSteffen WittmannTakahiro NishiAjay LuthraAnthony VetroShun-ichi SekiguchiShinya ShimizuStéphane Pateux
Standardization of a new MVC profile as specified in Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High Profile (N10344)
31
m16332Jérôme GorinMickaël Raulet
An efficient dataflow design for implementing MPEG-4 AVC decoder in the RVC framework
m16333Mickaël RauletMatthieu WipliezJörn W. Janneck
FU Parametrization and FU code generation
m16334 Stephan SchreinerProposed new Architecture of Metadata Driven Audio Post-Processing
m16335 removed removed
m16336 Jean H.A. Gelissen MPEG-V WD Contributions
m16337Khaled MamouFaouzi Ghorbel
Fast Array Encoder (FAE): an efficient extension to the QBCR compression technique
m16338Herve TaddeiDejun ZhangMinjie Xie
Huawei Core Experiment proposal for USAC
m16339Werner OomenJeroen Koppens
Philips Listening Test Results for USAC CE on Phase Coding in MPS
m16340 Andy Tescher for USNBUSNB Contribution: Comments on MMT (N10496)
m16341 IEC TC 100 via SC 29 SecretariatIEC NP: Transmission of time code in the ancillary data space
m16342 IEC TC 100 via SC 29 Secretariat
IEC NP: Terrestrial digital multimedia broadcasting (T-DMB) receivers -- Part 2: Interactive data services using BIFS (IEC 62516-2)
m16343 IEC TC 100 via SC 29 Secretariat IEC CDV 62571
m16344Filippo ChiariglioneTiejun Huang
Some issues regarding the 2nd edition of Media Streaming Application Format (MSAF)
m16345Filippo ChiariglioneTiejun Huang
Benefits from the use of Presentation of Digital Items and Event Reporting in MSAF
m16346Filippo ChiariglioneTiejun Huang
Proposed text of ISO/IEC 21000-4:200x AMD1 ? Protection of Presentation element
m16347Filippo ChiariglioneTiejun Huang
Proposal to change the title of ISO/IEC 21000-2:2005 AMD1 to ?Presentation of Digital Item?
32
m16348Filippo ChiariglioneTiejun Huang
Proposal for MXM engine and API of Presentation of Digital Item
m16349Filippo ChiariglioneTiejun Huang
Proposed text of ISO/IEC 23000-5 2nd Edition CD Media Streaming Application Format
m16350
Seo-Young HwangKyungmo ParkJaeyeon SongYoung-Kwon Lim
user interaction method on 14496-20
m16351 Seo-Young Hwang study text on LASeR PDAM3
m16352
Kyungmo ParkGiovanni GordaraCyril ConcolatoJean Le FeuvreJean-Claude Dufourd
Response to the call for technologies for MPEG RUIF
m16353 Jaeyeon song environments on MMT
m16354
Marc GauvinJaime DelgadoVictor RodriguezMiran Choi
21000-19 FCD revision
m16355
Marc GauvinJaime DelgadoVictor RodriguezMiran Choi
Editors Notes on MVCO Review
m16356
Takanori SenohKenji YamamotoRyutaro Oi Tomoyuki Mishina Makoto Okui
Report of 3D/FTV Exploration Experiment with Champagne Tower
m16357
Takanori Senoh Kenji Yamamoto Ryutaro Oi Tomoyuki Mishina Makoto Okui
Proposal of Depth Map Estimation Method in Response to Call for 3D Test Material: Depth Maps & Supplementary Information
m16358
Yasuaki TokumoShin-ya HasegawaTakuya IwanamiShuichi Watanabe
Study on MPEG-V Sensory Information
m16359 David SingerMinor or editorial errors in the Part 12 amendment
m16360 Shin-ya HasegawaYasuaki Tokumo
Study on representation of characteristics for MPEG-V Sensory
33
Takuya Iwanami Devices
m16361Anthony VetroDavid Singer
Extractors in the MVC file format
m16362 David Singer On Modern Media Transport (MMT)
m16363Jeongil SeoKyeongok KangKevin SeungChul Ham
Test Sequence Proposal for SAOC Verification Test
m16364Kangchan LeeSeungyun Lee
Proposal of the template of draft Advanced IPTV Terminal(AIT) Requirements
m16365Kangchan LeeSeungyun Lee
Proposal of conceptual diagram for advanced IPTV Terminal
m16366 Teruhiko SuzukiProposal to detect dependent view boundary in MVC
m16367 Teruhiko SuzukiClarification of track header in ISO media file format
m16368B.S. ChoiSangHyun Joo
Sensory Device Capability Metadata
m16369
Noboru HaradaHouariHendryYutaka KamamotoTakehiro MoriyaMunchurl Kim
Proposed update to the PA-AF reference software and description of the PA-AF APIs
m16370Sung-Kwan JeSang-Il NaWeon-Geun Oh
Proposal for the MPEG-7 Visual Extension, VCE-x: the ROI Signature
m16371
Sung-Kwan JeWeon-Geun OhSang-Il NaWon-Keun Yang
Preliminary feasibility test result for MPEG-7 Visual Extension, VCE-x : the ROI signature
m16372Tomonobu YoshinoSei NaitoShigeyuki Sakazawa
Performance Evaluation of Spatially Adaptive Macroblock Size Selection Scheme for HVC Test Sequences
m16373Hosang SungEunmi OhMiyoung Kim
Report on Unvoiced Speech Coding for USAC
m16374 JungHoe KimJulien RobilliardEunmi Oh
Report on Phase Coding in MPEG Surround for USAC
34
Bernhard Grill
m16375
Keiichi Chono Hirofumi Aoki Junji Tajime Yuzo Senda
Requests on Coding conditions in Call for Evidence on High-Performance Video Coding (HVC)
m16376
JungHoe KimJulien RobilliardEunmi OhBernhard Grill
Proposed Changes to WD2 for Phase Coding
m16377
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
The modified ColorCorrection sensory effect for spatio-temporal moving regions
m16378
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
Modifications of ColorCorrectionParameter Type of MPEG-V Part3 Sensory Information
m16379
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
Basic description languaues for Sensory Device Capabilities of MPEG-V Part2 Control Information
m16380
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
Basic description languaues for Sensory Device Commands of MPEG-V Part2 Control Information
m16381
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
Basic description languaues for User Sensory Preferences of MPEG-V Part2 Control Information
m16382SangHyun JooJongHyun Jang
Modified MPEG-V System Architecture
m16383
Taejin LeeMax NeuendorfJeremie LecomteMinje KimSeungkwon BeackKyeongok KangBernhard Grill
Report on Merge of sys2 Technology into RM0: TCX Improvements
35
m16384 Juergen Herre for GNBDetails of NB Position on SAOC Ballot
m16385B. S. ChoiSang Hyun Joo
User Sensory Preference Metadata
m16386
Inseon JangJeongil SeoHui Yong KimKyeongok KangKevin SeungChul Ham
Proposal of dynamic preset for IM AF
m16387
Inseon JangHui Yong KimJeongil SeoLaurent PrimauxOwen Lagadec
Editor's study on ISO/IEC 23000-12 CD Interactive music application format
m16388
Seok LeeJaejoon LeeIlsoon LimJin Young LeeHo-Cheon WeyDu-Sik Park
3DV EE1 & EE2 Results on Newspaper sequence
m16389
Masayuki TanimotoToshiaki Fujii Mehrdad Panahpour Tehrani Hisayoshi Furihata Menno Wildeboer
Error-resilient Free-viewpoint Image Generation for FTV
m16390
Masayuki TanimotoToshiaki Fujii Mehrdad Panahpour Tehrani Kazuyoshi SuzukiMenno Wildeboer
Depth Estimation Reference Software (DERS) 3.0
m16391
Masayuki TanimotoToshiaki FujiiMehrdad Panahpour TehraniNorishige FukushimaKazuyoshi SuzukiMenno Wildeboer
Semi-automatic Depth Estimation for FTV
m16392Per FröjdhTorbjörn EinarssonClinton Priddle
Adaptive Progressive Download
m16393Per FröjdhAndrey NorkinClinton Priddle
File format sub-track selection and switching
m16394 Cheon Lee EE1: Results of Depth Estimation on
36
Yo-Sung Ho 'Pantomime? Sequence
m16395Cheon LeeJiho ParkYo-Sung Ho1
EE2: Results of View Synthesis on 'Pantomime? Sequence
m16396Eun-Kyung LeeYo-Sung Ho
3-D Test Sequence - Multiview Video and Depth Map
m16397Kei KikuiriKousuke TsujinoNobuhiko Naka
Core Experiment Proposal on the eSBR module of USAC
m16398Hussein Aman-AllahIhab AmerMarco Mattavelli
AVC Entropy Coding and Bitstream Generation for the MPEG RVC Encoding Tools
m16399
Seungwook LeeBonki KooKyoungsoo SonDaiyong KimEuee S. Jang
CE Report on SC3DMC Ver 4.0
m16400Yin ZhaoLu Yu
3DV EE3 on Champagne_tower sequences
m16401
Kyoungsoo SonSeungwook Lee Bonki KooDaiyong KimEuee S. Jang
Algorithm descriptions of attribute data on SC3DMC
m16402
Seungwook LeeBonki KooKyoungsoo SonDaiyong KimEuee S. Jang
QBCR and SVA bitstream syntax update
m16403
Seungwook LeeBonki KooKyoungsoo SonDaiyong KimMingxiao ChenEuee S. Jang
A study on RVC based Graphics codec
m16404
Daiyong KimKyoungsoo SonSeungwook LeeBonki KooEuee S. Jang
Update and current status of SC3DMC
m16405 Yin ZhaoLu YuFons Bruls
LDV Reference Software for View Synthesis
37
Lincoln Lobo
m16406
Gun BangGi Mun UmNamho HurJinwoong Kim
3DV/FTV EE results of depth estimation and view synthesis on "lovebird1" sequence
m16407Yin ZhaoLu Yu
Perceptual measurement for evaluating quality of view synthesis
m16408
Jihun ChaInjae LeeYoung-kwon LimHan-Kyu LeeJinwoo Hong
Responses to LASeR PDAM3 Editor?s Note
m16409Ihab AmerMarco Mattavelli
Advances in the CE: Development of RVC Encoding Tools
m16410 KNB KNB Comment on 14496-20 PDAM3
m16411
Gun BangJaeho LeeNamho HurJinwoong Kim
Depth Estimation algorithm in SADERS1.0
m16412
Jihun ChaInjae LeeYoung-kwon LimHan-Kyu LeeJinwoo Hong
Utilization of LASeR on Rich Media UI Framework
m16413Karim MaaroufIhab AmerMarco Mattavelli
AVC Intra Prediction, Transform and Quantization for the MPEG RVC Encoding Tools
m16414
Weon-Geun OhJu-Kyong JinSang-il NaHae-Kwang KimDong-Seok Jeong
Response to the Core Experiments on Video Signature Tools
m16415 Cyril Concolato Editor's Text of 14496-1 4th edition
m16416Min-Jeong LeeHeung-Kyu Lee
Cross verification result for ETRI VCE-7 proposal
m16417Jaewon SungByeong-Moon Jeon
Improving view synthesis results based on depth quality measure
m16418Jaewon SungByeong-Moon Jeon
3DV EE results on Newspaper sequence
m16419 fons bruls Philips response to new Call for 3DV
38
rene klein gunnewiekpatrick van de walle
Test Material: Arrive book & Mobile
m16420
fons brulsrene klein gunnewiekpatrick van de walleYin ZhaoLu Yu
Philips & Zhejiang Uni response to new Call for 3DV Test Material: Champagne Tower
m16421
fons brulsrene klein gunnewiekpatrick van de wallerene van de vleuten
Philips (in coop with 3D4YOU) response to new Call for 3DV Test Material: Beergarden
m16422fons brulsrene klein gunnewiekpatrick van de walle
Philips 1st 3DV synthesis results using new test material for Arrive book, Mobile, Beergarden & Champagne Tower
m16423 fons bruls Philips 3DV EE results
m16424
Truong Cong ThangYongju ChoJung Won KangJeong-Ju Yoo
Proposed Scenario and Requirements for Advanced IPTV Terminal
m16425Blagica JovanovaMarius PredaFrançoise Preteux
Avatar Characteristics
m16426 Miska M. Hannuksela On MVC File Format
m16427Ivica ArsovMarius Preda
Integrated MXM API for 3D Graphics
m16428Sairus Patel Vladimir Levantovsky
Proposal for a new amendment of ISO/IEC 14496-22 ?Open Font Format?
m16429Aritz SanchezSebastian GerkePatrick Ndjiki-Nya
A Video Signature based on Robust Region Detectors
m16430
Paul BrasnettKota IwamotoStavros PaschalakisRyoma OamiMiroslaw Bober
Proposal on MPEG-7 Video Signature Tools
m16431Ehab Asaad HannaIhab AmerMarco Mattavelli
AVC Inter-frame Prediction for the MPEG RVC Encoding Tools
m16432 Sehoon Yea Results of Exploration Experiments in
39
Zafer AricanAnthony Vetro
3D Video for Lovebird2
m16433 Schuyler Quackenbush 87th MPEG Audio Report
m16434 Schuyler Quackenbush Draft Revised Audio CE Methodology
m16435
Khaled MamouTitus ZahariaMarius PredaFrançoise PRETEUX
TFAN bitstream syntax update
m16436Khaled MamouFaouzi Ghorbel
FAE software description
m16437Sehoon YeaAnthony VetroShun-ichi Sekiguchi
Verification of Test Results For Multiview Field High Profile (m16329)
m16438Yuriy A. ReznikRavi K. Chivukula
On design of transforms for high-resolution / high-performance video coding
m16439
Jeremie LecomteGuillaume FuchsMax NeuendorfRalf GeigerMarkus Multrus
Proposed improvements to WD2 of USAC
m16440Jeha RyuYeongmi Kim
Haptic Movie system and Service Scenario for MPEG-V
m16441Yuriy A. ReznikRavi K. Chivukula
Fast SBR filterbanks for AAC-ELD, HE-AAC, and USAC.
m16442Jeha RyuYeongmi Kim
Representation of Tactile Movie Control Information for MPEG-V
m16443Yuriy A. ReznikRavi K. Chivukula
On complexity of size 960 transform in AAC family of codecs
m16444Ye-Kui WangMiska Hannuksela
Comments to the MVC file format draft
m16445 Ye-Kui Wang A proposal to MVC file format
m16446Sungyong YoonHyunkook LeeYounghee Choi
Core experiment proposal on arithmetic coding
m16447Wonsuk LeeSeungyun Lee
Proposal for using the existing tools for generic metadata APIs of metadata engine on MXM APIs
m16448 Tomoyuki Yamamoto Analysis on partition and transform
40
Tomohiro Ikaiselection in the context of extended block sizes
m16449Sung-Kwan JeSang-Il NaWeon-Geun Oh
Proposal for a New Standard Item in MPEG-7 Visual descriptors, ROI Signature
m16450
Hyungyu KimSowon KimMinsoo ParkHwa Seon ShinByeongho ChoiChungku YieEuee S. Jang
MPEG-4 ASP Decoder Description and a Case Study of the Decoder Design Process in the RVC framework
m16451Shangwen LiLu Yu
delete
m16452
Hui Yong KimYong Han KimMunchrul KimHouari Sabirin
BWS Conformance file contribution and SW update for ISO/IEC 23000-9/Amd.1
m16453Hui Yong KimYong Han KimMunchurl Kim
* Withrawn *
m16454Hui Yong KimMyung Seok KiYong Han Kim
Study on ISO/IEC 23000-9 PDAM2 (DMB-AF Harmonization on MPEG-2 TS)
m16455Sung Jin HurWan Choi
A Proposal for Multimedia Application Interface for Collaborative Work
m16456Julien RobilliardMatthias NeusingerJohannes Hilpert
Fraunhofer Listening Test Results for USAC CE on Phase Coding in MPS
m16457Sung Jin HurWan Choi
Use Case of Multimedia Application Interface
m16458Laurent PrimauxOwen LagadecEmmanuel Bouix
iKlax improvement Proposal on ISO/IEC 23000-12 CD IM AF
m16459Next generation Broadcasting Forum(Korea)
Updated Text of ISO/IEC 23000-11 Stereoscopic Video AF Conformance and Reference Software
m16460Olgierd StankiewiczKrzysztof WegnerKrzysztof Klimaszewski
Additional results of Exploration Experiments in 3D Video Coding, described in w10360, for Alt Moabit sequence.
41
m16461Ruben TousJaime Delgado
Contribution of a Basic Interpreter Module to ISO/IEC 15938-12/Amd.1 MPEG Query Format Ref. Soft & Conf.
m16462Steffen KampMathias Wien
High Definition Test Sequences for High-Performance Video Coding (HVC)
m16463Steffen KampMathias Wien
AVC Anchor Streams for Evaluation of High-Performance Video Coding (HVC)
[email protected] behalf of experts from 18 bodies
Requirements Input on Pervasive AV Scene Coding
m16465
Bojan JOVESKI Mihai MITREA Pieter SIMOENS Iain-James MARSHALLFrançoise PRETEUX
BiFS-based solution and its deployment within the FP7 MobiThin project: event handling and streaming
m16466Walid HachichaKhaled MammouTitus Zaharia
Optimized implementation of the TFAN encoder
m16467 EBU via SC 29 SecretariatLiaison Statement from EBU [SC 29 N 10254]
m16468 French NB via SC 29 Secretariat French NB Contribution on JVT
m16469
Daiyong KimKyoungsoo SonSeungwook LeeBonki KooEuee S. Jang
Comparison result of conformance test on the SVN
m16470Gwo Giun (Chris) LeeHe-Yuan LinJia-Wei Liang
FU network of inverse scan, inverse quantization and inverse transform in AVC High Profile
m16471Shinya ShimizuHideaki Kimata
3DV/FTV EE Report on Doorflower sequence
m16472
Shun-ichi SekiguchiYoshihisa YamadaYoshiaki KatoKohtaro AsaiTokumichi Murakami
Information on new test material for HVC study
m16473 Shun-ichi SekiguchiKazuo SugimotoYoshihisa YamadaYoshiaki Kato
Additional coding performance evaluation of extended MB size
42
Kohtaro AsaiTokumichi Murakami
m16474Ruben TousJaime Delgado
Copy of m16461 (Contribution of a Basic Interpreter Module to ISO/IEC 15938-12/Amd.1 MPEG Query Format Ref. Soft & Conf.)
m16475
Marzia CorvagliaFabrizio GuerriniRiccardo LeonardiEliana RossiPierangelo Migliorati
A proposal for Video Signature Tool and Video Fingerprinting
m16476 DRM via SC 29 SecretariatLiaison Statement from DRM [SC 29 N 10262]
m16477 WorldDMB Forum via SC 29 SecretariatLiaison Statement from Liaison Statement from WorldDMB Forum [SC 29 N 10263]
m16478 Marius Preda Editor Comments on AFX 3rd Edition
m16479
Jungyoup YangKwanghyun WonByeungwoo JeonSu Nyeon Kim
Additional Experimental Result of MVOP with HD Sequences
m16480 Sebastian PossosEvaluation of Video Signature Based on Tomography
m16481
Alexis Michael TourapisAthanasios LeontarisPeshala PahalawattaYan Ye
JM Reference Software Enhancements
m16482
Hwa Seon ShinSung Moon ChunHyungyu KimByeongho ChoiEuee S. Jang
Status and Future Plan of Video Tool Library (VTL)
m16483David Oyarzun Jean H.A. Gelissen (editor)
Avatar Definition Markup Language
m16484 ... Withrawn
m16485 KNBKNB Comment on High Performance Video Coding
m16486Mario DoellerFlorian StegmaierGero Bäse
MPEG Query Format Semantic Requirements
43
m16487Mario DöllerFlorian StegmaierGero Bäse
Introducing Semantic Retrieval in MPQF
m16488 TTA via SC 29 Secretariat Liaison Statement from TTA
m16489 SC 34 via SC 29 Secretariat Liaison Statement from SC 34/WG 2
m16490Khaled MamouFaouzi Ghorbel
CE Report on SC3DMC: FAE versus QBCR, QBCR BP and QBCR AC
m16491 3GPP SA4 via SC 29 Secretariat Liaison Statement from 3GPP SA4
m16492 ITU-T SG 16 via SC 29 Secretariat Liaison statement from ITU-T SG 16
m16493Mark CallowItaru kanekoRichard Clark
Guide to the MPEG Subversion Repository
m16494Sebastian PossosHari Kalva
Verification of the Proposed MPEG-7 Video Signature Tools
44
Annex D – Output documents
No. Source Title
w10517 Convener List of Documents from the 88th Meeting in Maui, Hawaii, USA
w10518 Convener Resolutions of the 88th Meeting in Maui, Hawaii, USA
w10519 ConvenerList of AHGs Established at the 88th Meeting in Maui, Hawaii, USA
w10520 Convener Report of the 88th Meeting in Maui, Hawaii, USA
w10521 ConvenerGuidelines for Electronic Distribution of MPEG M and N Documents
w10522 Convener Press Release of the 88th Meeting in Maui, Hawaii, USA
w10523 Convener Meeting Notice of the 89th Meeting in London, UK
w10524 Convener Guide for WG 11 Meeting Hosts
w10525 Requirements Enhanced MPEG-7 Query Format Requirements
w10526 Requirements MPEG-V Extended Call for Proposals
w10527 Convener Adhoc on MPEG Modern Transport (MMT)
w10528 3DGCStudy Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh Compression)
w10529 3DGC Description of AFX CE and explorations
w10530 3DGC ISO/IEC 14496-16 3rd Edition
w10531 3DGC MPEG 3D Graphics Vision
w10532 3DGC MPEG 3D Graphics FAQ v22
w10533 3DGCAHG on 3DGC documents, software maintenance and core experiments
w10534 Video Defect Report on ISO/IEC 14496-2:2004
w10535 VideoStudy Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview Video Coding Conformance Testing
w10536 Video Defect Report on ISO/IEC 14496-4:2004
w10537 Video First Ideas on New MPEG-4 Video Bitstream Repository Structure
w10538 VideoStudy Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video Coding
45
w10539 Video Working Draft of Reference Software for Stereo High Profile
w10540 Video Study Text of ISO/IEC 14496-10:200X/FPDAM 1
w10541 Video Defect Report on ISO/IEC 14496-10:200X
w10542 Video WD 1.0 of 15938-3/Amd.4 Video Signature Descriptors
w10543 Video MPEG-7 Visual XM 35
w10544 VideoDescription of Core Experiments in Video Signature Description development
w10545 Video Disposition of Comments on ISO/IEC 15938-6:2003/PDAM 3
w10546 VideoText of ISO/IEC 15938-6:2003/FPDAM 3 Reference Software for Image Signature Tools
w10547 Video Disposition of Comments on ISO/IEC 15938-7:2003/PDAM 5
w10548 VideoText of ISO/IEC 15938-7:2003/FPDAM 5 Conformance Testing for Image Signature Tools
w10549 Video Disposition of Comments on ISO/IEC 15938-8:2002/PDAM 5
w10550 VideoText of ISO/IEC 15938-8:2002/DAM 5 Extraction and Matching of Image Signature Tools
w10551 Video Description of Core Experiments in RVC
w10552 Video Description of Exploration Experiments in 3D Video Coding
w10553 Video Call for Evidence on High-Performance Video Coding
w10554 Convener Liaison statement to ITU-T SG9 re 3D Video Coding
w10555 Convener Liaison statement to IEC TC100 re IEC DTS 62592
w10556 Convener Liaison statement to SC37 re Biometric Data Interchange
w10557 Convener Liaison statement to IEC TC100 re IEC CD 62087
w10558 Convener Liaison statement to CEA re 3D Video Coding
w10559 Convener Liaison statement to ITU-T SG16 re AVC Development
w10560 ConvenerAHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance
w10561 Convener AHG on Reconfigurable Video Coding
w10562 Convener AHG on MPEG-7 Visual
w10563 Convener AHG on 3D Video Coding
w10564 Convener AHG on High-Performance Video Coding
w10565 Convener AHG on AVC Development
46
w10566 Video Request for ISO/IEC 15938-3:2000/Amd.4
w10567 RequirementsRequirements v3.0 for a new BIFS profile to support Interactive Digital Radio
w10568 RequirementsCall for Proposal on additional BIFS technologies for Interactive Services for Digital Radio
w10569 Requirements Draft Advanced IPTV Terminal (AIT) Requirements
w10570 Requirements Applications and Requirements for 3DV
w10571 RequirementsWorkshop on MMT (MPEG Modern Transport) - Call for Contributions
w10572 Systems Study of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC
w10573 Systems WD 2.0 of ISO/IEC 13818-1:2007 DCOR X
w10574 Systems Text of ISO/IEC CD 14496-1 4th Edition
w10575 SystemsWD of ISO/IEC 14496-1 PDAM4 Registration Authority and systems extensions
w10576 SystemsDoC on ISO/IEC 14496-5:2001/PDAM 23 Synthesized Texture Reference SW
w10577 SystemsText of ISO/IEC 14496-5:2001/FPDAM 23 Synthesized Texture Reference SW
w10578 SystemsText of ISO/IEC 14496-5:2001/AMD14:2009/DCOR 1 OFF Ref. SW
w10579 Systems DoC on ISO/IEC 14496-12:2008/FPDAM 1 General Improvements
w10580 Systems Text of ISO/IEC 14496-12:2008/FDAM 1 General Improvements
w10581 Systems Study of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format
w10582 SystemsStudy of ISO/EC 14496-20:2008 LASeR & SAF/FPDAM 2 Adaptation
w10583 Systems DoC on ISO/EC 14496-20:2008 LASeR & SAF/PDAM 3 PMSI
w10584 Systems Text of ISO/EC 14496-20:2008 LASeR & SAF/FPDAM 3 PMSI
w10585 SystemsUpdated Workplan for service example of LASeR Adaptation & PMSI
w10586 SystemsTuC for ISO/IEC 14496-20:2000 LASeR & SAF AMD 4 Advanced User Interaction
w10587 SystemsText of ISO/IEC 14496-22:200X/PDAM 1 Support for many-to-one range mapping
w10588 SystemsWD 4.0 of ISO/IEC 15938-12:200X AMD 1 MPQF Conf. and Ref. SW
47
w10589 SystemsWD 1.0 of ISO/IEC 15938-12:200X AMD 2 Semantic Enhancement
w10590 Systems Request for ISO/IEC 21000-2 AMD 1 Presentation of Digital Item
w10591 Systems Text of ISO/IEC 21000-2 PDAM 1 Presentation of Digital Item
w10592 SystemsRequest for ISO/IEC 21000-4 AMD 2 Protection of Presentation Element
w10593 SystemsText of ISO/IEC 21000-4 PDAM 2 Protection of Presentation Element
w10594 SystemsText of ISO/IEC 23000-4 MSSAF/FDAM2 Conf. & Ref. SW for Protected MSSAF
w10595 Systems Text of ISO/IEC CD 23000-5 MSAF 2nd edition
w10596 Systems Study of ISO/IEC 23000-6 PA-AF/PDAM1 Conf. and Ref. SW
w10597 SystemsWorkplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
w10598 SystemsStudy of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
w10599 SystemsWorkplan for ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
w10600 Convener Terms of reference
w10601 Convener MPEG Standards
w10602 Convener Unpublished standards at FDIS level
w10603 Convener MPEG work plan and time line
w10604 Convener MPEG Standard Editors
w10605 Convener Schema assets updates
w10606 Convener Software assets
w10607 Convener Conformance assets
w10608 Convener Content assets
w10609 Convener URI assets
w10610 ConvenerStandards under development for which a call for patent statements is issued
w10611 Convener List of Organisations with which MPEG entertains liaisons
w10612 SystemsStudy of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage
w10613 Systems Text of ISO/IEC 23000-11 PDAM 1 Stereoscopic Vide AF Ref.
48
Soft and Conf.
w10614 SystemsWorkplan for Stereoscopic Video Application Format Ref. Soft and Conf.
w10615 Systems Study of ISO/IEC CD 23000-12 Interactive Music AF
w10616 Systems WD 2.0 of Architecture
w10617 Systems WD 2.0 of Control Information
w10618 Systems WD 2.0 of Sensory Information
w10619 Systems WD 2.0 of Avatar Information
w10620 SystemsStudy of ISO/IEC CD 23006-1 MxM Architecture and Technologies
w10621 Systems Study of ISO/IEC CD 23006-2 MXM APIs
w10622 Systems Study of ISO/IEC CD 23006-3 MXM Conf. & Ref. SW
w10623 SystemsFirst ideas on normative APIs compliant to MXM framework for future MPEG standards
w10624 Systems MXM Developer's Day: Call for Participation
w10625 Systems Study of ISO/IEC CD 29116-1 2nd edition MXM Protocols
w10626 Systems WD MPEG Rich Media UI
w10627 SystemsRequest for ISO/IEC CD 14496-22:200X AMD 1 Support for many-to-one range mapping
w10628 SystemsRequest for ISO/IEC 23000-11 AMD 1 Stereoscopic Video AF Ref. Soft and Conf.
w10629 Systems MPEG DRM Vision
w10630 Systems Guide to the MPEG Subversion Repository
w10631 Convener Liaison statement to JTC 1/SGSN
w10632 ConvenerLiaison statement to ITU-T SG 16 on rights information interoperability
w10633 ConvenerLiaison statement to IEC TC 100 on Multimedia Gateway in Home Networks
w10634 ConvenerLiaison statement to IEEE TC on Haptics on extended Call for Proposal on MPEG-V
w10635 Convener Liaison statement to IEC TC 9
w10636 Convener Liaison statement to SGDCMP on DRM technologies in MPEG
w10637 ConvenerLiaison statement to IEC TC 100 on Multimedia home server systems - Conceptual model for domain management
49
w10638 ConvenerLiaison statement to JTC 1/SC 34/WG 2 on Open Font Format reference software
w10639 ConvenerLiaison statement to IEC TC 100 on Transmission of time code in the ancillary data space
w10640 Convener Liaison statement to TTA on New BIFS Profile for Digital Radio
w10641 ConvenerLiaison statement to JTC 1/SC 34/WG 2 on media types of ISO/IEC 14496-22
w10642 Convener Liaison statement to W3 C on MPEG Rich Media UI
w10643 ConvenerLiaison statement to JTC 1 SWG-ARM on Professional Archival AF
w10644 Convener Liaison statement to EBU on MPEG URI Assets
w10645 Convener Liaison statement to W3C on MXM
w10646 Convener Liaison statement to ATSC on Carriage of SVC over MPEG-2 TS
w10647 ConvenerLiaison statement to SC6 on extended Call for Proposal on MPEG-V
w10648 ConvenerLiaison statement to SC25 on extended Call for Proposal on MPEG-V
w10649 Video Evaluation and Testing of 3D Video Coding
w10650 Audio ISO/IEC 14496-3:2009/DCOR 1:200X Byte Alignment
w10651 AudioStudy on ISO/IEC 14496-3:2009/ FPDAM 1:200x, HD-AAC Profile, MPEG Surround Signaling
w10652 Audio WD on AAC family of profiles
w10653 Audio DoC on ISO/IEC 14496-5:2001/FPDAM 24, MPEG-4 AAC ELD
w10654 Audio ISO/IEC 14496-5:2001/FDAM 24, MPEG-4 AAC ELD
w10655 Audio Request for Amendment, 14496-26:2009/PDAM 2
w10656 AudioISO/IEC 14496-26:2009/PDAM 2, BSAC Conformance for Broadcasting
w10657 AudioStudy on ISO/IEC 14496-26:2009/DCOR 1, ALS, SLS and AAC updates
w10658 Audio Study on ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections
w10659 AudioStudy on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding
w10660 Audio Status and Workplan on SAOC Core Experiments
w10661 Audio WD3 of USAC
50
w10662 Audio Workplan for USAC CEs
w10663 Audio Workplan on MPEG USAC Reference Encoder
w10664 Audio MPEG Audio CE methodology
w10665 Convener Response to IEC TC-100 on IEC CDV 62571
w10666 Convener Response to Swedish NB on 960 and 1024 block lengths
w10667 Convener AHG on Audio Standards Maintenance
w10668 Convener AHG on SAOC, USAC and MetaData
w10669 Audio MPEG Audio Test Material for Core Experiments
w10670 Systems MPEG comments on W3C Widget Recommendation
w10671 Requirements Context and Objectives for Advanced IPTV Terminal
w10672 Systems WD 1.0 of Reference Software
w10673 Systems WD 1.0 of Conformance
w10674 Systems Request for ISO/IEC 14496-22:200X AMD 1
w10675 Convener AHG on Scene Representation
w10676 Convener AHG on MPEG File Formats
w10677 Convener AHG on Application Format
w10678 Convener AHG on Font Format Representation
w10679 Convener AHG on Advanced IPTV Terminal
w10680 Convener AHG on MXM
w10681 Convener AHG on MPEG-V
w10682 Requirements LASER Requirements
w10683 Systems MXM Development Roadmap for Engines and Applications
w10684 Systems MPEG Technologies for DRM
w10685 Convener Liaison statement to UPnP on MPEG Rich Media UI
w10686 Convener Liaison statement to DLNA on MPEG Rich Media UI
w10687 Convener Template Liaison statement on Modern Media Transport workshop
w10688 Convener Template Liaison statement on MXM DevDay's
w10689 Convener Liaison statement to ITU-T Q.6/16
w10690 SystemsProposal for support of MPEG Rich Media UI in W3C Widget Recommendation
51
w10691 ConvenerLiaison statement to IEC TC 100 on rights information interoperability
w10692 Convener Response to NB comments
w10693 Convener Proposal to improve the Software Copyright Disclaimer
w10694 Convener Editors nominated for an ISO/IEC Certificate of Appreciation
52
Annex E – Requirements report
Source: Jörn Ostermann, Chair
16Requirements documents approved at this meeting
No. Title10525 Enhanced MPEG-7 Query Format Requirements10526 MPEG-V Extended Call for Proposals10527 Adhoc on MPEG Modern Transport (MMT)
10567Requirements v3.0 for a new BIFS profile to support Interactive Digital Radio
10568Call for Proposal on additional BIFS technologies for Interactive Services for Digital Radio
10569 Draft Advanced IPTV Terminal (AIT) Requirements10570 Applications and Requirements for 3DV10571 Workshop on MMT(MPEG Modern Transport) – Call for Contributions
10671 Context and Objectives for Advanced IPTV Terminal
10682 LASER Requirements
17MPEG-7 Query FormatThe current MPEG-7 query format does not allow to include semantics in the query. A new requirement (N10525) was added in order to enable semantics in queries using RDF and OWL.
18MPEG-4 VideoA request to add levels to MPEG-4 Video Simple and Advanced Simple Profiles were brought to MPEG. Especially, a request for 1080p@30/60/120 was formulated. Due to no support for new levels in the ASP profile and only one organization interested in extending the Simple Profile, no action was taken.
19LASeRCurrently, LASeR terminals are not able to read from multi touch screens or sensors like gravity, wind and motion sensors. Furthermore, haptic interfaces cannot be controlled. Since these interfaces and sensors become prevalent, a new requirement was added for LASeR (N10682).
20MPEG-V: Information exchange with virtual worldsAfter evaluating the response to the Call for Proposals (N10239), MPEG identified the need for technology in the areas of emotions of avatars, objects belonging to avatars, and haptics. Based on Requirements for MPEG-V Version 3.2 (N10498), an Extended Call for Proposals was issued (N10526).
21MAFNo input on the topic of Advanced Surveillance AF was received.
53
22MPEG-User Interface FrameworkIn response to the Call for Proposals N10232, MPEG received input from one organisation. The technical work started in the Systems subgroup.
Explorations
22.1 Interactive RadioRequirements for Interactive Radio were updated as captured in N10567 Requirements v3.0 for a new BIFS profile to support Interactive Digital Radio. Contrary to prior beliefs, it now appears that there is a need for new technology. MPEG issued a Call for Proposals at the 88th meeting (N10568) to be answered by the 89th meeting. Topics include image caching, image delivery, navigation and scene state management.
22.2 3D Video CodingFollowing the vision to enable both advanced stereoscopic display processing and improved support for auto-stereoscopic N-view displays as outlined in N10357 Vision on 3D Video Coding, MPEG aligned its applications and requirements document accordingly (N10570).
22.3 High Performance Video Coding (HVC)HVC targets mobile services, IPTV, and Ultra High Definition (UHD) displays with a focus on coding efficiency considering codec complexity as well. The current target is to increase coding efficiency by 25% at low complexity and 50% at full complexity. MPEG foresees that the reduction of complexity will be achieved by turning off some tools required to reach the full performance in terms of coding efficiency. A Call for Evidence on High Performance Video Coding (N10553) was issued.
22.4 Advanced IPTV Terminal
N10569 Draft Advanced IPTV Terminal (AIT) Requirements and N10671 Context and Objectives for Advanced IPTV Terminal was developed. Several use cases, their requirements and potential technical solutions are provided. It appears that there is a lack of technologies related to social networking and simple editing functions for audio and video.
22.5 Pervasive AV Scene CodingMPEG was informed that the performance of the state of the art video coders for certain applications like surveillance or soap operas is not sufficient. The common feature of these applications appears to be that pan/tile cameras are located at fixed locations. Currently, evidence is lacking that the performance of video codecs developed for these scenarios will outperform significantly AVC or other codecs.
22.6 MPEG Media TransportAccording to N10496 issued at the 87th meeting, there is a need for a transport and file format friendly stream format. Error resilience of current MPEG streams might not be optimal. The potential gains of joint optimization of coding and transport are not known. Conversions
54
between different transport mechanisms like from MPEG-2 Transport Stream to MPEG Program Stream are not straight forward or defined. Furthermore, MPEG does not provide any hint on how to adapt content to different networks. Contributions to this meeting verified that all the topics above should be investigated further. In order to understand the current technology used and the new technologies currently developed for future use of transporting media over networks, MPEG will host a Workshop on MPEG Media Transport (N10571) on 09/07/01 at its next meeting. Experts will be invited to present their work. An adhoc group (N10527) was established to prepare the workshop as well as to work on an applications and requirements document on MMT.
55
Annex F– Systems report
Source: Young-Kwon Lim, Chair
1 Opening of the Meeting
1.1 Approval of the agenda
1.2 Goals for the weekThe main outputs of the meeting from the Systems Sub-group perspective are:
No. Title TBP Available13818-1 MPEG-2 Systems
10572 Study of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC No 09/05/0810573 WD 2.0 of ISO/IEC 13818-1:2007 DCOR X No 09/05/08
14496-1 MPEG-4 Systems10574 WD of ISO/IEC 14496-1 4th Edition No 09/04/24
10575WD of ISO/IEC 14496-1 PDAM4 Registration Authority and systems extensions
No 09/04/24
14496-5 Reference Software
10576DoC on ISO/IEC 14496-5:2001/PDAM 23 Synthesized Texture Reference SW
No 09/04/24
10577Text of ISO/IEC 14496-5:2001/FPDAM 23 Synthesized Texture Reference SW
No 09/05/08
10578Text of ISO/IEC 14496-5:2001/AMD14:2009/DCOR 1 OFF Ref. SW
No 09/04/24
14496-12 ISO File Format10579 DoC on ISO/IEC 14496-12:2008/FPDAM 1 General Improvements No 09/04/2410580 Text of ISO/IEC 14496-12:2008/FDAM 1 General Improvements No 09/05/08
14496-15 AVC File Format10581 Study of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format No 09/05/08
14496-20 LASeR& SAF
10582Study of ISO/EC 14496-20:2008 LASeR & SAF/FPDAM 2 Adaptation
No 09/04/24
10583 DoC on ISO/EC 14496-20:2008 LASeR & SAF/PDAM 3 PMSI No 09/04/2410584 Text of ISO/EC 14496-20:2008 LASeR & SAF/FPDAM 3 PMSI No 09/04/24
10585Updated Workplan for service example of LASeR Adaptation & PMSI
No 09/04/24
10586TuC for ISO/IEC 14496-20:2000 LASeR & SAF AMD 4 Advanced User Interaction
No 09/04/24
14496-22 Open Font Format
10627Request for ISO/IEC 14496-22:200X AMD 1 Support for many-to-one range mapping
No 09/04/24
56
10587 Text of ISO/IEC 14496-22:200X/PDAM 1 Support for many-to-one range mapping
No 09/04/24
15938-12 MPEG Query Format
10588WD 4.0 of ISO/IEC 15938-12:200X AMD 1 MPQF Conf. and Ref. SW
No 09/04/24
10589WD 1.0 of ISO/IEC 15938-12:200X AMD 2 Semantic Enhancement
No 09/04/24
21000-2 Digital Item Declaration10590 Request for ISO/IEC 21000-2 AMD 1 Presentation of Digital Item No 09/04/2410591 Text of ISO/IEC 21000-2 PDAM 1 Presentation of Digital Item No 09/04/24
21000-4 IPMP Component
10592Request for ISO/IEC 21000-4 AMD 2 Protection of Presentation Element
No 09/04/24
10593Text of ISO/IEC 21000-4 PDAM 2 Protection of Presentation Element
No 09/04/24
23000-4 Musical Slide Show Application Format10594 Text of ISO/IEC 23000-4 MSSAF/FDAM2 Conf. & Ref. SW for
Protected MSSAFNo 09/04/24
23000-5 Media Streaming Application Format10595 Text of ISO/IEC CD 23000-4 MSAF 2nd edition No 09/05/08
23000-6 Professional Archival Application Format10596 Study of ISO/IEC 23000-6 PA-AF/PDAM1 Conf. and Ref. SW No 09/05/15
10597Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
No 09/04/24
23000-9 Digital Multimedia Broadcasting Application Format
10598Study of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
No 09/05/29
10599Workplan for ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
No 09/04/24
10612 Study of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage
No 09/04/24
23000-11 Stereoscopic Video Application Format
10628Request for ISO/IEC 23000-11 AMD 1 Stereoscopic Video AF Ref. Soft and Conf.
No 09/04/24
10613Text of ISO/IEC 23000-11 PDAM 1 Stereoscopic Vide AF Ref. Soft and Conf.
No 09/04/24
10614Workplan for Stereoscopic Video Application Format Ref. Soft and Conf.
No 09/04/24
23000-12 Interactive Music AF10615 Study of ISO/IEC CD 23000-12 Interactive Music AF No 09/05/29
23005 – MPEG-V Representation of Context and Control Information
10616 WD 2.0 of Architecture Yes 09/05/0810617 WD 2.0 of Control Information Yes 09/05/0810618 WD 2.0 of Sensory Information Yes 09/05/0810619 WD 2.0 of Avatar Information Yes 09/05/0810672 WD 1.0 of Reference Software No 09/04/2410673 WD 1.0 of Conformance No 09/04/24
23006 – MPEG eXtensible Middleware
57
10623First ideas on normative APIs compliant to MXM framework for future MPEG standards
No 09/04/24
10624 MXM Developer’s Day : Call for Participation Yes 09/04/2423006 -1 MXM Architecture and Technologies
10620Study of ISO/IEC CD 23006-1 MxM Architecture and Technologies
Yes 09/05/08
23006 -2 MXM Application Programming Interface10621 Study of ISO/IEC CD 23006-2 MXM APIs Yes 09/05/08
23006 -3 MXM Reference Software and Conformance10622 Study of ISO/IEC CD 23006-3 MXM Conf. & Ref. SW Yes 09/05/0810683 MXM Development Roadmap for Engines and Applications No 09/04/24
Supplemental Media Technologies – MPEG eXtensible Middleware10625 Study of ISO/IEC CD 29116-1 2nd edition MXM Protocols Yes 09/05/08
23007– MPEG-U (MPEG Rich Media UI Framework)10626 WD MPEG Rich Media UI Framework Yes 09/04/24
10690Proposal for support of MPEG Rich Media UI in WC3 Widget Recommendation
Yes 09/04/24
10670 MPEG comments on WC3 Widget Recommendation Yes 09/04/24Assets and Standing Documents
10605 MPEG Schema Assets Updates Yes 09/04/2410609 MPEG URIs and MIME Types Yes 09/04/24
SVN support10630 Guide to the MPEG Subversion Repository No 09/04/24
Liaison10631 Liaison statement to JTC 1/SGSN No 09/04/24
10632Liaison statement to ITU-T SG 16 on rights information interoperability
No 09/04/24
10691Liaison statement to IEC TC 100 on rights information interoperability
10633Liaison statement to IEC TC 100 on Multimedia Gateway in Home Networks
No 09/04/24
10635 Liaison statement to IEC TC 9 No 09/04/2410636 Liaison statement to SGDCMP on DRM technologies in MPEG No 09/04/24
10637Liaison statement to IEC TC 100 on Multimedia home server systems - Conceptual model for domain management
No 09/04/24
10638Liaison statement to JTC 1/SC 34/WG 2 on Open Font Format reference software
No 09/04/24
10639Liaison statement to IEC TC 100 on Transmission of time code in the ancillary data space
No 09/04/24
10640 Liaison statement to ITU-T SG 16 on Advanced IPTV Terminal No 09/04/24
10641Liaison statement to JTC 1/SC 34/WG 2 on media types of ISO/IEC 14496-22
No 09/04/24
10642 Liaison statement to W3 C on MPEG Rich Media UI Framework No 09/04/2410685 Liaison statement to UPnP on MPEG Rich Media UI No 09/04/2410686 Liaison statement to DLNA on MPEG Rich Media UI No 09/04/24
10643Liaison statement to JTC 1 SWG-ARM on Professional Archival AF
No 09/04/24
10644 Liaison statement to EBU on MPEG URI Assets No 09/04/2410645 Liaison statement to W3C on MXM No 09/04/24
58
10646 Liaison statement to ATSC on Carriage of SVC over MPEG-2 TS No 09/04/24
10647Liaison statement to SC6 on extended Call for Proposal on MPEG-V
No 09/04/24
10648Liaison statement to SC25 on extended Call for Proposal on MPEG-V
No 09/04/24
10634Liaison statement to IEEE TC on Haptics on extended Call for Proposal on MPEG-V
No 09/04/24
10687 Template Liaison statement on MPEG Media Transport workshop No 09/04/2410688 Template Liaison statement on MXM Developer’s Day No 09/04/24
Promotion10629 MPEG DRM Vision Yes 09/04/2410684 MPEG Technologies for DRM Yes 09/05/08
59
2 General issues
2.1 List of standards under development
Pr
Pt
Edit.
Project
Description CfP WD CD FCD FDIS
2 1 2007
AMD4
Transport of MVC 08/10
09/02
09/07
2 1 2007
CORX Miscellaneous 09/02
4 1 200x
AMD4
-RA-Use of LASeR in MPEG-2 & MPEG-4 Systems
09/07
4 4 2004
AMD37
File Format Conf. 08/04
08/10
09/02
09/07
4 5 2001
AMDxx
AVC File Format Ref. Soft TBS
4 5 2001
AMDxx
SVC File Format Ref. Soft TBS
4 5 2001
AMD23
Synth. Texture Ref. Soft 08/04
09/04
09/10
4 11
2005
AMD7
Digital Radio BIFS Profile 09/02
09/10
10/04
10/10
4 12
2008
AMD1
General Improvements 08/04
08/10
09/04
4 12
2008
COR2 Brands & box orders 08/10
09/02
4 12
2008
COR3 Minor corrections to IOS FF 09/02
09/07
4 14
2003
AMD1
Handling of MPEG-4 audio enhancement layers
09/02
09/07
10/01
4 15
2008
COR3 Minor corrections to AVC FF 08/07
09/02
4 15
200x
AMD3
MVC File Format 08/04
08/10
09/02
09/07
4 20
200X
AMD2
Scene Adaptation 08/07
09/02
09/07
4 20
200X
AMD3
PASI 08/07
08/10
09/04
09/10
4 22
2008
2nd Ed. Open Font Format 08/01
08/07
09/02
7 12
2008
COR1.
MPQF minor corrections 08/01
08/07
09/02
21
2 200x
AMDX.
PASI support 08/10
09/04
21
19
200x
1st Ed. Media Value Chain Ontology 08/07
08/10
09/04
09/10
A 4 200x
AMD2
Prot. MSSAF Conf. & Soft 08/04
08/10
09/04
60
A 5 200x
2nd Ed. MS AF 08/01
09/02
09/04
09/10
A 6 200x
AMD1
PA-AF Conf. & Ref. SW 08/07
09/02
09/07
A 8 200x
AMD1
PVP AF Soft. And Conf. TBS
A 9 200x
AMD1
DMB AF Soft. And Conf. 09/02
09/07
A 9 200x
AMD2
DMB AF MPEG-2 Storage 09/02
09/07
10/01
A 10
200x
AMD1
VS Conf. & Ref. SW 08/07
09/02
09/07
A 11
200x
1st Ed. SVAF Ref. Soft. And Conf. 09/02
A 11
200x
COR1 SVAF signaling of voice codecs 09/02
09/07
A 12
200x
1st Ed. Interactive Music AF 09/02
09/07
10/01
B 2 200x
AMD1
FRU Ref. Soft. And Conf.
E 8 200x
1st Ed. Ref. Soft. and Conformance 07/01
07/07
08/04
08/10
V 1 200x
200x Architecture 09/02
09/07
09/10
10/04
V 2 200x
200x Control Information 09/02
09/07
09/10
10/04
V 3 200x
200x Sensory Information 09/02
09/07
09/10
10/04
V 4 200x
200x Avatar Characteristics 09/02
09/07
09/10
10/04
M 1 200x
1st ed. MxM Architecture 08/07
09/02
09/10
10/04
M 2 200x
200x MxM APIs 08/07
09/02
09/10
10/04
M 3 200x
200x MxM Conf. & Ref. SW 08/07
09/02
09/10
10/04
? 1 200x
2nd Ed. MxM Protocols 09/02
09/10
10/04
U 1 200x
200x. Package, Delivery and Presentation of Widget
08/10
09/04
09/07
09/10
10/04
U 2 200x
200x Widget Communication 08/10
09/04
09/07
09/10
10/04
U 3 200x
200x Conf. & Ref. SW 08/10
09/04
09/07
09/10
10/04
61
2.2 Standing Documents
Pr Pt Documents No. Meeting1 1 MPEG-1 White Paper – Multiplex Format N7675 05/07 Nice1 1 MPEG-1 White Paper – Terminal Architecture N7676 05/07 Nice1 1 MPEG-1 White Paper – Multiplexing and
SynchronizationN7677 05/07 Nice
2 1 MPEG-2 White Paper – Multiplex Format N7678 05/07 Nice2 1 MPEG-2 White Paper – Terminal Architecture N7679 05/07 Nice2 1 MPEG-2 White Paper – Multiplexing and
SynchronizationN7680 05/07 Nice
2 11 MPEG-2 White Paper – MPEG-2 IPMP N7503 05/07 Poznan4 1 MPEG-4 White Paper – MPEG-4 Systems N7504 05/07 Poznan4 1 MPEG-4 White Paper – Terminal Architecture N7610 05/10 Nice4 1 MPEG-4 White Paper – M4MuX N7921 06/01 Bangkok4 1 MPEG-4 White Paper – OCI N8148 06/04 Montreux4 6 MPEG-4 White Paper – DMIF N8149 06/04 Montreux4 11 MPEG-4 White Paper – BIFS N7608 05/10 Nice4 12 MPEG-4 White Paper – ISO File Format N8150 06/04 Montreux4 14 MPEG-4 White Paper – MP4 File Format N7923 06/01 Bangkok4 15 MPEG-4 White Paper – AVC FF N7924 06/01 Bangkok4 13 White Paper on MPEG-4 IPMP N7505 05/07 Poznan4 13 MPEG IPMP Extensions Overview N6338 04/03 München4 17 White Paper on Streaming Text N7515 05/07 Poznan4 18 White Paper on Font Compression and Streaming N7508 05/07 Poznan4 20 Presentation Material on LASER N6969 05/01 Hong-
Kong4 20 White Paper on LASeR N7507 05/07 Poznan4 22 White Paper on Open Font Format N7519 05/07 Poznan7 1 MPEG-7 White Paper - MPEG-7 Systems N7509 05/07 Poznan7 1 MPEG-7 White Paper – Terminal Architecture N8151 06/04 Montreux21 9 MPEG-21 White Paper – MPEG-21 File Format N7925 06/01 BangkokA X MPEG Application Format Overview N9421 07/10 ShenzhenA X MAF Overview Document N9840 08/04
ArchampsA X MAF Overview Presentation N9841 08/04
ArchampsB X MPEG-B White Paper – BinXML N7922 06/01 BangkokE X MPEG Multimedia Middleware Context and
ObjectivesN6335 04/03 München
E X 1rst M3W White paper N7510 05/07 PoznanE X 2nd M3W White Paper : Architecture N8152 06/04 Montreux
62
E X Tutorial on M3W N8153 06/04 MonreuxE X M3W White Paper : Multimedia Middleware
ArchitectureN8687 06/10 Hanzhou
E X M3W White Paper : Multimedia API N8688 06/10 HanzhouE X M3W White Paper : Component Model N8689 06/10 HanzhouE X M3W White Paper : Resource and Quality
ManagementN8690 06/10 Hanzhou
E X M3W White Paper : Component Download N8691 06/10 HanzhouE X M3W White Paper : Fault Management N8692 06/10 HanzhouE X M3W White Paper : System Integrity
ManagementN8693 06/10 Hanzhou
63
2.3 Mailing Lists Reminder
Topic InformationKindly Hosted
byGeneral Systems
List
Reflector : [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/gen-sysArchive: http://lists.uni-klu.ac.at/mailman/private/gen-sys/
Klagenfurt
University
File Format
Reflector : [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sysArchive: http://lists.uni-klu.ac.at/mailman/private/mp4-sys/
Klagenfurt
University
LASeR
Reflector : [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/mpeg-laserArchive: http://lists.uni-klu.ac.at/pipermail/mpeg-laser/
Klagenfurt
University
MAFReflector : [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/maf-sysArchive: http://lists.uni-klu.ac.at/mailman/private/maf-sys/
Klagenfurt
University
ISO File Format Transpo
rt
Reflector: [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/isoff-transportArchive: http://lists.uni-klu.ac.at/mailman/private/isoff-transport/
Klagenfurt
University
AITReflector: [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/jiptvArchive: http://lists.uni-klu.ac.at/mailman/private/jiptv/
Klagenfurt
University
Metaverse
Reflector: [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/metaverseArchive: http://lists.uni-klu.ac.at/mailman/private/metaverse/
Klagenfurt
University
MXMReflector: [email protected]: http://lists.uni-klu.ac.at/mailman/listinfo/mxmArchive: http://lists.uni-klu.ac.at/mailman/listinfo/mxm
Klagenfurt
University
2.4 Demonstrations
64
3 General Input Documents
3.1 General Technical Issues
3.1.1 AHG reports
Session Number
Title Source
General m16247 Ad Hoc Group on Scene Representation Young-Kwon Lim, Jaeyeon Song, Cyril Concolato
General m16248 Ad Hoc Group on MPEG File Formats David SingerGeneral m16249 Ad Hoc Group on Application Format Kyuheon Kim,
Hui Yong Kim, Noboru Harada
General m16250 Ad Hoc Group on MPEG-V (including previous RoSE activities)
Jean Gelissen, Sanghyun Joo, Christian Timmerer
General m16251 Ad Hoc Group on Font Format Representation Vladimir Levantovsky
General m16252 Ad Hoc Group on Advanced IPTV Terminal Xin Wang, Young Kwon Lim
General m16253 Ad Hoc Group on MXM Filippo Chiariglione, Christian Timmerer, Victor Rodriguez, Marius Preda
All AHG reports are accepted.
3.1.2 Contributions
Session Number
Title Source
General m16272 Table of Replies on ISO/IEC FDIS 23004-8 ITTF via SC 29 Secretariat
General m16271 Table of Replies on ISO/IEC 23000-7:2008/FDAM 1
ITTF via SC 29 Secretariat
General m16267 Table of Replies on ISO/IEC 23000-3:2007/FDAM 1
ITTF via SC 29 Secretariat
General m16274 Table of Replies on ISO/IEC 23000-4:200X/FDAM 1
ITTF via SC 29 Secretariat
General m16275 Table of Replies on ISO/IEC FDIS 23000-6 ITTF via SC 29 Secretariat
General m16277 Table of Replies on ISO/IEC FDIS 23000-10 ITTF via SC 29 Secretariat
65
General m16270 Table of Replies on ISO/IEC 21000-8:2008/FDAM 1
ITTF via SC 29 Secretariat
General m16296 On Usage of the MPEG URI Assets Jean-Pierre EvainChristian Timmerer
General m16306 The MPEG DRM vision Leonardo Chiariglione
m16272, m16271, m16267, m16274, m16275, m16277, m16270 Ballot results are reviewed. All standards are approved.
m16296 EBU metadata specification maintenance page have recently been updated with the audio and visual profiles as defined in our URI assets. EBU would like to receive notifications concerning updates of it. Informing EBU classification schemes is extending the list of “Genre” and “Role” originally defined in MPEG-7
m16306 This document contains short introduction about IPMP as an enabler of DRM in MPEG and the list of relevant technologies. It is agreed to produce an output document describing vision based on this input contribution and another document containing detailed descriptions about each technologies separately.
3.1.3 Liaisons
Session Number
Title Source
General m16255 Liaison Statement from JTC 1/SGSN [SC 29 N 10126]
JTC 1/SGSN via SC 29 Secretariat
General m16282 Liaison Statement from ITU-T SG 16 to IEC TC 100
ITU-T SG 16 via SC 29 Secretariat
General m16287 IEC CDV 62514 [SC 29N 10185] IEC TC 100 via SC 29 Secretariat
General m1629 3 Termination of Liaison between ISO TC 46 and SC 29
SC 29 Secretariat
General m16294 Request for Internal Liaison between SC 29 and IEC TC 9
SC 29 Secretariat
General m16298 Information on JTC1 Study Group on Digital Content Management and Protection
SC29 Secretariat
General m16303 CD of IEC TS 62579 IEC TC 100 via SC 29 Secretariat
General m16304 Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N 10235]
SC 34 via SC 29 Secretariat
General m16341 IEC NP: Transmission of time code in the ancillary data space
IEC TC 100 via SC 29 Secretariat
General m16488 Liaison Statement from TTA TTA via SC 29 Secretariat
General m16489 Liaison Statement from SC 34/WG 2 SC 34 via SC 29 Secretariat
3.2 FAQThe FAQ were updated as needed.
66
3.3 AOBNone.
4 MPEG-2 Systems (13818-1)
4.1 Topics
4.1.1 ISO/IEC 13818-1:2007 AMD4 Transport of MVC
4.1.2 ISO/IEC 13818-1:2007 COR X Minor Corrections
4.2 Contributions
Session Number
Title Source
MPEG-2 m16366 Proposal to detect dependent view boundary in MVC
Teruhiko Suzuki
• The AVC base layer stream for SVC and MVC should be defined separately instead of a single definition (AVC video sub-bitstream that uses ‘either or’) to avoid confusion in the STD sections.
• We also need to clarify the difference between MVC base view only stream from a combined stream (MVC base + 1 view - called MVC base view video sub-bitstream) and how to signal this to an MVC decoder.
• Even though AVC precludes use of stream_type = 0x1F (SVC enhancement) and stream_type = 0x20 (MVC view enhancement) to enhance the same base layer stream (as the NAL unit type 14 differs between SVC and MVC) within a program (PMT or PSM), system should allow applications to use base layer specific to SVC and base layer specific to MVC within same program.
• We should add text to MVC extension descriptor saying that if this is absent for stream_type = 0x1B, then this is an MVC base layer stream with no additional views.
• Rx is not defined for enhancement layers in STD for MVC. This needs to be the re-assembled level based value if HRD is present or level based value from Annex A if HRD is absent.
• Extending the hierarchy descriptor has made it very complex as it is used for MPEG-2, SVC and MVC. In addition, it may be difficult to check backward compatibility for all cases. The suggestion is to look at using a new hierarchy descriptor for MVC and combine the MVC extension descriptor with this new descriptor.
• Re-name DRD NAL Unit to VDRD (View or Dependency Representation Delimiter) and add it for use with MVC enhancements.
• Suggestion to video group to delete use of prefix NAL unit for base layer of MVC as this allows applications to add additional views to a base layer that was deployed earlier without the prefix NUT
4.3 Action Points
67
5 MPEG-4 System (14496-1)
5.1 Topics
5.1.1 ISO/IEC 14496-1:200X AMD 4 RA and Usage of LASeR in MPEG-4 Systems
5.1.2 ISO/IEC 14496-1:200X 4th edition
5.2 Contributions
Session Number
Title Source
General m16415 Editor's Text of 14496-1 4th edition Cyril Concolato
m16415 Proposed text for new edition of ISO/IEC 14496-1. Three amendments (Profile and Level indication for Text, Profile and Level indication for 3D Compression, objectTypeIndication for JPEG200 support) and two corrigenda It is agreed to remove redundant definitions and abbreviations regarding file format. It is needed to include short description on text stream, and etc. Seo-Young Hwang is appointed as new reviewer
5.3 Action Points
68
6 MPEG-4 Conformance (14496-4)
6.1 Topics
6.1.1 ISO/IEC 14496-4:2007 AMD37 File Format Conformance
6.2 ContributionsNone
6.3 Action Points
7 MPEG-4 Reference Software (14496-5)
7.1 Topics
7.1.1 ISO/IEC 14496-5:2007 AMD 23 Sythesized Texture Reference Software
7.2 ContributionsNone
7.3 Action Points
8 MPEG-4 BIFS (14496-11)
8.1 Topics
8.1.1 ISO/IEC 14496-11:2002 AMD X Digital Radio Profile
8.2 Contributions
Session Number
Title Source
General m16465 BiFS-based solution and its deployment within the FP7 MobiThin project: event handling and streaming
Bojan JOVESKI Mihai MITREA Pieter SIMOENS Iain-James MARSHALLFrançoise PRETEUX
m16465 Not presented.
69
8.3 Action Points
9 MPEG-4 ISO Base File Format (14496-12)
9.1 14496-12/COR 2 Usage of brands and box order in sample entry
9.1.1 ISO/IEC 14496-12:2008 COR 2 Usage of brands and box order in sample entry
9.1.2 ISO/IEC 14496-12:2008 AMD 1 General improvements
9.1.3 ISO/IEC 14496-12:2008 COR 3 Minor Corrections
9.2 Contributions
Session Number
Title Source
File Format
m16263 Summary of Voting on ISO/IEC 14496-12:2008/FPDAM 1
SC 29 Secretariat
File Format
m16359 Minor or editorial errors in the Part 12 amendment
David Singer
File Format
m16367 Clarification of track header in ISO media file format
Teruhiko Suzuki
File Format
m16392 Adaptive Progressive Download Per FröjdhTorbjörn EinarssonClinton Priddle
File Format
m16393 File format sub-track selection and switching Per FröjdhAndrey NorkinClinton Priddle
9.2.1 m16263 Summary of Voting on ISO/IEC 14496-12:2008/FPDAM 1Thank you to the NBs for these detailed, helpful, comments.We also reviewed the editor’s comments, notes, and other arcana in the document.
9.2.2 m16367 Clarification of track header in ISO media file formatThank you, but we hope that this was covered in DCOR3, which is out for ballot. Comments on DCOR3 to improve its clarity would, of course, be welcome.
9.2.3 m16359 Minor or editorial errors in the Part 12 amendmentThank you, the editors will deal with these.
9.2.4 m16392 Adaptive Progressive DownloadThank you. This seems to relate to Modern Media Transport. There are some concerns here: for example, this presumes an intelligent HTTP server (really, an adaptive media server that happens to use HTTP for transport, rather than a general HTTP server). This proposal isn’t oriented towards the client choosing which parts it wants, and doing byte-range requests for the areas of the file it wants (as covered in academic papers, such as http://www.zju.edu.cn/jzus/2006/A06S1/A06S118.pdf). When doing byte-range requests, there is of course the (minor?) issue that the byte-range read by the client issues for the moov and each moof box has to be estimated. Also, under this proposal the bytes received could be
70
written as a file and constitute a valid MP4 file, whereas byte-range scatter-gather results in an incomplete (unrecordable) file, or at least a file that would need padding bytes to keep everything in the same place. Do we need to add functionality to the ISO file format for HTTP-streaming, or is it time for a stream format that is ISO file friendly (which we’ll discuss tomorrow)? 3GPP SA4 is working in this area as well; we should make sure we communicate appropriately.On this and the following contribution, some delegates asked for time to study both the use cases and the proposal, and the proponent graciously agreed to re-present these in the next meeting.
9.2.5 m16393 File format sub-track selection and switchingThis is an interesting and detailed proposal, thank you. We wonder about using multiple tracks and (for example) extractors, but the response is that a given SVC track cannot easily stop you from switching to the base layer. The proposal straddles media-specific and file-format general, in that what constitutes the sub-track needs media-specific definition. Should this be covered in e.g. the tier definitions? We wonder how often this arises in practice, and the use cases. Some asked whether SVC layer switching should be also used for program content selection switching; aren’t program selection and layer selection more properly orthogonal? Some felt that this was OK if it brought coding efficiency.On this and the preceding contribution, some delegates asked for time to study both the use cases and the proposal, and the proponent graciously agreed to re-present these in the next meeting.
9.3 Action Points
10 MPEG-4 File Format (14496-14)
10.1 Topics
10.1.1 ISO/IEC 14496-14:2003 AMD 1 Handling of MPEG-4 Audio enhancement layers
10.2 Contributions
10.3 Action Points
11 MPEG-4 AVC File Format (14496-15)
11.1 Topics
11.1.1 ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format
71
11.2 Contributions
Session Number
Title Source
File Format
m16361 Extractors in the MVC file format Anthony Vetro David Singer
File Format
m16426 On MVC File Format Miska M. Hannuksela
File Format
m16444 Comments to the MVC file format draft Ye-Kui WangMiska Hannuksela
File Format
m16445 A proposal to MVC file format Ye-Kui Wang
11.2.1 m16361 Extractors in the MVC file formatWe will try this in the study. We need to document the relationship between operating point and mvc2 tracks, and also if the mvc2 tracks are dense (in the sense that all views are decoded). We need to have three sample entry types to satisfy different usage models:
mvc1 - pure mvc data, maybe using aggregators mvc2 - mvc and extractors and aggregators mvc3 –only: extractors and aggregators, i.e. no VCL NAL units
The editors should try making a single common definition for avc2. We do indeed need an informative section on these ‘usage models’ that will attempt to relay our hard-won understanding to future readers. That text should cover something like:“The support for MVC includes a number of tools, and there are various ‘models’ of how they might be used. In particular, an MVC stream can be placed in tracks in a number of ways, among which are the following:
1. all the views in one track, labeled with sample groups;2. each view in its own track, labeled in the sample entries;3. a hybrid, one track with the whole set, and other one-view tracks for some or all of the
independent views;4. the expected operating points each in a track (e.g. the AVC base, a stereo pair, a
multiview scene).”
11.2.2 m16426 On MVC File FormatThank you. We agree with this but note that there is no 7.3.4, but the box should be in section 8. Also note that method_count == 1 means the old sample to group box should be used. Into the study.
11.2.3 m16444 Comments to the MVC file format draftThank you for the very careful detailed read and comments. We’re not sure we need to put in restrictions on the use of parameter set tracks (they are linked from the video tracks that need them, after all).We agree, the ‘text to be deleted’ should move ( at least partly) to F.1, along with the new information proposed above.The ‘scal’ track reference would be better as a track group, but it’s also better the same as SVC which we can’t change to use track groups.We should use this as the basis for the study text.
11.2.4 m16445 A proposal to MVC file formatThis allows the identification of a converted view as the needed base view; this means that that view is stored in the file twice (once as a non-base-view and once as a base view). If the
72
use case is purely for servers, we could use hint tracks with ‘fat’ hint samples – but it is not. We could use a combination of data copy-and-rewrite and extractors (or copies) to do some of this. Hm, we might not have dealt with the case that a set of tracks contain the same view coded as base and nonbase, or that that set of tracks contains several apparently base views. Nor do the view declarations in a track indicate whether a view is coded as a base.We need to label virtual base views, and make sure that either extractors point at them, or this box (or something like it), and that they are not used other than in the track they are in, or tracks that explicitly get them by extraction or by this reference. Then we can include this as well. Editors to find a way to label “in this track this is view N transcoded as a base view” in the sample entry…
11.3 Action Points
12 LASeR & SAF (14496-20)
12.1 Topics
12.1.1 ISO/IEC 14496-20:2008 AMD 2 Adaptation
12.1.2 ISO/IEC 14496-20:2008 AMD 3 PSI
12.2 Contributions
Session Number
Title Source
Scene m16283 Editor's review of 14496-20 FPDAM2 Jean Le FeuvreScene m16351 study text on LASeR PDAM3 Seo-Young
HwangScene m16408 Responses to LASeR PDAM3 Editor’s Note Jihun Cha
Injae LeeYoung-kwon LimHan-Kyu LeeJinwoo Hong
Scene m16410 KNB Comment on 14496-20 PDAM3 KNBScene m16350 user interaction method on 14496-20 Seo-Young
HwangKyungmo ParkJaeyeon SongYoung-Kwon Lim
M16283: Discussion on the SDL syntax: some feedback from Expway received. It should be BiM Compatible but there is no guarantee. The BiM experts are not interested in thoroughly checking the BiM encoding.Editor's notes:'on specific conditions' is not defined. Proposal to change it to "up to the implementation". Accepted.minSize on grouping elements. Accepted.
73
Mistakes in LASeR ML. Corrected version uploaded.We need to produce a study and have a resolution for NB to consider the study as ballot text.M16351:This contribution is not a study.Proposal of a new 'entry' attribute on the externalReference element of PSI. Notion of "entrance node" to factorize multiple accesses to the same subtree. Semantics need to be clarified to describe the behavior of nested externalReference elements and default behavior if not present. Need to restrict the value of the attribute to point to element.Question whether the entry attribute should be on the 'externalReference' element or on the 'g' element? The same question applies to the update attribute also.Attribute accepted. Discussion on the externalReference element: choices are keep it or spread its attribute on containers. If we keep the element, its content is not rendered by a regular SVG player. But it's not a problem, one can always use defs and use elements. Plus, in terms of processing it keeps the processing of all containers as is. And it keeps the LASeR and SVG specs separate. So, we resolve to keep the LASeR externalReference element and add the 'entry' attribute.
M16408:Answers to the editor's note + new proposals.Editor's Notes:- The name PSI creates confusion with MPEG-2 PSIProposal to change the activity, the name of the amendment to "Presentation and Alteration of Structured Information". Comments on the 'alteration' word. The name 'Modification' is preferred to 'Alteration'. Accepted for both the activity name and the scheme (mpeg-pmsi).
- Catch internal events and not only external resources using PSIGeneralization of the resourceUpdate event. Accepted.Comment on the difference between Structured Information and general signal (like phone call notification). We need to clarify if events like 'phone call notification' can be treated as Structured Information. If yes, the phone call notification structure should be clearly defined with a schema. The usage of PSI for events depends on the complexity of the information being carried in the event. The question is what is the threshold.We cannot remove the externalValueEvent unless we make a corrigendum. The externalValueEvent is more efficient for events carrying a single value while the proposal is able to carry complex events. The specification should have a note indicating the difference between the two types of events.The example in the presentation shows that the usage of the resourceUpdate event is not clear. Update of psi referenced text is automatic and can be changed using the update attribute. However, the event can be used to trigger script, navigation or animation, as usual.We have 2 options for the event: typed event or xmlUpdateEvent with string value. One participant prefers the XML event and the others have no objection. Resolve to use the xmlUpdateEvent.
Discussion on the clarification of what happens when the PSI scheme points to a Node Set. Resolved to use the first node in the Node Set.
- Applicable media typesThe problem is to define a fragment identifier scheme for all XML documents because the name of the scheme may be in conflict with the name of a scheme defined by the specification owning the media type of the XML document. However, this case is very unlikely. We will
74
fix the problem if it arises. We could minimize this probability by calling the scheme mpeg-pmsi(). Accepted.
Proposals:- LASeR importData elementThe problem is that text in tref cannot be edited and saved because tref is a 1.1 element and the editable attribute is a 1.2 attribute. Chances that the editable attribute is added to the tref element are high. We prefer that solution. The importData element is refused but the problem is solved.
- LASeR Drag elementDefines a new element to monitor new types of events (slide, rotations) and modifies the transform attribute of the parent. The element is very similar to the animateMotion element. We could put the proposal in a TuC for a new amd on Advanced Interactions.
M16410:KNB comment. Accepted.
M16350:Description of latest trends in user interaction: touch sensor, motion sensor, g-sensor, haptic, … Presents needs for new scene description tools to support these new interaction sensor. Proposal to start a new activity. The contribution highlights a potential relationship with MPEG-V Part 3 (RoSE) and with ISO TC159. We could liaise with TC159 to inform them about our wish to work in this area and to welcome their input/comments.Accepted. We create a TuC on Advanced User Interactions with the Drag element and the study of current trends. We welcome contributions on the requirements for this activity and on the technical elements.
12.3 Action Points
13 Open Font Format (14496-22)
13.1 Topics
13.1.1 ISO/IEC 14496-22:200X AMD 1 Full Unicode Character Repertoire Supports
13.2 Contributions.Session Numbe
rTitle Source
General m16266 Table of Replies on ISO/IEC 14496-5:2001/FDAM 14
ITTF via SC 29 Secretariat
General m16428 Proposal for a new amendment of ISO/IEC 14496-22 ?Open Font Format?
Sairus Patel Vladimir Levantovsky
75
m16266 FDAM passed final ballotm16248 Proposal to overcome 64K limit. Accepted for PDAM.
13.3 Action Points
14 MPEG Query Format (15938-12)
14.1 Topics
14.1.1 ISO/IEC 15938-12:200X AMD 1 MPQF Conf. and Ref. SW.
14.2 Contributions
Session Number
Title Source
General m16461 Contribution of a Basic Interpreter Module to ISO/IEC 15938-12/Amd.1 MPEG Query Format Ref. Soft & Conf.
Ruben TousJaime Delgado
General m16486 MPEG Query Format Semantic Requirements Mario DoellerFlorian StegmaierGero Bäse
General m16487 Introducing Semantic Retrieval in MPQF Mario DöllerFlorian StegmaierGero Bäse
m16461Proposed software for MPEG Query Format Ref. Soft. This will be integrated into the Ref. Soft..
m16486 & m16487 The integration of semantic retrieval concepts into the current version of MPQF by using basic technologies in the fields of defining and retrieving semantic information based on RDF/OWL developed by W3C is proposed. In order to facilitate semantic retrieval in MPQF only minor adaptations of the existing standard are necessary. Additionally the SPARQL query type and expressions describing semantic relations are to be included:
- adding a query type for SPARQL queries,- adding semantic expressions,- extending affected parts such as OutputDescription, QFDeclaration and- extending the Data Model.
14.3 Action Points
76
15 MPEG-21 DID (21000-2)
15.1 Topics
15.1.1 ISO/IEC 21000-2:200X AMD X PSI support
15.2 Contributions
Session Number
Title Source
General m16347 Proposal to change the title of ISO/IEC 21000-2:2005 AMD1 to ?Presentation of Digital Item?
Filippo ChiariglioneTiejun Huang
m16347
It is proposed to name the ISO/IEC 21000-2:2005 AMD1 as “Presentation of Digital Item”. Accepted.
15.3 Action Points
16 MPEG-21 IPMP Components (21000-4)
16.1 Topics
16.1.1 ISO/IEC 21000-4:200x AMD X Protection of Presentation element
16.2 Contributions
Session Number
Title Source
General m16346 Proposed text of ISO/IEC 21000-4:200x AMD1 ? Protection of Presentation element
Filippo ChiariglioneTiejun Huang
m16346Proposed IPMP DIDL element <ipmpdidl:Presentation> for Presentation element. Accepted for PDAM.
16.3 Action Points
77
17 Media Value Chain Ontology (21000-19)
17.1 Topics
17.1.1 ISO/IEC 21000-19 Media Value Chain Ontology
17.2 Contributions
Session Number
Title Source
MVCO m16354 21000-19 FCD revision Marc GauvinJaime DelgadoVictor RodriguezMiran Choi
MVCO m16355 Editors Notes on MVCO Review Marc GauvinJaime DelgadoVictor RodriguezMiran Choi
m16354 & m16355 Not presented.
17.3 Action Points
18 Musical Slide Show AF (23000-4)
18.1 Topics
18.1.1 ISO/IEC 23000-4 AMD 2 Conf. and Ref. SW for MSSAF
18.2 Contributions
Session Number
Title Source
MAF m16262 Summary of Voting on ISO/IEC 23000-4:200X/FPDAM 2 [SC 29 N 10158]
SC 29 Secretariat
m16262 No disapproval
18.3 Action Points
19 Media Streaming AF (23000-5)
19.1 Topics
19.1.1 ISO/IEC 23000-5 2nd edition
78
19.2 Contributions
Session Number
Title Source
MAF m16344 Some issues regarding the 2nd edition of Media Streaming Application Format (MSAF)
Filippo ChiariglioneTiejun Huang
MAF m16345 Benefits from the use of Presentation of Digital Items and Event Reporting in MSAF
Filippo ChiariglioneTiejun Huang
MAF m16349 Proposed text of ISO/IEC 23000-5 2nd Edition CD Media Streaming Application Format
Filippo ChiariglioneTiejun Huang
M16344: Reviewed. Provides the summary of the changes for the 2nd Edition CD.M16345: Reviewed. Proposed to put as normative references LASeR (ISO/IEC 14496-20) and Event Reporting (ISO/IEC 21000-15). It is recommended
Edition of text of ISO/IEC 23000-5 2nd Edition CD Media Streaming Application Format with LASeR (ISO/IEC 14496-20) and Event Reporting (ISO/IEC 21000-15) as references
M16349: The editorial changes have been reviewed. The text is going to be promoted.
19.3 Action Points
20 Professional Archival AF (23000-6)
20.1 Topics
20.1.1 ISO/IEC 23000-6 AMD1 PA-AF Conf. and Ref. SW
20.2 Contributions
Session Number
Title Source
MAF m16369 Proposed update to the PA-AF reference software and description of the PA-AF APIs
Noboru HaradaHouariHendryYutaka KamamotoTakehiro MoriyaMunchurl Kim
M16369: Produce the study with normative changes It is recommends
update the PA-AF reference software and description
20.3 Action Points
79
21 DMB AF (23000-9)
21.1 Topics
21.1.1 ISO/IEC 23000-9 AMD1 DMB AF Conf. and Ref. SW
21.1.2 ISO/IEC 23000-9 AMD2 DMB AF Harmonization of MPEG-2 TS Storage
21.2 Contributions
Session Number
Title Source
MAF m16452 BWS Conformance file contribution and SW update for ISO/IEC 23000-9/Amd.1
Hui Yong KimYong Han KimMunchrul KimHouari Sabirin
MAF m16454 Study on ISO/IEC 23000-9 PDAM2 (DMB-AF Harmonization on MPEG-2 TS)
Hui Yong KimMyung Seok KiYong Han Kim
M16452: Reviewed and promoted for the approvalM16454: Reviewed by FF group and promoted for the approval. It is recommended to produce Study on ISO/IEC 23000-9 PDAM2 (DMB-AF Harmonization on MPEG-2 TS) with editorial changes
21.3 Action Points
22 Video Surveillance AF (23000-10)
22.1 Topics
22.1.1 ISO/IEC 23000-10 AMD1 VS AF Conf. and Ref. SW
22.2 ContributionsNone.
22.3 Action Points
80
23 Stereoscopic Video AF (23000-11)
23.1 Topics
23.1.1 ISO/IEC 23000-11 AMD1 SV AF Conf. and Ref. SW
23.1.2 ISO/IEC 23000-11 COR1 SV AF Signaling of voice codecs
23.2 Contributions
Session Number
Title Source
MAF m16459 Updated Text of ISO/IEC 23000-11 Stereoscopic Video AF Conformance and Reference Software
Next generation Broadcasting Forum(Korea)
M16459: Reviewed and showed the demonstration of the current Ref. SW. It is recommended
To update Text of ISO/IEC 23000-11 Stereoscopic Video AF Conformance and Reference Software. To update the change of the Workplan for the Ref. SW
23.3 Action Points
24 Interactive Music AF (23000-12)
24.1 Topics
24.1.1 ISO/IEC 23000-12 Interactive Music AF
24.2 Contributions
Session Number
Title Source
MAF m16386 Proposal of dynamic preset for IM AF Inseon JangJeongil SeoHui Yong KimKyeongok KangKevin SeungChul Ham
MAF m16387 Editor's study on ISO/IEC 23000-12 CD Interactive music application format
Inseon JangHui Yong KimJeongil SeoLaurent PrimauxOwen Lagadec
81
MAF m16458 iKlax improvement Proposal on ISO/IEC 23000-12 CD IM AF
Laurent PrimauxOwen LagadecEmmanuel Bouix
M16386: Put the extra two preset functionalities such as dynamic volume preset and dynamic object preset, which provide the 3D audio effect. Accepted.
M16387: Reviewed. Annex A and B are going to be merged into one Annex due to that both Annex A and B cover a mode usage in the IM AF, which shall be reflected during the editorial period of the output document.
. Terms are changed from ‘constraint’ to ‘rule’, which shall be reflected to the output document during the editorial period. Rule checking will be revisited for clear usage in the next meeting.
M16458: . Group box modification for controlling elements in a box is accepted such as group_activation_elements_number and group_reference_volume are added.
group_activation_elements_number – is an integer that indicates the number of elements which must be switched on when the group is switched on. This number, according to the order of the elements in elements_ID, determines which elements will be switched on.group_reference_volume - is an integer that indicates the index corresponding to the reference volume gain of the group (this is the volume gain that must be set on the element when it is switched on).
. Simplification of both selection and mixing constraint are accepted.
It is recommends To produce study on ISO/IEC 23000-12 CD Interactive music application format with the above
changes
24.3 Action Points Revisit the checking rule in the next meeting
25 MPEG-V Architecture (23005-1)
25.1 Topics
25.1.1 ISO/IEC 23005-1 MPEG-V Architecture
25.2 Contributions
Session Number
Title Source
MPEG-V
m16382 Modified MPEG-V System Architecture SangHyun JooJongHyun Jang
m16382:— Proposes a new architecture (cf. Fig. 3 in the contribution)
82
— Technical inputs from sensor network group/activity will be appreciated. Proponents are kindly asked to provide such inputs.
— After intensive discussions it has been found by the group that the “blue ellipses” in the architecture needs to be updated.
25.3 Action Points
26 Control Information (23005-2)
26.1 Topics
26.1.1 ISO/IEC 23005-2 Control Information
26.2 Contributions
Session Number
Title Source
MPEG-V
m16379 Basic description languaues for Sensory Device Capabilities of MPEG-V Part2 Control Information
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
MPEG-V
m16380 Basic description languaues for Sensory Device Commands of MPEG-V Part2 Control Information
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
MPEG-V
m16381 Basic description languaues for User Sensory Preferences of MPEG-V Part2 Control Information
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
MPEG-V
m16360 Study on representation of characteristics for MPEG-V Sensory Devices
Shin-ya HasegawaYasuaki TokumoTakuya Iwanami
MPEG-V
m16368 Sensory Device Capability Metadata B.S. ChoiSangHyun Joo
MPEG-V
m16385 User Sensory Preference Metadata B. S. ChoiSang Hyun Joo
m16379, Proposed schema for device capabilities aligned with sensory effects. Schema is consisted of Base attribute and specific attributes for each effect types
m16380,
83
Proposed schema for sensory device commands aligned with each sensory effect. Schema is consisted of Base attribute and specific attributes for each effect types.
m16381Proposed schema for user sensory preferences aligned with each sensory effect. Schema is consisted of Base attribute and specific attributes for each effect types.
m16360Proposed schema for device capabilities Base attributes include minimum intensity, preparation time, and used position in addition to previous contributions. Proposed to include expected capability as a part of sensory effect metadata by using schema for device capability.
question about how to signal “preambles”question about what is an appropriate schema of “preambles”
m16368question about position : we don’t know propagation matrix of device. so, position cannot give meaningful information to RoSE engine unless there is universal physical law can be applied to this. delayTime : the time difference between the time the device started and the time when the device reaches its maximum intensity need to be harmonized with m16360minIntensity : this does not needed for some device types. so, it will not be part of base attribute but specific attributes for some device types which need this.some attributes need to improve.try to keep consistency among different devicesuse “device capability type” instead of “device type”
m16385Similar to m16368
26.3 Action Points
27 Sensory Information (23005-3)
27.1 Topics
27.1.1 ISO/IEC 23005-3 Sensory Information
27.2 Contributions
Session Number
Title Source
MPEG-V
m16300 Minor Corrections to SEDL and the Usage of Schematron for SEDL Conformance
Markus Waltl Christian Timmerer
MPEG-V
m16301 An API for Sensory Effect Metadata compliant to the MPEG Extensible Middleware (MXM)
Markus Waltl Christian
84
TimmererMPEG-
Vm16358 Study on MPEG-V Sensory Information Yasuaki Tokumo
Shin-ya HasegawaTakuya IwanamiShuichi Watanabe
MPEG-V
m16377 The modified ColorCorrection sensory effect for spatio-temporal moving regions
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
MPEG-V
m16378 Modifications of ColorCorrectionParameter Type of MPEG-V Part3 Sensory Information
Sang-Kyun KimJin-Seo KimMaeng-Sub ChoBon-Ki KooYong Soo Joo
MPEG-V
m16440 Haptic Movie system and Service Scenario for MPEG-V
Jeha RyuYeongmi Kim
MPEG-V
m16442 Representation of Tactile Movie Control Information for MPEG-V
Jeha RyuYeongmi Kim
MPEG-V
m16336 MPEG-V WD Contributions Jean H.A. Gelissen
m16300 Minor corrections accepted
min occurrence of GroupOfEffect should be 2. Add “ReferenceEffect” element to reference some effects already defined
Additional validation rules to verify schema validation accepted Schematron (ISO standard) to define computer readable description to check
additional (semantic) validation rules for schema WD for conformance
m16301MXM compliant APIs for Sensory Effect Metadata access and creation automatically
generated from schema by JAXB Proposed to define set of MXM compliant APIs as a normative part of reference
software WD for reference software
m16358modified semantics of “activate” : if it is true then previous intensity shall be kept
rejected because this can be described as a sequence of intensity changes for single effect.
modified semantics of “fade-in” and “fade-out” : fade-in shall reach its non-zero intensity and fade-out shall reach zero further study is need to agree on final semantics of “fade-in” and “fade-out”, could be included in WD as a study item
modified semantics of “autoExtraction” to represent “auto with hint” by using MPEG-7 localization scheme accepted
m16377, m16378:— Include a ReferenceRegion (similar to autoExtraction) where color correction shall be
applied: okay but may share the same syntax as for autoExtraction but with different semantics
85
— Intensity not defined but for this effect it is just 0 or 1, i.e., 0 means not applied / not activated and 1 means activated. Need to check how to implement that in the standard.
— Minor corrections to ColorCorrectionParameterType: okay!m16440, m16442:
— Addressing the sense of touch… okay, nice!— Needs to be aligned with current formatting/style of sensory information— Needs help in editing the working draft – willing to become a co-editor? Need to
check where and how to include this in the current WD.m16336:Similar to m16440/m16442 and proponents are urged to propose it as a formal response to the Call for Proposals…
27.3 Action Points Another call for proposal on missing parts such as haptic, tactile, emotions, and so on.
28 Avatar Information (23005-4)
28.1 Topics
28.1.1 ISO/IEC 23005-4 Avatar Information
28.2 Contributions
Session Number
Title Source
MPEG-V
m16425 Avatar Characteristics Blagica JovanovaMarius PredaFrançoise Preteux
MPEG-V
m16483 Avatar Definition Markup Language David Oyarzun Jean H.A. Gelissen (editor)
m16425 & m16483 reviewed by 3DG. Accepted for WD. Agreed to have further harmonization.
28.3 Action Points
86
29 MXM (23006 & 29116-1)
29.1 Topics
29.1.1 ISO/IEC 23006-1 MXM Architecture
29.1.2 ISO/IEC 23006-2 API
29.1.3 ISO/IEC 23006-3 Conf. and Ref. SW
29.1.4 ISO/IEC 29116-1 2nd edition MXM Protocols
29.2 Contributions
Session Number
Title Source
MXM m16447 Proposal for using the existing tools for generic metadata APIs of metadata engine on MXM APIs
Wonsuk LeeSeungyun Lee
MXM m16348 Proposal for MXM engine and API of Presentation of Digital Item
Filippo ChiariglioneTiejun Huang
MXM m16308 Updated DIA APIs and Implementation for MXM
Christian TimmererMichael Eberhard
MXM m16427 Integrated MXM API for 3D Graphics Ivica ArsovMarius Preda
MXM m16455 A Proposal for Multimedia Application Interface for Collaborative Work
Sung Jin HurWan Choi
MXM m16457 Use Case of Multimedia Application Interface Sung Jin HurWan Choi
m16477:— Has been reviewed during the AhG meeting on Sunday and is reflected in the AhG
report. See recommendation 5 of the AhG report.— In particular, WG11 will add to the native MPEG-7 API a generic API for media
description supporting the set of tags that is common across a number of metadata sets that is being identified by W3C Media Annotation Working Group.
— WG11 should be able to implement a significant subset of the generic API by the end of 2009.
— WG11 should send a liaison statement to W3C communicating our intention to base the generic API on the work that W3C is doing and have a first implementation by the end of 2009.
m16348:— The input has been acknowledged by WG11 and WG11 is willing to include the
proposed engines into the MXM standard.— However, the timeline proposed in 16344 lags MXM by one meeting. This means that
the two proposed engines (i.e., engines for the Presentation of Digital Item (PDI) and Protection of Presentation element (PPE)) could not be part of the MXM standard. This is not an isolated case. Therefore, the three following possibilities can be considered:
o Delay MXM FDIS by one meeting
87
o Develop the two engines as reference software of the two proposed amendments
o Add the two engines (and possibly other engines) to a new edition of MXM
m16308:— Accepted and WG11 will provide a SVN account for Klagenfurt University
m16427:— Proposes some changes to the media framework engine incorporating Graphics3D
engine— Accepted and appreciated.
Other related input documents:— m16301: What are the pros and cons regarding the policy of having MPEG reference
software in the form of an MXM engine if applicable?o Pros:
Easy development of applications that exercise the reference software because the new engine can be easily integrated with the other existing engines.
Easy reuse of the engine by other developers of MPEG standards (or even by the industry) thanks to the existence of the interfaces.
This would follow a common trend in the standardization environment. Facilitate the upgrade of service provisioning.
o Cons: Additional effort to specify and implement the API.
o Notes: Need to be clear about the normative value of the API. By developing an API the designers have to pay more attention to the
requirements of the technology.— m16313:
o The example mapping between NextShare functionalities and MXM engines is very useful and should be adopted by AIT.
o Shall the MXM standard only contain engines for MPEG technologies or shall it also contain engines for non-MPEG technologies (P2P delivery, access to payment system, social networks, …)?
— m16455, m16457: Interesting contribution as they reflect what WG11 is already undertaking as part of the MXM project. We encourage to bring details on:
o Identification of multimedia application;o Classification of multimedia application;o Multimedia application discovery;o Multimedia application instantiation;o Multimedia application access;o Multimedia application execution environment;o Multimedia application composition.
— License:o Presented in Busan and became an output document + resolution asking NBs
to respondo No input in Lausanneo License was submitted to Open Source Initiative (OSI).o Some open source extremists raised concerns.
88
o We are unable to handle the concerns because MPEG experts have no time to dedicate to the matter.
o The following two possibilities have been identified: Adopt the Mozilla license (which is open source, and the starting point
of the MXM license) for the MXM layer and reference external code. Continue adopting the MPEG copyright disclaimer
o In both cases, if the external code is from MPEG, the existing reference software maybe modified retaining the original MPEG copyright disclaimer.
o In order to keep it simple, we propose to adopt the MPEG copyright disclaimer.
— SVN account:o The MXM repository is now available. I have updated the Guide to the MPEG
Subversion Repository to include the new repository and submitted it as input document m16483 for Maui. The most important part of the update is in Section 4.1. If you want to see all the differences, use svn. You can find the document here in the repository. I have also updated the Administrators Resources at http://wg11.sc29.org/svnadmin/index.xalter.
o The URL of the repository is http://wg11.sc29.org/mxmsvn/repos. It can be browsed with a web-browser or checked-out to an SVN client. Users 'sc29wg11' and 'mxmpubro' have read-only access. MPEG-member accounts have read-write access to the main svn repository and read-only access to mxmsvn by default. Upon request member accounts can be upgraded to read-write access of mxmsvn. Non-MPEG-members have read-write access to mxmsvn and no access to svn or the MPEG committee web site.
o The public read-only account is the aforementioned 'mxmpubro' whose password is 'mpegmxmro'.
o Filippo's and Emiliano's acounts have been upgraded to have mxmsvn read-write access.
29.3 Action Points
30 MPEG Rich Media UI Framework (???)
30.1 Topics
30.1.1 Widgets
30.1.2 Communication
30.1.3 Conf. and Ref. SWTopics
30.2 Contributions
Session Number
Title Source
Scene m16412 Utilization of LASeR on Rich Media UI Framework
Jihun ChaInjae LeeYoung-kwon Lim
89
Han-Kyu LeeJinwoo Hong
Scene m16352 Response to the call for technologies for MPEG RUIF
Kyungmo ParkGiovanni GordaraCyril ConcolatoJean Le FeuvreJean-Claude Dufourd
m16352Proposal based on W3C widget. Accepted for working draft.
Overview Based on ongoing W3C recommendation
Widely accepted widget format Agnostic to scene representation format Mandating to use “zip” for “packaging” format
Extensibility for MPEG media types Extensibility for multiple transport Extensibility for non-Web domains
Widget representation format Manifest (configuration file)
metadata, localization (language), icon one scene description and associated resources an optional additional scene description and associated resources description about communication capabilities
Widget packaging format ISO Base Media File Format for media centric widgets Supporting unpackaged delivery Guidelines for streaming and broadcast environments
Widget Manager Concept of entity managing widgets Normative Behavior
Widget Packaging Formats Widget Representation Format (manifest) Widget Localization Widget Life Cycle Handling Widget Communication Handling Widget Individual Rendering
Widget Life Cycle Loading of widget
Parsing widget Showing something except simplified or full version of scene representation
Independent Activation/Deactivation of full or simplified version Independent Show/Hide of full or simplified version Dynamic binding/unbinding of external communication entities
Widget Communications external communication entity
identified by type described by “interface”
interface Demonstrations
GPAC based implementation Widget representation based on BIFS with SVG icons Pushing widget to different devices (uPnP & DLNA) Communication between devices (different representation)
Questions? What is the status of widget in W3C? WD. Three CRs are targeting June. packaging and
configuration, digital signature (informative in MPEG), and APIs and Events. What is the relationship between W3C widget and MPEG UIF? W3C widget is subset of
MPEG UIF. Do we need to mandate to use “zip”? Yes, to be compliant to “W3C widget package format”
90
Can we have multiple icons? No. But you can have multiple “localized” representation including icons.
Can we use different languages for simplified version? Yes. But it would be better to use same language for synchronization and so on.
How do we find “entry point” for unpackaged format In that case, manifest will be used as an entry point.
Is it mandatory to list all external communication required by the widget in the manifest? It is not clear because it is under draft. We could make it mandatory in MPEG.
m16412Proposal to use LASeR for UI Framework. Accepted to incoporate
30.3 Action Points
31 Exploration
31.1 AIT
31.1.1 Topics1. Advanced IPTV Terminal
31.1.2 Contributions
Session Number
Title Source
AIT m16313 Input on the Advanced IPTV Terminal (AIT) Architecture
Christian TimmererMark StuartNicola CapovillaFabrizio RovatiJari AholaNjål BorchFranc Kozamernik
AIT m16364 Proposal of the template of draft Advanced IPTV Terminal(AIT) Requirements
Kangchan LeeSeungyun Lee
AIT m16365 Proposal of conceptual diagram for advanced IPTV Terminal
Kangchan LeeSeungyun Lee
AIT m16424 Proposed Scenario and Requirements for Advanced IPTV Terminal
Truong Cong ThangYongju ChoJung Won KangJeong-Ju Yoo
AIT m16350 LASeR in IPTV Seo-Young Hwang
AIT m16288 Use Cases for Advanced IPTV Terminals Miran ChoiEuisok ChungMyung-Gil JangYunkeun Lee
91
m16288: Use Cases for Advanced IPTV TerminalIPTV Speech Interface for VOD and QA ServicesAdvertisement, what you see, what you wantOver the Language Barrier
m16424: Proposed Scenario and Requirements for Advanced IPTV TerminalAdaptive service discovery and delivery in heterogonous environments
m16364: Proposal of the template of draft Advanced IPTV Terminal (AIT) RequirementsOutline of the Requirements documentNeed to link User Cases with Requirements
m16365: Proposal of conceptual diagram for advanced IPTV TerminalArchitectural context diagram for IPTV terminal
m16313: Input on the Advanced IPTV Terminal (AIT) ArchitectureNextShare from P2P-next.orgNew architectural functionalities: p2p?
m16327 : Realization of MANE (Media Aware Network Element)New transport layer format and functionality Requirements Group
31.1.3 Action Points
31.2 MMT
31.2.1 Topics Transport- and file format friendly stream format Cross layer optimization between video and transport layer Error resilience for MPEG streams Conversion between transport mechanisms Content adaptation to different networks
31.2.2 Contributions
Session Number
Title Source
MMT m16327 For realization of MANE Doug Young SuhYongju Cho
MMT m16353 Environments on MMT Jaeyeon songMMT m16392 Adaptive Progressive Download Per Fröjdh
Torbjörn EinarssonClinton Priddle
MMT m16362 On Modern Media Transport (MMT) David SingerMMT m16307 On MPEG Modern Transport over Network Christian
TimmererMichael EberhardIngo KoflerRobert Kuschnig Michael Ransburg Michael Sablatschan
92
Hermann Hellwagner
m16307 File Format
P2P traffic is increasing but no SDO feeling responsible. (IETF is considering PPSP as an exploration activity)
De-facto standard such as BitTorrent does not support P2P (video) streaming. MPEG-21 DID provides support for P2P systems e.g., fragment identifier can be
used to refer parts of content which can be used to integrate one file at the receiver side.
Propose to start working towards a file format explicitly support P2P streaming and associated metadata
Cross-Layer Optimizations Preferred approached to achieve cross-layer design is jointly optimizing
parameters at each layers. ENTHRONE cross-layer model describes the relationship between QoS metrics
at different levels and Cross-Laer adaptation decision-taking engine solves optimization problem.
Propose to improve metadata to describe cross-layer model to close interoperability gaps
Context and Content aware Networks context describes the environment and metadata describe content mismatch between context and content may require adaptation Not only media aware clients but also media aware network are becoming more
and more important Propose to work on generic interface between the coding layer and delivery layer
m16327 AVC and SVC standards assumed media aware network element deciding discard or
forwarding packets. MANE has two roles such as identification of each packet for extraction and context
aware adaptation
m16353Broadcasting and mobile convergence supporting of generic mechanism to identify
streams when hand-over between networks is happenedQoE more metadata to support better QoE
m16362Use cases and environment
one-to-one over IP is predominant “punch-in” is needed to for live stream RTP-UDP and MPEG-2 TS over UDP both contain a “blast and hope”
assumption while TCP aware congestion Supporting content protection is necessary Metadata is becoming increasingly important
m16392HTTP streaming with MP4 files requires new mechanism for live & adaptive service
scenario
93
The past and the future of multimedia transport On the current multimedia transport : m16353, m16362, m16392 On the emerging transport : m16307, m16327
31.2.3 Action Points
32 Liaison
Cf. Liaison output.
33 Resolutions of Systems
Cf. WG11 resolution.
34 Action Plan
N° What Who When Status Trace
1.
94
Annex G – Video report
Source: Jens Ohm and Gary Sullivan, Chairs
1 MPEG-4 Part 2 Video
The video and requirements subgroups received a request for new levels 7 (matching 1080/30p), 8 (1080/60p) and 9 (1080/120p) for Simple Profile and levels 6 (720/60p) as well as 7-9 for Advanced Simple Profile. The NB of US also gave support for levels 7&8 in Simple, whereas the engagement for the remaining requested profiles was less evident. Video and Requirements requested for more evidence about the use cases, as well as more company support before any action could be taken in these matters.
Two more issues were raised on 14496-2, which are– Divergence between software and text in vertical field MV conversion. Probably, most
current implementations have followed the software definition on this (diverging from the text part in both Momusys and Microsoft software), which is also implemented in the current conformance streams. The recommendation is to change the text according to the software operation.
– Clarification of scaling in quarter-pel MC where the operation is currently only expressed as textual description; it is recommended to add pseudo code similar to the style in other parts of the text.
A defect report was issued on this to allow further study on the best solutions, and further improve the proposed text changes when a corrigendum is issued.
Another defect report was issued on 14496-4 video conformance, where a report was made about illegal level definitions in three Simple Profile streams. In addition, it is planned to perform an action of re-structuring the video conformance (organize the sets of streams by profiles/levels instead of the number of amendment by which they were first defined). A document which formulates some first ideas about such a possible new structure was issued (N10537).
Documents reviewed:m16284 MPEG4 Simple Profile, Levels 7, 8, 9 and MPEG4 Advanced Simple
Profile, Levels 6, 7, 8, 9Madhukar BudagaviMinhua Zhou
m16285 Proposed text changes for ISO/IEC 14496-2 (MPEG-4 part 2) Minhua ZhouMadhukar Budagavi
m16291 USNB Contribution: Request for new levels Andy Tescher for USNB
Documents approved:No. Title TBP Available10534 Defect Report on ISO/IEC 14496-2:2004 N 09/04/2410536 Defect Report on ISO/IEC 14496-4:2004 N 09/04/2410537 First Ideas on New MPEG-4 Video Bitstream Repository
StructureN 09/04/24
2 Development of AVC
95
The video subgroup and the Sunday AHG on AVC development discussed the issues related to the ongoing development of AVC amendments and the corrigendum work related to fifth edition. The most relevant work items in this context were – Study of FPDAM1 (Constrained Baseline, SEI): Some minor changes related to the frame
packing arrangement SEI message: Clarification of relationship with aspect ratio, introduction of a row interleaving method
– Stereo High Profile: This new profile, which restricts MVC to 2 views but includes interlaced, is strongly supported by 5 companies. Software implementation was done in both JM and MVC reference software, and consistency was shown by cross-checks. In the case off usage for stereo progressive, it is a subset of multiview high profile (using same profile_idc, and introducing constrained_set5_flag for signaling usage of interlaced). The Stereo High Profile is also included in the Study of FPDAM1, and NBs are asked to comment on this. This includes the option of not promoting it into FDAM by July, and start a separate amendment on Stereo High. The video subgroup however recommends to issue this new profile as early as possible, such that possibly a new edition of AVC could be issued after the finalization of Amd.1 and Cor.1.
– New version of defect report (planning DCOR1 by July) was also issued. This now also includes a number of corrigendum items related to non-MVC parts of edition 5. Some of the issues that were resolved as a result of the discussion are also explained in the VCEG liaison N10559, but more effort and discussion with VCEG will be necessary until the DCOR can be issued.
– Study documents were issued for the FPDAMs on MVC software and conformance (both intended tom reach FDAM status by July)
– Integration of AVC related code bases was further discussed. From the implementation of the new Stereo High Profile in both JM and JMVC, it appears realistic that the two code bases could be unified. It must however carefully be checked whether the memory management of JM is sufficient for the purpose of a full MVC (Multiview high with larger number of views) implementation. Unification of JM and JSVM would also be desirable by the longer term, but so far, no effort was made on exploring this possibility.
Documents reviewed:m16320 On 2D + Depth SEI Message
Follows JVT-AD017 (JVT Geneva Jan 09), now joint proposal of Thomson and Philips.Supports side-by-side of video-plus-depth (both at same resolution), note: It is certainly not necessary here to allow both options, i.e. it can always be assumed that the image is left and depth is right. stereo-plus depth (left/right images and depth maps at top/bottom of 4-split frame), and LDV (same as before, but foreground/background and their depth maps at top/bottom.Would be useful to combine it with horizontal squeezing as in the current SEI message from FPDAM1.This could be signaled by sample aspect ratio. Clarification of sample aspect ratio needed there (e.g. horizontal squeezing side by side 2:1, vertical squeezing 1:2), also for the current SEI message in FPDAM1.No support to the proposal than by the proponents.
Dong TianPo-Lin LaiFons BrulsLincoln LoboWiebe de Haan
m16329 Reference Software And Test Results For Multiview Field High ProfileImplementation for stereo interlaced MVC in both JM and MVC reference software. Results brought for three sequences (Trapeze, Tunnel SD, BMX HD). BR reduction between 3 and 10 percent compared to simulcast, between 5 and 25 compared to MVC frame-based coding (field based coding was not tested).Software needs more improvement, proponents would do that.Potentially this could still be included in the FDAM of MVC software. Still to be decided whether the track of JM or MVC ref software should be followed. (recommendation)
ChongSoon LimSteffen WittmannTakahiro Nishi
m16330 Contribution of Stereoscopic Test sequenceBMX sequence: Copyright conditions provided.
ChongSoon LimSteffen WittmannTakahiro Nishi
m16331 Standardization of a new MVC profile as specified in Working Draft 1 ChongSoon Lim
96
of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High Profile (N10344)Support by 5 companies. No change relative to the previous WD. Agreement that this shall be done. More discussion needed whether it could still be included in Amd.1 (then FDAM either July or October) as a Study of FPDAM, or whether this should become a separate amendment (then PDAM now or July, FPDAM October, FDAM 10/04)
Steffen WittmannTakahiro NishiAjay LuthraAnthony VetroShun-ichi SekiguchiShinya ShimizuStéphane Pateux
m16437 Verification of Test Results For Multiview Field High Profile (m16329)Confirm that results of M16329 match.
Sehoon YeaAnthony VetroShun-ichi Sekiguchi
m16481 JM Reference Software EnhancementsAdopt this, i.e. Karsten Sühring will publish as JM16 after appropriate checking.
Alexis Michael TourapisAthanasios LeontarisPeshala PahalawattaYan Ye
m16492 Liaison statement from ITU-T SG 16Technical: Various issues related to AVC corrigendum. Issue of Stereo High Profile: How to make this (in progressive mode) compatible with Multiview High
ITU-T SG 16 via SC 29 Secretariat
Documents approved:No. Title TBP Available10540 Study Text of ISO/IEC 14496-10:200X/FPDAM 1 N 09/04/2410541 Defect Report on ISO/IEC 14496-10:200X N 09/05/3110535 Study Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview
Video Coding Conformance TestingN 09/05/15
10538 Study Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video Coding
N 09/05/15
10539 Working Draft of Reference Software for Stereo High Profile N 09/04/2410559 Liaison statement to ITU-T SG16 Q.6/16 re AVC Development N 09/04/24
3 MPEG-7 Visual
3.1 MPEG-7 Visual related workThe MPEG-7 breakout group was active during the whole week. Input documents related to the Visual part in 15938-3 are listed in the table below.
m16279 Summary of Voting on ISO/IEC 15938-6:2003/PDAM 3 SC 29 Secretariat
m16280 Summary of Voting on ISO/IEC 15938-7:2003/PDAM 5 SC 29 Secretariat
m16281 Summary of Voting on ISO/IEC TR 15938-8:2002/PDAM 5 SC 29 Secretariat
m16335 Proposal for a complementary evaluation process of MPEG-7 Video Signature Tool.
Marzia CorvagliaRiccardo Leonardi
m16370 Proposal for the MPEG-7 Visual Extension, VCE-x: the ROI Signature
Sung-Kwan JeSang-Il NaWeon-Geun Oh
m16371 Preliminary feasibility test result for MPEG-7 Visual Extension, VCE-x : the ROI signature
Sung-Kwan JeWeon-Geun OhSang-Il NaWon-Keun Yang
m16414 Response to the Core Experiments on Video Signature Tools Weon-Geun OhJu-Kyong JinSang-il NaHae-Kwang KimDong-Seok Jeong
m16416 Cross verification result for ETRI VCE-7 proposal Min-Jeong LeeHeung-Kyu Lee
m16429 A Video Signature based on Robust Region Detectors Aritz Sanchezsebastian.gerke Patrick.Ndjiki-Nya
m16430 Proposal on MPEG-7 Video Signature Tools Paul BrasnettKota IwamotoStavros PaschalakisRyoma Oami
97
Miroslaw Boberm16449 Proposal for a New Standard Item in MPEG-7 Visual descriptors,
ROI SignatureSung-Kwan JeSang-Il NaWeon-Geun Oh
m16475 A proposal for Video Signature Tool and Video Fingerprinting Marzia CorvagliaFabrizio GuerriniRiccardo LeonardiEliana RossiPierangelo Migliorati
m16480 Evaluation of Video Signature Based on Tomography Sebastian Possos
The main activity during the week was the evaluation of the results of Core Experiments. Unfortunately, there was some inconsistency due to the late detection of cases where the used video decoder crashed and produced black frames with an information message, which in one class of experiments could bias the results. Some of the experiments needed to be re-done, such that results partially arrived only after the official document update deadline. Even though, only M16414 and M16430 were able to provide complete CE results during the meeting; other parties were not able to finalize their computations (for which the flaw described above did not play any role). From the results available, better performance was observed from the contribution M16430. In fact, this is a merging of two original CfP proposals, using a ternary frame signature from regions of interest, from which “words” and "bags of words" (histograms) of the signature values are constructed. It was therefore decided to adopt this method for WD, and include the related matching procedures in the XM. As in the case of Image Signature Descriptor, the method is closely coupled to the underlying image processing algorithm, such that extraction needs to be specified as a normative part. It is further recommended to shift the issuing of PDAM4 into the July meeting, which still will allow to produce FPDAM by October.
Software, conformance and matching tools for the Image Signature Descriptor were also progressed as the respective FPDAM / DAM texts. With regard to the conformance, it is raised in one NB comment that the aspect of normative extraction method should have implication on the conformance definition (e.g. allowing a certain margin of divergence when extracting the descriptor from image data), which will be considered in the FPDAM text.
3.2 Output documents related to MPEG-7 Visual
No. Title TBP Available15938-3 Visual
10566 Request for 15938-3:2000/Amd.4 N 09/04/2410542 WD 1.0 of 15938-3/Amd.4 Video Signature Descriptors N 09/05/0810543 MPEG-7 Visual XM 35 N 09/05/1510544 Description of Core Experiments in Video Signature Description
developmentN 09/04/24
10545 Disposition of Comments on ISO/IEC 15938-6:2003/PDAM 3 N 09/04/2410546 Text of ISO/IEC 15938-6:2003/FPDAM 3 Reference Software
for Image Signature ToolsN 09/05/08
10547 Disposition of Comments on ISO/IEC 15938-7:2003/PDAM 5 N 09/04/2410548 Text of ISO/IEC 15938-7:2003/FPDAM 5 Conformance Testing
for Image Signature ToolsN 09/06/12
10549 Disposition of Comments on ISO/IEC 15938-8:2002/PDAM 5 N 09/04/2410550 Text of ISO/IEC 15938-8:2002/DAM 5 Extraction and Matching
of Image Signature ToolsN 09/05/08
98
4 23002 MPEG-C Video Technologies
4.1 23001-4 and 23002-4 Reconfigurable Video Coding (RVC)
4.1.1 General status of work
The two parts related to RVC (ISO/IEC 23001-4 Codec Configuration Representation in MPEG-B and ISO/IEC FCD 23002-4 Video Tool Library in MPEG-C) were progressed into FDIS status in Lausanne, but some final editing work was dedicated in Maui, in particular to guarantee a 1:1 matching between the textual description in the FDIS and the software implementation in Amd.1 which together with the conformance gives the “anchor” reference of normative FU input/output behavior. In particular, a number of inconsistencies were detected in the implementation of the AVC BP de-blocking filter, which needs to be split into two functional units to provide correct operation and timing.
Regarding conformance definitions in PDAM1, intense discussions were performed during the week and it was concluded that the currently defined conformance at the FU level is not sufficient to describe the normative behaviour, but interoperation of FUs, synchronization and timing is potentially even more relevant. It would however be too early to include any draft text on this in a Study of PDAM. Instead, a Core Experiment was started, which is expected to bring more clarification about viable solutions.
It is still the understanding that RVC is still in "phase 1", which is re-implementation of existing decoder conformance points
• MPEG-4 SP and MPEG-4 AVC CBP in FDIS• MPEG-2 MP, MPEG-4 ASP, AVC HP and SVC to follow in Amd.2.
Due to ongoing investigation about best way of defining CABAC related parsers in the RVC/BSDL context, as well as due to the time that needed to be spent on the finalization of text and software/conformance issues, the progress on Amd.2 is slower than expected, such that issuing the PDAM will certainly be delayed by at least one more meeting cycle.
"Phase 2" could open up more options that were previously discussed in the RVC context, in particular
• Possible simplification of standards development by adding new FUs. Whereas this approach sounds attractive, it appears clear that a real simplification (by adding / exchanging single FU modules) would only be given for cases where a preceding standard is fully implemented in RVC schema – it needs to be clarified whether RVC-based encoders are necessary in this context as well. First investigations on this were reported and will be continued in Core Experiments.
• downloadable (on the fly) decoder solutionsMore of this will be discussed and provided in the expected update of the vision document, which will be discussed in the AHG reflector.
The main activity (besides discussion of vision) until next meeting will be the conduction of Core Experiments related to– RVC Encoding Tools– RVC Conformance Testing
(will be updated after 4 weeks)– Re-usability of tokens and parametrization– RVC-BSDL extension, in particular with relation to CABAC
99
4.2 Assignment of output documents & editors Editors
Documents EditorsRVC Core Experiments Kazuo-san
Output documentsNo. Title P.A
.Editing period
Description of Core Experiments in RVC
N 4 weeks
4.3 Review of input contributions On-going issues
Doc. No.
Category Title Authors
m16332
MPEG-BVideo
An efficient dataflow design for implementing MPEG-4 AVC decoder in the RVC framework
Jérôme Gorin, Mickaël Raulet
Notes “a new way to test the efficiency and the reliability of the CAL tools, especially Synthesizing tools, by modelling one of the most complex decoder provided by MPEG”
Deblocking filter needs to be checked. Bitstream parser needs verified. What is described in VTL FDIS is explained in detail as a report. [Recommendation] Offline meeting (Tuesday Afternoon) to
solve the deblocking filter problem.m16450
MPEG-CVideo
MPEG-4 ASP Decoder Description and a Case Study of the Decoder Design Process in the RVC framework
Hyungyu Kim, Sowon Kim, Minsoo ParkHwa Seon Shin, Byeongho Choi, Chungku Yie, Euee S. Jang
Notes ECMAScript-based ASP parser description is proposed. Design process including conformance testing is proposed. [Recommendation] Include the proposed design process into
PDAM1 (Study)m16482
MPEG-CVideo
Status and Future Plan of Video Tool Library (VTL)
Hwa Seon Shin, Sung Moon Chun, Hyungyu Kim, Byeongho Choi, Euee S. Jang
Notes Status of the current VTL descriptions of FU is given. [Recommendation] Start a CE on Token definition (HS,
Christophe, Mikael, Chris)m16470
AllGeneral/All
FU network of inverse scan, inverse quantization and inverse transform in AVC High Profile
Gwo Giun (Chris) Lee, He-Yuan Lin, Jia-Wei Liang, Ming-Jiun Wang
Notes [Recommendation] conduct crosschecking (HYU) and accept it in VTL (WD).
m16333
FU parameterization and FU code generation
M. Raulet, M. Wiplize and J. W. Janneck
Notes A report on duplication and parameterization of FUs
100
[Recommendation] Start a CE on the reusability of token and parameterization
Core Experiment on Encoding ToolsDoc. No.
Category
Title Authors
m16398
MPEG-CVideo
AVC Entropy Coding and Bitstream Generation for the MPEG RVC Encoding Tools
Hussein Aman-Allah, Ihab Amer, Marco Mattavelli
Notes Outline of AVC Entropy coding modules in high level. Status report [Recommendation] the encoding tools should demonstrate the
efficiency and usefulness in a generic way to be applicable to many applications.
[Recommendation] update CE description to specify the objectives of the work. (Marco)
m16409
MPEG-CVideo
Advances in the CE: Development of RVC Encoding Tools
Ihab Amer, Marco Mattavelli
Notes Status reportm16413
MPEG-CVideo
AVC Intra Prediction, Transform and Quantization for the MPEG RVC Encoding Tools
Karim Maarouf, Ihab Amer, Marco Mattavelli
Notes Outline of AVC intra prediction, transform and quantization is given.
Status reportm16431
MPEG-CVideo
AVC Inter-frame Prediction for the MPEG RVC Encoding Tools
Ehab Asaad Hanna, Ihab Amer, Marco Mattavelli
Notes - Status report
RVC for GraphicsDoc. No.
Category
Title Authors
m16403
MPEG-CVideo
A study on RVC based Graphics codec
Seungwook Lee, Bonki Koo, Kyoungsoo Son, Daiyong Kim, Mingxiao Chen, Euee S. Jang
4.4 Output Document Processing4.4.1 CE description update (Editor: Kazuo-san)
- CE 3 (RVC Encoding tools) update Update on the objective of this CE will be integrated.
- CE 4 (RVC Conformance Testing) Existing two codecs (MPEG-4 SP, MPEG-4 AVC ConstrainedBP) will be checked
first during the conformance testing CE till the next meeting. - CE 5 (Reusability of tokens and parameterization)
Update of the CE description discussed and updated- CE 6 (RVC-BSDL extensions)
Update of the CE description discussed and updated
101
4.5 RVC-related schedule of the 88th meeting
Day Topic Room Time Status
Monday Video Plenary (planning of this week)
Video 14 - 15 DONE
RVC time allocation Wilcox 16 -1615 DONEReview of input contributions (on-going issues)
Wilcox 1615 – 1930
DONE
RVC output decision Wilcox 1730 - 18
DONE
Tuesday RVC Conformance Wilcox 930 – 12 DONEJoint meeting with 3DG on RGC
3DG (Pioneer)
14 - 15 DONE
Core Experiments (Encoding Tools)
Wilcox 15 – 17 DONE
RVC Conformance (cont’d) Wilcox 17 – 1720
DONE
Review of Input contribution (M16333)
Wilcox 1720 - 1640
DONE
Wednesday
CE update review Wilcox 1430 – 1530
DONE
RVC Conformance (cont’d) Wilcox 1530 – 17
DONE
RVC Vision Wilcox 17 - 18 DONEThursday CE update review Wilcox 13 –
1530DONE
Friday Video Plenary Video DONE
Output Document:
No. Title TBP Available10551 Description of Core Experiments in RVC N 09/04/24
5 Explorations – 3D Video
The goal of 3D video is to generate interpolated views from available videos of multiview camera configurations. The target application is mostly seen for upcoming generations of various (auto-) stereoscopic displays, either requiring multiple views internally or providing means for baseline adjustment. In the new format, only a low number (1) of video sequences shall be transmitted, but rendering of additional views would be enabled by associated depth information.
In the exploration experiments preceding the Maui meeting, further progress was made in improving (automatic) depth estimation and view rendering / interpolation. Two approaches were investigated, namely layered depth video (including background video plus depth) and stereo/multiview plus depth, both of which seem to be conceptually working and suitable for producing anchors in an upcoming CfP. However, according to the subjective viewing
102
judgements that were again performed in Maui, deficiencies in synthesis quality are mainly caused by false depth estimation.
Following the Call for test material and depth maps that had been issued in Lausanne, responses were received which in particular provide hand-generated depth maps either for sequences from the existing test set, or along with new sequences (2). Even though more effrt seems to be necessary in this, satisfactory view synthesis results are now achieved for the cases of 8 sequences in total (previously only 3), including cases of high depth variation and extremely localized content for which automatic algorithms typically fail. As also the depth maps look more smooth and natural, it is concluded that for the first time it is reasonable to perform the second step in anchor preparation for a CfP, which is coding experiments, using AVC/MVC for both video and depth map coding. Main achievements in this activity have been review of technical developments from the exploration experiments, further clarification about the vision, applications and requirements, and planning of next steps. The planned exploration experiments are therefore solely designed for the purpose of CfP preparation:– EE1: Depth map generation; goal: Improving semi-automatic methods, get good/better
depth maps for LDV rendering and multi-view based rendering– EE4: Coding experiments; goal: Find best data rate points (video, depth) for upcoming
CfP
If the results are satisfying, a draft CfP could be issued in July. To further progress on this, a document “Evaluation and Testing of 3D Video Coding” (N10649) was produced, which– Describes viewing methodologies for stereoscopic and autostereoscopic displays, and an
expert viewing method– Has the goal to get a "measurable" video quality on a more granular scale (not binary or
ternary decision as in the viewing performed so far for the purpose of evaluating the performance of depth estimation and view synthesis algorithms).
Further, the document “Applications and Requirements of 3D Video Coding” was slightly updated (N10570).
Documents reviewed in AHG (see AHG report)m16295 Revisions of Applications & Requirements on 3D Video Coding Anthony Vetro
m16317 Results of Exploration Experiments in 3D Video Coding for Dog Data Set
Yongzhe WangPhilipp MerkleKarsten Müller
m16318 3DTV Exploration Experiments on Pantomime data set Ivana RadulovicPer Fröjdh
m16319 MPEG 3DV EEs on Leaving_Laptop Po-Lin LaiDong TianPatrick LopezPaul Kerbiriou
m16326 3DV results on Dog sequence Carmen CHENGYan HUOYu LIU
m16328 Results of Exploration Experiments in 3D Video Coding, described in w10360, for Alt Moabit sequence.
Olgierd StankiewiczKrzysztof WegnerKrzysztof Klimaszewski
m16356 Report of 3D/FTV Exploration Experiment with Champagne Tower
Takanori SenohKenji YamamotoRyutaro Oi Tomoyuki MishinaMakoto Okui
m16357 Proposal of Depth Map Estimation Method in Response to Call for 3D Test Material: Depth Maps & Supplementary Information
Takanori Senoh Kenji Yamamoto Ryutaro Oi Tomoyuki Mishina Makoto Okui
m16388 3DV EE1 & EE2 Results on Newspaper sequence Seok Lee
103
Jaejoon LeeIlsoon LimJin Young LeeHo-Cheon WeyDu-Sik Park
m16389 Error-resilient Free-viewpoint Image Generation for FTV Masayuki TanimotoToshiaki FujiiMehrdad Panahpour Tehrani Hisayoshi Furihata Menno Wildeboer
m16390 Depth Estimation Reference Software (DERS) 3.0 Masayuki TanimotoToshiaki Fujii Mehrdad Panahpour Tehrani Kazuyoshi SuzukiMenno Wildeboer
m16391 Semi-automatic Depth Estimation for FTV Masayuki TanimotoToshiaki FujiiMehrdad Panahpour TehraniNorishige FukushimaKazuyoshi SuzukiMenno Wildeboer
m16394 EE1: Results of Depth Estimation on 'Pantomime? Sequence Cheon LeeYo-Sung Ho
m16395 EE2: Results of View Synthesis on 'Pantomime? Sequence Cheon LeeJiho ParkYo-Sung Ho1
m16396 3-D Test Sequence - Multiview Video and Depth Map Eun-Kyung LeeYo-Sung Ho
m16400 3DV EE3 on Champagne_tower sequences Yin ZhaoLu Yu
m16405 LDV Reference Software for View Synthesis Yin ZhaoLu YuFons BrulsLincoln Lobo
m16406 3DV/FTV EE results of depth estimation and view synthesis on "lovebird1" sequence
Gun BangGi Mun UmNamho HurJinwoong Kim
m16407 PSPNR measurement for evaluating view synthesis quality Yin ZhaoLu Yu
m16411 Depth Estimation algorithm in SADERS1.0 Gun BangJaeho LeeNamho HurJinwoong Kim
m16417 Improving view synthesis results based on depth quality measure
Jaewon SungByeong-Moon Jeon
m16418 3DV EE results on Newspaper sequence Jaewon SungByeong-Moon Jeon
m16419 Philips response to new Call for 3DV Test Material: Arrive book & Mobile
fons brulsrene klein gunnewiekpatrick van de walle
m16420 Philips & Zhejiang Uni response to new Call for 3DV Test Material: Champagne Tower
fons brulsrene klein gunnewiekpatrick van de walleYin ZhaoLu Yu
m16421 Philips (in coop with 3D4YOU) response to new Call for 3DV Test Material: Beergarden
fons brulsrene klein gunnewiekpatrick van de wallerene van de vleuten
m16422 Philips 1st 3DV synthesis results using new test material for Arrive book, Mobile, Beergarden & Champagne Tower
fons brulsrene klein gunnewiekpatrick van de walle
m16423 Philips 3DV EE results fons bruls
m16432 Results of Exploration Experiments in 3D Video for Lovebird2 Sehoon YeaZafer AricanAnthony Vetro
m16460 Additional results of Exploration Experiments in 3D Video Coding, described in w10360, for Alt Moabit sequence.
Olgierd StankiewiczKrzysztof WegnerKrzysztof Klimaszewski
m16471 3DV/FTV EE Report on Doorflower sequence Shinya ShimizuHideaki Kimata
104
Output documents:No. Title TBP Available10570 Applications and Requirements of 3D Video Coding N 09/04/2410552 Description of Exploration Experiments in 3D Video Coding N 09/04/2410649 Evaluation and Testing of 3D Video Coding N 09/04/24
6 Explorations – High-Performance Video Coding
A large quantity of video material is already distributed in digital over broadcast channels, digital networks and packaged media. More and more of this material will be distributed with increased resolution and quality demand. Technology evolution will soon make possible the capture and display of video material with a quantum leap in quality (temporal and spatial resolution, color fidelity, amplitude resolution). Networks are already finding it difficult to carry HDTV resolution and data rates economically to the end user. Therefore, further data rate increase will put additional pressure on the networks. Therefore a new generation of video compression technology that has sufficiently higher compression capability than the existing AVC standard in its best configuration (the High Profile), is needed. A study has been started on the feasibility of HVC, which is mainly intended for high quality applications, and a Call for Evidence is under preparation.
Before the Maui meeting, AVC anchors that shall be compared against prospective new technologies were provided as announced in the Draft Call for Evidence (see m16462, m16463). First investigations on the anchors were made (see m16375, m16463) and continued during the week. As some of the AVC anchors already seemed to be at subjectively transparent quality, it was decided to adapt the QP values for some of the test cases. Further, as one input contribution reported that the search ranges of motion estimation might be insufficient for some of the cases, the corresponding parameters were also changed. One contribution for an additional set of test sequences (1920x1080, 24fps, 10 bit RGB) was received (m16472). Usage conditions in the context of standards development are clarified. After further review of those sequences, including initial results of AVC anchor encoding, three additional sequences were selected for the class B (1080 HD) category. It was further decided to remove the previous sequences (same as class A UHD but downsampled, which did not seem to give relevant additional information) from the test set, and retain the corresponding sequences only fo the class A case. Further, for classes A and C, the sequences “Crowd Run” and “Mobisode1” were removed, because they seemed to be redundant in terms of noise and scene characteristics with “Park Joy” and “Mobisode2”, respectively. It is out of question, however, though the current set of sequences may be appropriate for the ongoing Call for Evidence for an initial guess on existence of improved compression technology, it is certainly not good enough for a CfP and following standards development. In particular, material from state-of-the-art 720/60p and 1080/60p cameras, as well as more diversity in the category A (UHD) would be needed.
After the testing cases are settled, the Call for Evidence was issued with responses expected for July. This includes description of the expert viewing methodology which will be applied in London.
6 input documents were registered that present or discuss technical approaches of improved-compression video coding (m16372, m16438, m16448, m16451, m16473, m16479). Comments are included in the list below.
105
m16372 High Definition Test Sequences for High-Performance Video Coding (HVC)Documentation how the test sequences were produced (including cropping and downsampling)
Tomonobu YoshinoSei NaitoShigeyuki Sakazawa
m16375 AVC Anchor Streams for Evaluation of High-Performance Video Coding (HVC)Documentation how the AVC anchors were encoded. Some initial findings: QP22 may be too close to transparent for UHD cases; People on Street 5 sec may be too short; motion range may be too low for Park Joy; for WQVGA, difficult to see coding artifacts in case of Keiba, coding artifacts difficult to see in some cases due to high speed of global motion. The two Mobisode sequences may be redundant in terms of their properties.
Keiichi ChonoHirofumi AokiJunji Tajime Yuzo Senda
m16438 Requests on Coding conditions in Call for Evidence on High-Performance Video Coding (HVC)Significant reduction of rate if motion vector range is increased for Keiba. Same may be true for Park Joy.Further discussion on settings after visual review of the results.
Yuriy A. ReznikRavi K. Chivukula
m16448 Information on new test material for HVC studyCOSME 1920x1080/24p 4:4:4 sequence, provided under clear copyright conditions for development of standards; to be reviewed during the week.
Tomoyuki YamamotoTomohiro Ikai
m16462 Performance Evaluation of Spatially Adaptive Macroblock Size Selection Scheme for HVC Test SequencesRefers to previous contribution M16082 where fixed (larger) MB size was used. Now, local update of MB size within frames. “Control unit” 64x64, which can be subdivided into 32x32 or 16x16 MBs. Tested on UHD (2560x1600) test sequences from CfE. Most effect in Park Joy (approx. 30% average bitrate reduction). Other sequences small or no improvement. Only IPPP investigated. Results were made with different search range for each of the different MB sizes. Note: Current document is without PSNR figures. Will be updated.
Steffen KampMathias Wien
m16463 On design of transforms for high-resolution / high-performance video codingTo be presented later during the week.
Steffen KampMathias Wien
m16472 Analysis on partition and transform selection in the context of extended block sizesMB sizes up to 64x64; transform size up to 16x16 (also 16x8 and 8x16; in some cases 16x1 is used). Larger partitions are typically more often used when QP becomes higher. Based on statistics, certain transform sizes are disabled in combination with certain MB sizes. Only IPPP coding. 0.3% average bitrate reduction, but is is claimed that complexity is reduced.
Yoshihisa YamadaYoshiaki KatoKohtaro AsaiTokumichi Murakami
m16473 Additional coding performance evaluation of extended MB sizeExtended MB size to maximum 32x32, motion partitioning 32x16 and 16x32 additionally. P picture only. For UHD sequences, relevant gain is observed only for traffic (13%), other sequences low gain, on average around 3-4%. Claimed that 2 out of 4 sequences are film-scan sources that may prevent good inter coding. Gain for other 1080p and 720p sequences higher (10-15% average).
Kazuo SugimotoYoshihisa YamadaYoshiaki KatoKohtaro AsaiTokumichi Murakami
m16479 Additional Experimental Result of MVOP with HD SequencesMotion vector coding with optimum predictor: New results for 720p and 1080p sequences. Decoder decides on optimum motion vector to be used for prediction, but flag is used to signal that the median should be used instead. 5 candidates (3 spatial neighbors, co-located MB in previous frame, median). Template matching is used to determine the optimum. Experiments with only CAVLC, IPPP. Average for 720p 8.7%, for 1080p 7.8% (much less for Crowd Run and Park Joy). Usually, the performance goes up with increased spatial resolution.
Jungyoup YangKwanghyun WonByeungwoo JeonSu Nyeon Kim
106
Output documents:No. Title TBP Available10553 Call for Evidence on High-Performance Video Coding Y 09/04/24
107
Annex H – Audio report
Source: Schuyler Quackenbush, Chair, Audio Subgroup
1 Opening of the meeting....................................................................................................1072 Administrative matters.....................................................................................................107
2.1 Communications from the Chair 1072.2 Approval of agenda and allocation of contributions 1072.3 Creation of Task Groups 1072.4 Approval of previous meeting report 1072.5 Review of AHG reports 1072.6 Joint meetings 1072.7 Received National Body Comments and Liaison matters 1072.8 Plenary Discussion 108
2.8.1 Profiles and 960/1024 transform lengths................................................................1082.8.2 Miscellaneous.........................................................................................................108
3 Record of AhG meetings..................................................................................................1083.1 AhG Meeting on SAOC and USAC Sunday 1000-1700 108
3.1.1 SAOC.....................................................................................................................1083.1.2 USAC.....................................................................................................................110
4 Task group activities........................................................................................................1134.1 Joint meetings 1134.2 Task Group discussions 113
4.2.1 MPEG-2, MPEG-4, MPEG-26 Audio, Audio Conformance, reference software. 1134.2.2 MPEG-D Spatial Audio Object Coding.................................................................1144.2.3 MPEG-D Unified Speech and Audio.....................................................................1144.2.4 Exploration: Meta-Data..........................................................................................118
5 Audio closing plenary discussions...................................................................................1186 Meeting deliverables........................................................................................................118
6.1 Responses to Liaison and NB comments 1186.2 Recommendations for final plenary 1186.3 Establishment of Ad-hoc Groups 1196.4 Approval of output documents 1196.5 Press statement 119
7 Future activities................................................................................................................1197.1 Schedule of future meetings 1197.2 Agenda for next meeting 1197.3 All other business 1197.4 Closing of the meeting 119
Annex A Participants..........................................................................................................120Annex B Audio Contributions and Schedule.....................................................................121Annex C Task Groups........................................................................................................125Annex D Output Documents..............................................................................................126Annex E Agenda for the 89th MPEG Audio Meeting........................................................127
108
109
1 Opening of the meeting
The MPEG Audio Subgroup meeting was held during the 88th meeting of WG11, April 20-24, 2009, Maui, HI, USA. The list of participants is given in Annex A.
2 Administrative matters
2.1 Communications from the Chair
The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for discussion in Audio plenary.
2.2 Approval of agenda and allocation of contributions
The agenda and schedule for the meeting was discussed, edited and approved. It shows the documents contributed to this meeting and presented to the Audio Subgroup, either in the task groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems to the attention of the group. It was revised in the course of the week to reflect the progress of the meeting, and the final version is shown in Annex B.
2.3 Creation of Task Groups
Task groups were convened for the duration of the MPEG meeting, as shown in Annex C. Results of task group activities are reported below.
2.4 Approval of previous meeting report
The 87th Audio Subgroup meeting report was registered as a contribution, and was approved.
2.5 Review of AHG reports
There were no requests to review any of the AHG reports.
2.6 Joint meetings
There were no joint meetings.Groups What Where Day Time
2.7 Received National Body Comments and Liaison matters
The NB Comments and Liaison documents for the meeting that require a response are as shown below.No. From Title Response
m16297Swedish NB via SC 29 Secretariat
Swedish NB comment in response to Resolution 3.1.2 in N10312
Kjörling
m16343 IEC TC 100 IEC CDV 62571: Digital Audiobook File Format and Player Requirements [IEC 100/1543/CDV]
Quackenbush
M16467 EBU Liaison Statement from EBU [SC 29 N 10254]
No response
m16476 DRM Audio Liaison Statement from DRM [SC 29 N 10255]
No response
m16477 WorldDMB Forum
Audio Liaison Statement from WorldDMB Forum [SC 29 N 10256]
No response
m16491 3GPP Liaison Statement from 3GPP SA4 No response
110
2.8 Plenary Discussion
2.8.1 Profiles and 960/1024 transform lengths
Kristofer Kjörling, Dolby, presented
m16297Swedish NB comment in response to Resolution 3.1.2 in N10312
Swedish NB via SC 29 Secretariat
The group then reviewed the following Liaison contributions:m16467 EBU Liaison Statement from EBU [SC 29 N 10254]m16476 DRM Audio Liaison Statement from DRM [SC 29 N 10255]m16477 WorldDMB
ForumAudio Liaison Statement from WorldDMB Forum [SC 29 N 10256]
M16491 3GPP Liaison Statement from 3GPP SA4The Liaisons can be summarized as:EBU requires that 960 continue to be supported in the MPEG specifications. The best long-term solution would to be for devices to support both 960 and 1024. However, they concede that since many devices implement only 1024, a restricted set of profiles might be appropriate.DRM specifies MPEG-4 AAC with SBR and PS with 960 in combination with MPEG-4 Error Resilience bitstream format. The best long-term solution would to be for devices to support both 960 and 1024.WorldDMB notes that one of its systems specifies HE-AAC V2 at 960 while another specifies HE-AAC V2 at 1024. The best long-term solution would to be for devices to support both 960 and 1024.3GPP only mandates 1024 in their aacPlus specifications. They are not impacted by whether or not 960 is mandated in an MPEG profile. DiscussionYuriy Resnik, Qualcomm, noted that the Liaison referenced only one of several 3GPP specifications. One proposal for going forward is to:
No longer mandate support for 960 in AAC, HE-AAC and HE-AAC V2 profiles. Create a new profile “AAC Grand Alliance” that is a single profile that mandates
AAC, SBR and PS tools that operate at 960 and 1024 transform lengths.
Ralph Sperschneider, FhG, noted that the latter profile is exactly the HE-AAC V2 profile. Yuriy Reznik, Qualcomm, noted that 3GPP2 references MPEG-4 AAC profile and MPEG-4 HE-AAC profile (3gpp2 C.S0046-0 and C.S0045-A).
2.8.2 Miscellaneous
The Chair raised the following issues in order to assess interest and to schedule task group break-outs for later in the week:
SAOC timetable Normative start-up and shut-down of audio coders, e.g. AAC, HE-AAC as described in N8837. SVN repository for audio
3 Record of AhG meetings
3.1 AhG Meeting on SAOC and USAC Sunday 1000-1700
3.1.1 SAOC
Oliver Hellmuth, FhG, presented
m16310Clarifications regarding the enhanced Karaoke/Solo processing mode
Leonid TerentievJürgen HerreCornelia FalchOliver Hellmuth
111
This contribution presents information requested by LG at the 87th meeting. The points discussed were:
Combining regular mode and Enhanced Karaoke/Solo mode (EKS). Decoder processing in regular vs. EKS mode (wrt prediction or energy based
decoding). Mapping of residual signals to objects.
The Chair noted that language clarifying the topic of the last bullet item should be included into the FCD text.Jeongil Seo, ETRI, asked a number of questions aimed at better understanding the limitations of the current specification. This discussion will be continued via e-mail since one FhG expert, Leonid Terentiev, was unable to attend this meeting.Heiko Purnhagen, Dolby, presented
m16309 Report on corrections for the MPEG SAOC FCD text
Jonas EngdegårdHeiko PurnhagenCornelia FalchLeonid TerentievAndreas HölzerOliver HellmuthJohannes HilpertJeroen Koppens
This contribution presented a number of editorial corrections or technical clarifications to the FCD text. In addition, it proposes to draft changes to MPEG-2 AAC and MPEG-4 Audio and MPEG Surround (on carriage of SAOC in the core coder bitstreams or in MPEG Surround bitstreams) for use as amendments to MPEG-2 AAC, MPEG-4 Audio and MPEG Surround.It was the consensus of the group to put the editorial corrections or technical clarifications into the FCD text and, secondly, to draft text for use in amending MPEG-2 AAC, MPEG-4 Audio and MPEG Surround.Jeongil Seo, ETRI, presented
m16363 Test Sequence Proposal for SAOC Verification TestJeongil SeoKyeongok KangKevin SeungChul Ham
This contribution makes available to MPEG for use in the Verification Test of SAOC several items from the Korean Music 2.0 project. These will be made available to the Verification Test Encoding Administrator to construct Reference rendering and SAOC rendering waveforms.Juergen Herre, FhG, presentedm16384 Details of NB Position on SAOC Ballot Juergen Herre for GNB
This contribution gives details on the German NB position on the SAOC FCD ballot (which closed March 14, 2009). The ballot comment requests the usual correction of errors and clarification of text, plus requests that a number of technical changes be incorporated into the SAOC FDIS. These are
Definition of profiles and levels Control of requests for “extreme” rendering that may result in artefacts in the
processed signal. Expand the specification to support low-delay encoding multiple objects that are
rendered as a multichannel signal.
The contribution further indicates technical changes that are required, editorial changes that are recommended and technical changes that are desirable.The Chair noted that transport of SAOC in base audio coder bitstreams is covered in the previous contribution. The Chair also noted that it would be very desirable to have the results of the formal Verification Test available at the same MPEG meeting that the FDIS text is approved.Jeongil Seo, ETRI, acknowledged that there is additional work to be done, but that more that two meetings delay impact the personal music mix applications for SAOC in the marketplace.
112
The Chair requested that a workplan be drafted for approval at this meeting that organizes the first components of the work requested by the German NB.
3.1.2 USAC
Max Neuendorf, FhG, presented
m16324 Comments on new USAC reference bitstreamsMax NeuendorfMarkus Multrus
This contribution reported on the results of integrating the new arithmetic codebooks into the proponent USAC encoder and the RM decoder (denoted here as the Reference Quality System, or RQS). It notes that, at the previous MPEG meeting, the CE demonstrated a bitsavings, when averaged over all signals, due to using the new arithmetic coding tables. When this bitsavings was fed back into the bit reservoir, the wLPT tool was selected more often that TCX tool wrt the previous arithmetic tables. Hence it was not possible to maintain a bit-identical decoded waveform in the RQS. The Chair asked whether, in creating a lossless decoding, the new tables strictly obeyed the bit buffer requirements. Max Neuendorf confirmed that this was the case.Eunmi Oh, Samsung, noted that informal listening done at Samsung suggested that some items sounded worse as a result of this change. Herve Taddei, Huawei Technologies, noted that listening tests in their lab also concluded that the new tables resulted in a slight degradation in audio quality. It was agreed that Samsung and Huawei will give details on what test items were judged to be degraded. The Chair requested that FhG report at the next MPEG meeting any new information or insight gained in the use of the new arithmetic tables.Max Neuendorf, FhG, presented
m16323Report on Merge of sys2 Technology into RM0: SBR Improvements
Max NeuendorfTaejin Lee
The contribution reported on incorporating technology from Sys2 into the RM. Listening tests were presented that showed that the performance of the new RQS had a higher mean score than the RM0 system, but not better at the 95% level of significance. However, the enhancements related only to the encoder implementation. There was considerable discussion as to how such enhancements would be incorporated into the RQS. The Chair noted that what was of paramount importance was simply that all CE proponents have access to the same RQS.Taejin Lee, ETRI, presented
m16383Report on Merge of sys2 Technology into RM0: TCX Improvements
Taejin LeeMax Neuendorf
This contribution proposes to change the window shape for frames using TCX encoding. It presented listening test results for the proposed changes, in which a test over 6 items showed better performance wrt RM0. It was noted that, for these items and at 12 kb/s mono, the encoder was modified such that only TCX mode was used.ETRI proposes to do additional work and bring a complete CE proposal to the next MPEG meeting. Chair noted that the CE process requires an independent listening test report cross-check. The Chair further noted that ETRI has the burden of bringing evidence that is sufficiently compelling on the merit of their technology, and that they should use this week to understand the concerns of the group.Heiko Purnhagen, Dolby, presented
m16312Dolby Listening Test Results for USAC CE on Phase Coding in MPS
Heiko PurnhagenKristofer Kjörling
The contribution presented listening test results at 32 kb/s stereo which showed a tendency in the mean for the performance of RM+CE to be better than the performance of RM. The presenter suggested that more conclusive results could be achieved if the data from all test results were pooled.Werner Oomen, Philips, presentedm16339 Philips Listening Test Results for USAC CE on Werner Oomen
113
Phase Coding in MPS Jeroen Koppens
The contribution presented listening test results at 24 kb/s stereo. The results showed a tendency in the mean for the performance of RM+CE to be better than the performance of RM. The presenter also suggested that more conclusive results could be achieved if the data from all test results were pooled.Markus Multrus, FhG, presented
m16456Fraunhofer Listening Test Results for USAC CE on Phase Coding in MPS
Julien RobilliardMatthias NeusingerJohannes Hilpert
The listening test showed results at 24 kb/s and 32 kb/s stereo. The results showed that the mean performance of RM+CE is better than the mean performance of RM at the 95% level of confidence when averaged over all test items. When looking at each test items, none performed worse and some performed better at the 95% level of confidence.Eunmi Oh, Samsung, presented
m16374Report on Phase Coding in MPEG Surround for USAC
JungHoe KimJulien RobilliardEunmi OhBernhard Grill
The contribution described the proposed technology and presented listening test data. A summary of the technology follows:
can select fine or coarse phase quantization tables decoder applies smoothing to unwrapped phase and uses linear interpolation in
magnitude/phase domainThe technology provides the following performance:
at 32 kb/s the IPD rate is 0.479 kb/s at 24 kb/s the IPD rate is 0.271 kb/s
The Samsung listening test result showed clear improvement for several items at the 95% level of significance for both bit rates and a clear improvement when scores for all items are pooled together.Kristofer Kjörling, Dolby, expressed some concern with the details of the decoding process, specifically that the phase smoothing component of the proposed technology was not discussed in previous contributions. Heiko Purnhagen, Dolby, and Werver Oomen, Philips, expressed similar concerns. The Chair asked Samsung, Dolby and Philips experts to have an off-line discussion and report back to the group on how best to proceed.Heiko Purnhagen, Dolby, presented
m16311Dolby Listening Test Results for USAC CE on AVQ-based LPC
Heiko PurnhagenKristofer Kjörling
The contribution presented listening test results at 16 kb/s mono that showed no statistically significant differences between RM and RM+CE at the 95% level of significance. Markus Multrus, FhG, presented
m16322Fraunhofer IIS Listeningtest Results on USAC CE for AVQ-based LPC Quantizer
Markus MultrusRalf Geiger
The contribution presented listening test results at 16 kb/s mono. The test included a 7.0 kHz LPF anchor and the two codecs comprising the VC (i.e. AMR-WB+ and HE-AAC) and showed no statistically significant differences between RM and RM+CE at the 95% level of significance.Philippe Gournay, VoiceAge, presented
m16316 CE Report on LPC Quantization for USAC
Philippe GournayBruno BessetteRoch LefebvreRedwan Salami
114
The contribution reviewed the CE technology, which is to replace the RM0 quantizer (based on trained codebooks) with an algebraic vector quantizer (AVQ). The advantages of AVQ are:
it uses less ROM (19456 32-bit words for RM0 vs 4096 32-bit words (first stage) and 1150 16-bit words (for the AVQ quantizer).
permits better control of spectral distortion (i.e. fewer outliers)
The contribution presented objective data on spectral distortion for the AVQ coded excitation. The AVQ quantizer showed an order of magnitude fewer outliers (more that 1.5 dB spectral distortion) than the RM0 trained VQ quantizer.It is the consensus of the group to incorporate this technology into the USAC WD.Philippe Gournay, VoiceAge, presented
m16325VoiceAge Test Report for USAC CE on Unvoiced Coding
Philippe GournayRoch Lefebvre
The contribution reports listening test results for two operating modes: 12 kb/s mono and 16 kb/s mono.Analysis of difference scores indicates that, at 12 kb/s mono, one test item is worse for the CE technology (at the 95 % level of significance), while at 16 kb/s there is no statistical differences. Single-sided tests of the differences between scores showed that for one item (es01) at 12 kb/s mono the hypothesis that the mean of the two systems were the same was rejected. The presenter acknowledged that the proposed technology is able to save bits, which is quite valuable. However, it may be that the hypothesis of modelling unvoiced speech by linear filtered Gaussian noise is not appropriate.Eunmi Oh, Samsung, presented
m16373 Report on Unvoiced Speech Coding for USACHosang SungEunmi OhMiyoung Kim
The contribution presented the proposed CE technology, which consists of Gaussian codebook of excitation vectors and gain factor for Low Energy segments
(LEN) Gaussian codebook of excitation vectors, gain factor and an additional LP filter for
Unvoiced segments (UV)
It reviewed the bit savings possible by using the CE technology UV 408 bits saved per superframe (as compared to LPD mode) LEN 88 bits saved per superframe (as compared to LPD mode)
The contribution noted that the experiment could product bit-identical results, (except for UV and LEN coded segments), but because saved bits were fed back to the bit reservoir, the entire decoded waveform was different. It showed that the fraction of frames that are UV mode are a significant minority, with LEN quite a bit less. At 12 kb/s, the bitrate savings was as large as 0.92 kb/s, with 5 items larger than 0.5 kb/s savings. At 16 kb/s, the bitrate savings was as large as 1.15 kb/s, with 5 items larger than 0.5 kb/s savings.It presented a listening test results for 12 kb/s mono and 16 kb/s mono. At 12 kb/s an analysis of difference between RM and RM+CE, RM+CE showed better performance for two items, at the 95% level of significance. At 16 kb/s an analysis of difference between RM and RM+CE, RM+CE showed better performance for one items, at the 95% level of significance. The presenter noted that these results are quite different from those in the previous contribution. Samsung and VoiceAge will investigate the reason for these differences and report back at the next meeting.Samsung plans to bring more information to the next meeting that will make the CE a complete proposal and cross-check.
4 Task group activities115
4.1 Joint meetings
There were none.
4.2 Task Group discussions
4.2.1 MPEG-2, MPEG-4, MPEG-26 Audio, Audio Conformance, reference software
Yuriy A. Reznik, Qualcomm, presented
m16441Fast SBR filterbanks for AAC-ELD, HE-AAC, and USAC.
Yuriy A. ReznikRavi K. Chivukula
This contribution presents fast implementations for SBR filterbanks. Qualcomm will make software for the implementations available to MPEG.
Yuriy A. Reznik, Qualcomm, presented
m16443On complexity of size 960 transform in AAC and related codecs
Yuriy A. ReznikRavi K. Chivukula
This contribution presents information on the complexity of 960 length MDCT (as compared to 1024 length transform). The contribution concludes that there exist quite fast forms of a 960 length MDCT such that, when considering a count of multiplies and adds, the 960 length is less complex than the 1024 length by nearly a factor of 2.
Ralph Sperschneider, FhG, presentedm16299 Defect report on ISO/IEC14496-26 [email protected]
This contribution presents two issues that need correction in the new MPEG-4 part 26, Audio Conformance: al15 should not have an independently switched coupling channel (but it does). It is proposed to remove this
sequence.
For the PNS conformance test procedure, the analysis window must be made dependent on the frame length. This change impacts both the standard text which describes the conformance test procedure and the related conformance testing software.
It was the consensus of the Audio Subgroup to issue a Study on 14496-26/DCOR 1 that contains these changes.
960/1024 frame length discussionKristofer Kjörling, Dolby, presented
Define a new profile “Common AAC Profile” (which is equivalent to what is currently HE-AAC v2)
For three existing profiles (…). “Streams conforming to this profile shall not used the 960 length MDCT transform. For maximum interoperability decoders should support the Common AAC Profile in preference to this profile.
The Chair will seek guidance from the Convenor as to whether the planned actions are appropriate. Kristofer Kjörling, Dolby, will bring additional information concerning which fielded products explicitly claim conformance to one of the family of MPEG-4 AAC profiles and the whether the fielded products using AAC or HE-AAC technology support both block lengths. Andreas Schneider, Dolby, presented
m16315proposed clarification on byte alignments in LOAS streams
Toshiyuki Nomura
The contribution notes that MPEG-4 defines two functions for byte alignment: ByteAlign() byte_alignment()
It is very desirable that there be only one function for byte alignment that it is unambiguous which bit is the reference bit for byte alignment that what is specified in the textual description is the same as what is implemented in the reference source code
The contribution proposes to issue a DCOR against MPEG-4 Audio to byte align to the first bit in the raw data block use only one function to do so
The contribution proposes to issue a DCOR against MPEG-4 Reference Software to align software to text
It was the consensus of the Audio Subgroup to DCOR against MPEG-4 Audio and to incorporate any needed changes to MPEG-4 Reference Software into ISO/IEC 14496-5:2001/FDAM 24, MPEG-4 AAC ELD.
116
4.2.2 MPEG-D Spatial Audio Object Coding
It was confirmed that a response to the German NB could be to incorporate the requested technology into SAOC over the course of the next two MPEG meetings and to re-issue the SAOC FCD shortly after the 89th MPEG meeting.
4.2.3 MPEG-D Unified Speech and Audio
Markus Multrus, FhG, presented
m16321Proposed Additions to and Corrections of the USAC Reference Software
Stefan BayerMarkus Multrus
This contribution proposes to add a software implementation of the Time Warped MDCT module for the encoder. As a second point, it identifies a bug in
the buffer reset/update flag TW-MDCT incorrect delay handling
These bugs have no impact on the RM0 bitstreams since RM0 never used these modes.Finally, it notes that there are currently two MDCT implementations in the RM0 code base. It proposes to merge these two such that only one code base is used.It was the consensus of the Audio Subgroup to accept these corrections and additions into the RM reference software.The Chair presentedm16434 Draft Revised Audio CE Methodology Schuyler Quackenbush
The Chair highlighted areas of the document that need study and input from the group. It was agreed in principal that it is of paramount importance that the revised document serve USAC, but that it also be as generic as possible while at the same time carry forward the minimum of “special case conditions” for old work items (e.g. lossless coding).Kristofer Kjörling, Dolby, presented
m16314Progress report on harmonic transposer CE for the USAC work item
Kristofer KjörlingMax Neuendorf
The contribution reports on status of this CE. The proponents expect to have a complete CE submitted to the next MPEG meeting. Kei Kikuiri, NTT DoCoMo, presented
m16397Core Experiment Proposal on the eSBR module of USAC
Kei KikuiriKousuke TsujinoNobuhiko Naka
This contribution describes a CE to add a new tool to the eSBR module. It notes that, currently, the eSBR module can only adjust the temporal enveolope in the SBR sample domain with a granularity of 2 subband samples. This is not as fine a granularity as is provided in the LP time-domain coding tool and may be an issue when coding speech signals.What is proposed is enhanced Temporal Envelope Shape, similar to what is present in MPEG Surround. A listening test shows the performance of the new tool. It showed improvement for 1 of 4 speech items in the test at the 95% level of significance. It reports that the TES tool requires an additional 2 bits per SBR envelope per channel (approximately 90 bps), and has some modest increase in complexity. The control of TES was done as a stand-alone module that operated on the RM0 bitstreams. The Chair asked the group which operating modes and test items should be used to assess the performance of this CE. The Chair asked for which test items was Beta different from zero (i.e. the TES tool was active). Hyunkook Lee, LG, presented
m16446 Core experiment proposal on arithmetic codingSungyong YoonHyunkook LeeYounghee Choi
117
This contribution proposes a CE in which the USAC global gain and differential scale factors are coded using arithmetic entropy coding rather than Huffman entropy coding. It reported an average bitrate savings of 0.39 percent.The Chair asked what fraction of the reported gain is due to compressing the 8-bit PCM global gain. A question was raised as to the corpus that was used to train the arithmetic coder. The Chair suggested that, in the workplan, that the CE proponent supplies a corpus of material to the RM proponent to encode and then supply the resulting bitstreams to the CE proponent. Kristofer Kjörling, Dolby, noted that there is potentially room for major improvements in the signal processing in USAC. The proposal will be discussed again during the week when additional information is available (i.e. 1) exclude global gain and 2) reset between coding scale factors and spectral coefficient.Further discussionLater in the week, Hyunkook Lee, LG, presented additional information on their CE. He showed a detailed performance table, and the average over all test items and operating modes is shown here:
5 Item 6 Global Gain
7 SF Only
8 SF and
Spectral
9 All
10 Average compression gain
110.1
4%
12 0.09%
13 0.18%
14 0.40%
Kristofer Kjörling, Dolby, noted that the proposal appears to offer limited compression advantage, and hence does not recommend any action at this time. The Chair noted suggested that the group should consider
whether global gain should remain a 8-bit PCM value whether there should be a reset between SF and Spectral entropy coding the extent that it is advantageous to have one and only one entropy coder (i.e.
arithmetic coding) whether channel errors should be considered when changing bitstream elements
It was the consensus of the Audio Subgroup to study the above questions (bullet items) as part of the workplan on this CE. An additional element of the workplan is that LG will give RM proponent a training corpus to encode at 9 operating modes with bitstreams returned to LG.Herve Taddei, Huawei, presented
m16338 Huawei Core Experiment proposal for USACHerve TaddeiDejun ZhangMinjie Xie
This contribution proposes a pulse indexing technique for coding the excitation in the ACELP coding mode. The proposed technology is lossless and shows a bit savings of 32 to 64 bits per super-frame. It proposes to use the pulse indexing tool when there are more than 5 pulses per track, which typically occurs in the higher bitrates where ACELP is employed.Phillipe Gournay VoiceAge, noted that a savings of 64 bits per superframe corresponds to a bitrate savings of much less than 3 kb/s. He further noted that the current RM0 bitstreams either do not ever use the modes cited in the contribution.The Chair suggested it would be very desirable to report the bitrate savings as weighted by the relative frequency that the coding mode occurred in the RM0 bitstreams. What the group suggests as additional information can be captured in the Workplan.Additional discussionHerve Taddei, Huawei, presented additional information on the performance of this proposed tool. Philip Gournay, VoiceAge, noted that the proposed complexity reduction is modest
118
when put in the context of the complexity of the entire decoder. The Chair noted that it would be very useful information to show the benefits delivered by the proposed tool in the context of operating the USAC system over the test set at some operating mode (bitrate for mono or stereo). Bernhard Grill, FhG, noted that if these modes are never used then removing them would give an even bigger complexity reduction. Kristofer Kjörling, Dolby, commented that the proposed CE is improving on a mode that is not used at any time for the set of operating modes and test items. It was agree that the RM proponent will run the USAC RQE from 12 to 64 at 4kbps increments for the CfP test set and report the relative frequency of the AMR-WB+ coding modes. This will be captured in the USAC Workplan.Ralf Geiger, FhG, presented
m16439 Proposed improvements to WD2 of USAC
Jeremie LecomteMax NeuendorfRalf GeigerMarkus Multrus
The contribution proposes a new method for handling the transitions between wLPT (i.e. TCX) mode in LPF to FD coding modes. Currently transitions from LPD to FD have an overhead of 128 samples and necessitate at 1152 length MDCT, which is more complex than a 1024 MDCT. LPD (ACELP) to FD requires a 128 sample overhead for overlap. This transition is not affected by the proposal.LPD (TXC) to FD discards 128 samples and overlaps 128 samples. The discarded 128 samples are not needed if the TDAC would actually work (i.e. TDAC by the frequency domain data). The contribution shows that by exchanging the order of MDCT and LP filtering, a proper TDAC overlap can be achieved.A listening test at 24 kb/s mono for test items that actually entailed a TCX to FD transition showed no audible difference at the 95% level of significance. It was noted that these transitions occur infrequently. In those frames affected, it showed an average a bit savings of 2.5%, and overall showed a savings of 500 bits per second. In a similar sense of the comment made by the Chair for m16338, Huawei suggested that it would be very desirable to report the bitrate savings as weighted by the relative frequency that the coding mode occurred in the RM0 bitstreams.Experts noted that, with this contribution, the transition from LPD to FD can be done in a critically sampled way, but that the transition from FD to LPD is still done in a non-critically sampled way. Kristofer Kjörling, Dolby, noted that a solution that permits the T/F coefficients to remain in critically sampled mode would be very desirable, even if bitrate savings were modest. This topic will continue to be discussed.Further discussionMax Neuendorf, FhG, presented new information on the relative frequency of this window transition for the 9 operating modes when coding the CfP items. On average, there were approximately 10 such transitions for each operating mode for the set of concatenated items. Juergen Herre, FhG, noted that a CEs cannot necessarily be judged on compression efficiency alone, but rather there may be an engineering design issue (i.e. “clean design”) that may also play a factor. Herve Taddei, Huawei, urged the group to insure that all CEs are treated fairly and judged using the same decision criterion. The Chair noted that he very much agreed with this comment.Eunmi Oh, Samsung, presented
m16376 Proposed Changes to WD2 for Phase Coding
JungHoe KimJulien RobilliardEunmi OhBernhard Grill
This contribution relates to previously presented contributions m16312, m16339, m16456, m16374.
119
First the presenter showed additional listening test results when all listening test sites are averaged together. The pooled results showed that the performance of RM+CE, at the 95% level of significance:
at 24 kb/s stereo, six items were better than RM, one was worse and the overall average was better
at 32 kb/s stereo, seven items were better than RM, none were worse and the overall average was better
The contribution presents WD changes needed for the Phase Coding CE (syntax tables and decoding semantics text).Kristofer Kjörling, Dolby, noted that the syntax in the contribution shows MPEG Surround pilot-based coding apparently used with IPD parameters. The presenter will check whether this is intentional or is an editing error.The Chair verified that the following issues must be addressed before experts can make a final decision on this CE:
clarify pilot-based syntax clarify whether syntax and sematics are consistent (e.g. bsIPDdataMode[][]) experts at Philips and Dolby will assess whether smoothing gives benefit for a critical
item
Further discussionEunmi Oh, Samsung, presented additional information about the above bullet items:
pilot-based syntax: it was clarified that pilot-base coding is not used in phase coding, so that this syntax item will be set to zero.
whether syntax and sematics are consistent: a number of inconsistencies were identified and corrected.
assess whether smoothing gives benefit for an example critical item: It was decided to retain smoothing and to add an additional threshold that selects between fine and coarse quantization tables.
Heiko Purnhagen, Dolby, requested that he have access to the bitstreams and decoder to verify performance of the technology. It was agreed to make the bitstreams for 24 kb/s stereo and 32 kb/s stereo and the associated decoder executable available to Dolby and Philips no later than May 1 so that experts at one or both companies can perform “sanity cross-check.”It is the consensus of the Audio Subgroup to accept this CE into the RM pending a positive outcome of the bitstream/decoder cross-check. Workplan for MPEG Reference Encoder SoftwareThe Chair drafted text for a Workplan for MPEG Reference Encoder Software. Max Nuendorf, FhG, indicated that the software supplied in m16321 had all signal flow modules present, such that if appropriate control software modules are added the software could achieve the quality of a Reference Quality Encoder. This software will be committed to the SVN server as the first item in the workplan. To organize tasks in the AhG period, a table of USAC encoder control modules was made. For each module in the table, companies stated their commitment to supply improved versions of that module on or before the time of a given MPEG meeting.SVN Repository and USAC SoftwareThe Chair constructed an directory tree for the MPEG Audio software in a form that facilitated committing it to the MPEG SVN server. In this framework:
• The location of MPEG-1, -2 and -4 -7 Reference Software indicated in a “ReadMe.txt” file, since it exists as tar archive “snapshots” on an external ftp server.
• The USAC Reference Software is on the SNV server
In the “Workplan for MPEG Reference Encoder Software” there is a milestone to commit test vectors (i.e. conformance bitstreams and reference waveforms) to the SVN server so that there is a framework to confirm the correct operation of the normative decoder.
120
Audio CE Methodology for USACJuergen Herre, FhG, presented his edited version of the CE Methodology, as did Heiko Purnhagen, Dolby. These two versions were merged in a break-out and reviewed by the group. An additional round of editing and review was done Thursday afternoon.
14.1.1 Exploration: Meta-Data
Stephan Schreiner, FhG, presented
m16334Proposed new Architecture of Metadata Driven Audio Post-Processing
Stephan Schreiner
This contribution discusses conventional practice in the audio industry. One step is mastering, which indicates application of e.g. compression, equalization or balance and level adjustment In one conventional practice, a stereo mix is produced and mastering is applied to that mix. In another, an intermediate step is introduced of producing stem mixes (“stems”), e.g. of similar sounds, mastering is applied to the stems and then the result is used to produce a final stereo signal. Post-processing is “mastering on the playback side” which can be done in the decoder or player. When considering this in practice, the contribution suggests that there may be an advantage to adopting the Stem-Mastering methodology.The contribution notes opportunities for standardization might be:
metadata bitstream syntax separator/discriminator data (i.e. re-create “stems”) manipulators (e.g. compression functions) profiles appropriate to classes of applications
Hyunkook Lee, LG, asked if it is necessary to transmit the stem signals if it is expected that the metadata would manipulate the stem signals. He further noted that mastering engineers would typically wish to use proprietary, state-of-the-art dynamics processors (e.g. compressors). Pierrick, Philippe, Orange Labs, noted that one of the figures in the contribution looks surprisingly like the Interactive Music MAF, and asked if this work it is the intention to extend this MAF.Kristofer Kjörling, Dolby, noted that a broadcaster that aggregates for broadcast will do significant post-processing (e.g. level adjustment, time-stretching, program insertion), and metadata should serve that purpose. He further noted that each National Body that has broadcasting members should try to submit information on current practice.Kate Grant, Nine Tiles, noted that IEC 62379 specifies ways to support audio metadata in the broadcasting chain. Mukta Kar, CableLabs, noted that metadata should have a low datarate, be linked and synchronized to the audio representation and be easily accessible (e.g. easy to extract from compressed bitstream or available as a readily accessible side-information stream). He noted that audio might learn about current practice in e.g. ATSC specification.
15 Audio closing plenary discussions
16 Meeting deliverables
16.1 Responses to Liaison and NB comments
The responses to Liaison and NB comments were prepared and approved.
16.2 Recommendations for final plenary
The Audio recommendations were presented and approved.
16.3 Establishment of Ad-hoc Groups
The following ad-hoc groups were established by the Audio subgroup:
121
No. Title Mtg10667 AHG on Audio Standards Maintenance No
10668AHG on Unified Speech and Audio Coding and Spatial Audio Object Coding
Yes
16.4 Approval of output documents
All output documents, shown in Annex D, were presented in Audio plenary and were approved.
16.5 Press statement
There was no Audio contribution to the press statement.
17 Future activities
17.1 Schedule of future meetings
Ad Hoc group meetings are indicated in Section 16.3. Unless otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting on the weekend preceding that meeting.
17.2 Agenda for next meeting
The agenda for the next MPEG meeting is shown in Annex F.
17.3 All other business
There was none.
17.4 Closing of the meeting
The 88th Audio Subgroup meeting was adjourned Friday at 12:30!
122
Annex A Participants
First Name Last Name Country AffiliationBruno Bessette CA Voiceage CorporationTi Eu Chan SG I2RYujie Dun CN XJTURalf Geiger DE Fraunhofer IIS
Philippe Gournay CanadaVoiceAge Corp. / Univ. of Sherbrooke
Bernhard Grill DE Fraunhofer IISOliver Hellmuth DE Fraunhofer IISJürgen Herre DE Fraunhofer IISJeff Huang USA Qualcomm Inc.Kyeong Ok Kang Korea ETRIKei Kikuiri JP NTT DOCOMOKristofer Kjörling SE DolbyHyunkook Lee KR LG electronicsTaejin Lee KR ETRITilman Liebchen DE LG ElectronicsTakehiro Moriya JP NTTMarkus Multrus DE Fraunhofer IISMax Neuendorf DE Fraunhofer IISToshiyuki Nomura JP NECTakeshi Norimatsu JP PanasonicEunmi Oh KR Samsung
Werner Oomen NLPhilips Applied Technologies
Pierrick Philippe FR France Telecom R&DHeiko Purnhagen SE DolbySchuyler Quackenbush USA ARLAndreas Schneider DE DolbyStephan Schreiner Germany Fraunhofer IISJeongil Seo KR ETRIHaiyan SHU Singapore I2RRalph Sperschneider DE Fraunhofer IISHerve Taddei DE Huawei TechnologiesOliver Wuebbolt DE ThomsonMinjie Xie USA Huawei
Huan Zhou SGPanasonic Singapore Laboratories
123
Annex B Audio Contributions and Schedule
Day / Time Task Group
Sunday
1000-1800 AhG: SAOC and USAC
1000-1130 SAOC
m16310Clarifications regarding the enhanced Karaoke/Solo processing mode
Leonid TerentievJürgen HerreCornelia FalchOliver Hellmuth
m16309 Report on corrections for the MPEG SAOC FCD text
Jonas EngdegårdHeiko PurnhagenCornelia FalchLeonid TerentievAndreas HölzerOliver HellmuthJohannes HilpertJeroen Koppens
m16363 Test Sequence Proposal for SAOC Verification TestJeongil SeoKyeongok KangKevin SeungChul Ham
m16384 Details of NB Position on SAOC Ballot Juergen Herre for GNB
1130-1300 USAC
Discussion on CE acceptance process
m16324 Comments on new USAC reference bitstreamsMax NeuendorfMarkus Multrus
m16323Report on Merge of sys2 Technology into RM0: SBR Improvements
Max NeuendorfTaejin Lee
m16383Report on Merge of sys2 Technology into RM0: TCX Improvements
Taejin LeeMax Neuendorf
1300-1400 Lunch
m16312Dolby Listening Test Results for USAC CE on Phase Coding in MPS
Heiko PurnhagenKristofer Kjörling
m16339Philips Listening Test Results for USAC CE on Phase Coding in MPS
Werner OomenJeroen Koppens
m16456Fraunhofer Listening Test Results for USAC CE on Phase Coding in MPS
Julien RobilliardMatthias NeusingerJohannes Hilpert
m16374Report on Phase Coding in MPEG Surround for USAC
JungHoe KimJulien RobilliardEunmi OhBernhard Grill
m16311Dolby Listening Test Results for USAC CE on AVQ-based LPC
Heiko PurnhagenKristofer Kjörling
m16322Fraunhofer IIS Listeningtest Results on USAC CE for AVQ-based LPC Quantizer
Markus MultrusRalf Geiger
m16316 CE Report on LPC Quantization for USAC
Philippe GournayBruno BessetteRoch LefebvreRedwan Salami
m16325VoiceAge Test Report for USAC CE on Unvoiced Coding
Philippe GournayRoch Lefebvre
m16373 Report on Unvoiced Speech Coding for USACHosang SungEunmi OhMiyoung Kim
Review of AhG Report and Presentation Schuyler Quackenbush
1800- Chairs Meeting
124
Monday
0900-1230 MPEG Plenary
1300-1400 Lunch
1400-1430 Audio Plenary
Welcome
Report on Sunday Chairs meeting
Review main tasks for the week
General documents
m16433 87th MPEG Audio Report S. Quackenbush
m16245 Ad Hoc Group on Audio Standards Maintenance R. Sperschneider
m16246 Ad Hoc Group on SAOC, USAC S. Quackenbush
NB Position Papers
m16297Swedish NB comment in response to Resolution 3.1.2 in N10312
Swedish NB via SC 29 Secretariat
1430-1500 Plenary Items
Profiles and 960/1024 xform lengths
SAOC timetable
Normative start-up and shut-down
SVN repository for audio
1630-1800 MPEG-2, MPEG-4 and MPEG-26
m16441Fast SBR filterbanks for AAC-ELD, HE-AAC, and USAC.
Yuriy A. ReznikRavi K. Chivukula
m16443On complexity of size 960 transform in AAC and related codecs
Yuriy A. ReznikRavi K. Chivukula
m16299 Defect report on ISO/[email protected]
m16321Proposed Additions to and Corrections of the USAC Reference Software
Stefan BayerMarkus Multrus
m16434 Draft Revised Audio CE Methodology Schuyler Quackenbush
1800- HoD Meeting
Tuesday
0900-1000 USAC
m16314Progress report on harmonic transposer CE for the USAC work item
Kristofer KjörlingMax Neuendorf
m16397Core Experiment Proposal on the eSBR module of USAC
Kei KikuiriKousuke TsujinoNobuhiko Naka
m16446 Core experiment proposal on arithmetic codingSungyong YoonHyunkook LeeYounghee Choi
m16338 Huawei Core Experiment proposal for USACHerve TaddeiDejun ZhangMinjie Xie
m16439 Proposed improvements to WD2 of USAC
Jeremie LecomteMax NeuendorfRalf GeigerMarkus Multrus
1100-1200 Workplan on MPEG Reference Encoder
1200-1400 Lunch
125
1400-1500 960/1024 frame length
1430-1500 Maintenance
m16315proposed clarification on byte alignments in LOAS streams
Toshiyuki Nomura
1500-1800 USAC
m16376 Proposed Changes to WD2 for Phase Coding
JungHoe KimJulien RobilliardEunmi OhBernhard Grill
1800- Chairs Meeting
Wednesday
0900-1100 MPEG Plenary
1130-1230 Exploration: Metadata
m16334Proposed new Architecture of Metadata Driven Audio Post-Processing
Stephan Schreiner
1230-1400 Lunch
1400-1430 SVN breakout (SQ, MM, YR, PP, HT, MN)
MPEG Reference Encoder Software
1430-1600 Revised CE Methodology
1800-2100 Social
Aloha Pavilion and Ko’ala
Thursday
0900-1200 Open Issues
Pulse indexing in ACELPMPEG Reference Encoder Software Revised CE Methodology
1200-1400 Lunch
Arithmetic coding of SFsTransitions of wLPT to FDPhase coding in MPEG Surround960/1024 issues
1800- Chairs Meeting
Friday
0900-1300 Audio plenary
Remarks on Thursday Chairs meeting
Recommendations for final plenary
Establishment of new Ad-hoc groups
AhG Mandates
Get document numbers
1000 Approve Responses to NB comments and Liaison
1030
Approval of output documentsTitle: N10xxxFile: w10xxx (short title).doc (NOT *.docx!)Zip: w10xxx.zip
Review of Audio presentation to MPEG plenary
Agenda for next meeting
A.O.B.
Closing of the Audio meeting
126
1300-1400 Lunch
1400- MPEG Plenary
127
Annex C Task Groups
1. MPEG-2 and MPEG-4 Audio, MPEG Audio Conformance, MPEG reference software2. MPEG-D Spatial Audio Object Coding3. MPEG-D Unified Speech and Audio Coding4. Exploration: Meta-Data
128
Annex D Output DocumentsNo. Title TBP Available
14496-3 Audio10650 ISO/IEC 14496-3:2009/DCOR 1:200X Byte Alignment No 09/04/24
10651Study on ISO/IEC 14496-3:2009/ FPDAM 1:200x, HD-AAC Profile, MPEG Surround Signaling
No 09/04/24
10652 WD on AAC family of profiles No 09/04/2414496-5 Reference Software
10653 DoC on ISO/IEC 14496-5:2001/FPDAM 24, MPEG-4 AAC ELD No 09/04/2410654 ISO/IEC 14496-5:2001/FDAM 24, MPEG-4 AAC ELD No 09/05/24
14496-26 Audio Conformance10655 Request for Amendment, 14496-26:2009/PDAM 2 No 09/04/24
10656ISO/IEC 14496-26:2009/PDAM 2, BSAC Conformance for Broadcasting
No 09/04/24
10657Study on ISO/IEC 14496-26:2009/DCOR 1, ALS, SLS and AAC updates
No 09/04/24
23003-1 MPEG Surround10658 Study on ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections No 09/04/24
23003-2 SAOC
10659Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding
No 09/04/24
10660 Status and Workplan on SAOC Core Experiments No 09/04/2423003-3 Unified Speech and Audio Coding
10661 WD3 of USAC No 09/05/2410662 Workplan for USAC CEs No 09/04/2410663 Workplan on MPEG USAC Reference Encoder No 09/04/2410664 MPEG Audio CE methodology No 09/04/2410669 MPEG Audio Test Material for Core Experiments No 09/04/24
Liaison Statements10665 Response to IEC TC-100 on IEC CDV 62571 No 09/04/24
Responses to National Bodies10666 Response to Swedish NB on 960 and 1024 block lengths No 09/04/24
129
Annex E Agenda for the 89th MPEG Audio Meeting
Agenda Item1. Opening of the meeting2. Administrative matters
2.1. Communications from the Chair2.2. Approval of agenda and allocation of contributions2.3. Review of task groups and mandates2.4. Approval of previous meeting report2.5. Review of AhG reports 2.6. Joint meetings2.7. Received national body comments and liaison matters
3. Plenary issues4. Task group activities
4.1. MPEG-1, MPEG-2, MPEG-4, and MPEG-264.2. Spatial Audio Object Coding4.3. Unified Speech and Audio Coding4.4. Exploration: Meta-Data
5. Discussion of unallocated contributions6. Meeting deliverables
6.1. Responses to Liaison and NB comments6.2. Recommendations for final plenary6.3. Establishment of new Ad-hoc groups6.4. Approval of output documents6.5. Press statement
7. Future activities8. Agenda for next meeting9. A.O.B10. Closing of the meeting
130
Annex I – 3DG report
Source: Marius Preda, Chair
1 Opening of the meeting
17.5 Approval of the agendaThe agenda is approved.
17.6 Goals for the weekThe goals of this week are:Review SC-3DMC contributions and issue the associated study of CD (and CE?!)Discuss the software and conformance for SC-3DMC
Discuss FAMC, Scene Partitioning RefSoftware and Conformance (?!)Status of software implementation in MP25 (especially the IC integration issues)Commit the last version of the reference software (IM1) on the SVN
Check the code for the hierarchical compression mode for 3DMCFinish 14496-27
Check the validity and re-generate when necessary conformance data for 3DGC AFX 3rd EditionInvestigate future developments of MPEG 3D Graphics Compression
Review new representation (IndexedRegionSet) and codec (SIM)Review RGC (reconfigurable graphics codec) contribution3D Graphics MXM EngineAvatar characteristics
Review Liaisons Review the votes
17.7 Standards from 3DGC
4 5 2001 Amd.22 3DG Compr. Model RefSof
06/07 08/01 08/07 09/02 3
4 4 200x Cor.7 (Audio & 3DG) 09/02 09/07 34 5 2001 Amd.25 Scene partitioning
RefSof08/10 09/02 09/07 3
4 5 200x Amd.26 XXXXXXXXX 09/04 09/10 10/04 34 5 200x Amd.27 Scalable compl.
3DMC RS09/02 09/07 10/01 3
131
4 16 2006 Amd.4 Scalable complexity 3D mesh coding
08/01 08/10 09/02 09/07 10/01 3
4 16 200x 3rd Ed. AFX 07/10 09/04 34 27 200x Amd.1 Scene partitioning
conformance08/10 09/02 09/07 3
4 27 200x Amd.2 Scalable compl. 3DMC conformance
09/02 09/07 10/01 3
17.8 Room allocation3DGC: Pioneer
132
17.9 Allocation of contributionsN° Title ScheduleD1 Monday D1
MPEG Plenary 09:00~11:30Lunch Break 13:00~14:003DG Plenary 14:00~15:00Roll call, Agenda, Goals, FAQ, etc., Marius Preda
m16238Report of AHG on 3DGC documents, experiments and software maintenance
Patrick Gioia, Francisco Moran
Results of voting Marius Preda
Scalable Complexity 3D Mesh Encoding (SC-3DMC)15:00 – 18:00
m16399CE Report on SC3DMC Ver 4.0
Seungwook Lee, Bonki Koo, Daiyong Kim, Kyoungsoo Son, Euee S. Jang
m16401 Algorithm descriptions of attribute data on SC3DMCSeungwook Lee, Bonki Koo, Kyoungsoo Son, Daiyong Kim, Euee S. Jang
m16404 Update and current status of SC3DMCDaiyong Kim, Kyoungsoo Son, Seungwook Lee, Bonki Koo, Euee S. Jang
m16466 Optimized implementation of the TFAN encoderWalid Hachicha, Khaled Mammou, Titus Zaharia
m16435 TFAN bitstream syntax updateKhaled Mamou, Titus Zaharia, Marius Preda, Françoise PRETEUX
D2 Tuesday D2Scalable Complexity 3D Mesh Encoding (SC-3DMC) 09:00~12:00
m16337Fast Array Encoder (FAE): an efficient extension to the QBCR compression technique
Khaled Mamou, Faouzi Ghorbel
m16436 FAE software description Khaled Mamou, Faouzi Ghorbel
m16490CE Report on SC3DMC: FAE versus QBCR, QBCR BP and QBCR AC Khaled Mamou, Faouzi Ghorbel
133
N° Title Schedule
m16402 QBCR and SVA bitstream syntax updateSeungwook Lee, Bonki Koo, Kyoungsoo Son, Daiyong Kim, Euee S. Jang
Wrap-up on SC-3DMCLunch Break 12:00~14:00
m16403Joint with Video on RGC
Seungwook Lee, Bonki Koo, Kyoungsoo Son, Daiyong Kim, Mingxiao Chen, Euee S. Jang
14:00 – 15:00
m16425Joint with System on MPEG-VAvatar Characteristics
Blagica Jovanova, Marius Preda, Françoise Preteux
15:00 – 16:00
m16427Joint with Systems on MXMIntegrated API for 3D Graphics
Ivica Arsov, Marius Preda16:00 – 17:00
m16305IndexedRegionSet: Efficient Representation of Meshes with Multiple Textures
Sergio Arnaldo, Francisco Morán, Marcos Avilés
17:00 – 17:15
--- Scalable Intra-band Mesh coding (SIMc) Leon Denis17:15 – 17:30
D3 Wednesday D3
MPEG Plenary09:00~11:0
0
MP2511:45~12:0
0Status on the Reference Software Marius Preda
SC-3DMC discussion on the start code and marker bit12:00 – 12:15
Lunch Break 13:00~14:00AFX Conformance and RefSoft 14:00~15:00
m16469 Comparison result of conformance test on the SVNDaiyong Kim, Kyoungsoo Son, Seungwook Lee, Bonki Koo, Euee S. Jang
BO1: Editing of AFX 3rd Edition Marius Preda 15:00~17:30BO2: Editing of AFX AMD 4 Seungwook Lee 15:00~17:30BO3: Editing of MPEG-4 Part 27 Daiyong Kim, Francisco Moran 15:00~17:30
134
N° Title Schedule
Avatar Characteristics16:00~18:00
D4 Thursday
First wrap-up BOs 09:00~09:10Joint with Systems on MXM developer's day 09:10~10:00Joint with System on Avatar characteristicsPreparation of the WD2.0
Jeong-Hwan Ahn 10:00~11:00
3DG Vision Marius Preda 11:00~11:30AFX Core Experiment review 11:30~12:00Lunch Break 13:00~14:003DG Plenary (preparation of the output documents) 14:00~18:00
Part 16 IssuesPart 25 IssuesPart 27 IssuesMPEG-V Avatar Characteristics Issues
D5 Friday D53DG output documents preparation All 09:00~12:00AhGs and resolutions
Lunch Break 12:00~14:00MPEG Plenary 14:00~
135
17.10 Attendance list
Name Country Company
Marius Preda France Institut TELECOM
Francisco Morán Burgos Spain UPM
Seung Wook Lee Korea ETRI
Euee S. Jang Korea Hanyang Univ.
Byoungjun Kim Korea Hanyang Univ.
D.Y. Kim Korea Hanyang Univ.
Jeong-Hwan Ahn Korea Samsung
Leon Denis Belgium VUB
18 General issues
18.1 General discussion
18.1.1 Reference Software
It is recalled that the source code of both decoder AND encoder should be provided as part of the Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing the complete software for a published technology shall conduct to the removal of the corresponding technical specification from the standard.Currently almost all the AFX tools published in the second edition are supported by both encoder and decoder implementation. Only exception is the MeshGrid tool for the standalone decoder; however commitment was renewed by VUB, represented during this meeting by Leon Denis.
18.1.2 Web site
OrangeLabs announced interrupting the maintenance of the group web-site. A call for volunteers is now issued. In the meantime 3DGC contributors are kindly asked to check the web-site and provide comments on the current version of the web-site.
19 Current Voting
Document title DoC Editor of DoC
136
No vote submitted for this meeting.
20 AFX (14496-16) related activities
20.1 AhG on AFX activitiesTitle Report of AHG on 3DGC documents, experiments and software maintenance
Authors Patrick Gioia, Francisco MoranSummaryResolution Accepted
20.2 Scalable Complexity 3D Mesh Compression (14496-16 Amd.4)
Title CE Report on SC3DMC Ver 4.0Authors Seungwook Lee, Bonki Koo, Daiyong Kim, Kyoungsoo Son, Euee S. Jang
Summary
Objectives: - speed up the decoding of QBCR, solution consists in reading more bits at the time- add more parameter control on top of QBCR: delta prediction, entropy coding- encode attributes: two proposals: one for normal and one for texture coordinatesThe normal encoding includes two prediction methods: difference and XOR and two binarization (BP and AC)The texture coordinates compression considers regular and not regular patches. The method is too complex for QBCR and SVA.
ResolutionAccept the proposed method for attributes encoding. Accept the delta prediction.
Title Algorithm descriptions of attribute data on SC3DMCAuthors Seungwook Lee, Bonki Koo, Kyoungsoo Son, Daiyong Kim, Euee S. Jang
SummaryResolution See previous resolution
Title Update and current status of SC3DMCAuthors Daiyong Kim, Kyoungsoo Son, Seungwook Lee, Bonki Koo, Euee S. Jang
SummaryResolution See previous resolution
Title Optimized implementation of the TFAN encoderAuthors Walid Hachicha, Khaled Mammou, Titus Zaharia
137
Summary
This contribution explains the dependency between encoder parameters and proposes to replace the exhaustive research for the optimal configuration with a near-optimal but faster approach. The contribution does not affect the bitstream syntax being related to the encoder.
Resolution- accept the optimization as part of the RefSoft (Encoder side)- add the formula for computing k in the Informative Annex on TFAN encoder
Title TFAN bitstream syntax updateAuthors Khaled Mamou, Titus Zaharia, Marius Preda, Françoise PRETEUX
Summary Editorial changes are proposed for TFAN.
Resolution- change the bitstream syntax for ensuring its byte-alignment- introduce a detailed annex on the arithmetic decoder
TitleFast Array Encoder (FAE): an efficient extension to the QBCR compression technique
Authors Khaled Mamou, Faouzi Ghorbel
Summary
There extensions over the original QBCR are presented: - prediction of the current values with respect to up to 7 temporal neighbors- a 4-bit alignment for speeding up the reading process- an AC combined with Exponential Golomb code
Resolution see below
Title CE Report on SC3DMC: FAE versus QBCR, QBCR BP and QBCR ACAuthors Khaled Mamou, Faouzi Ghorbel
Summary
- the proposed extension of QBCR (called FAE) shows an average gain of 30-40% over QBCR, 10-20% over QBCR BP and equivalent compression results to QBCR AC- FAE AC offers the best compression performances with an average gain (in terms of bitrates) of 10-30% w.r.t FAE
Resolution
- introduce the 7th order prediction in QBCR- introduce mixed entropy encoding (AC+ExpGK) for QBCR and SVA- evaluate the BPC and 4-byte binarization for QBCR- maintain the fixed length binarization for QBCR in no prediction mode
Title Wrap-up on SC-3DMCAuthors all
Summary
Resolution
1. Updates for the textual specifications (study text): - introduce the different prediction modes for QBCR - new header structure:
2. Updates for CE: - add AC+ExpGK for QBCR and SVA- compare AC+ExpGK with BPC for coordinates and attributes- evaluate BPC over 4-bits binarization for QBCR
138
20.2.1 Scene partitioning (14496-11 Amd.6)
SP is followed as a joint activity between Systems and 3DGC. The technology is integrated in Part 11. There was no joint meeting with Systems on this topic during this meeting.SP activity on conformance and reference software continued.
20.3 Maintenance
20.3.1 FAMC Conformance and Reference Software
FNB reports on a problem related to FAMC reference software, namely the usage of little endian convention when writing the bitstream. This conducts to errors in parsing the FAMC bitstream when encapsulated in MP4. Resolution: issue a corrigendum on FAMC ref soft and conformance and ask the contributors to update the software and regenerate the bitstreams.
20.3.2 AFX 3rd Edition
The document was updated during the week. Publication of the current version as FDIS (an editing period of 4 weeks is accepted).
20.4 Dataset and benchmarking
For Scalable Complexity 3D Mesh Coding, the www.MyMultimediaWorld.com will be used for benchmarking.
20.5 Software
Title Current status of MeshGrid compression softwareAuthors Leon Denis
SummaryA presentation of the current implementation (including a GUI) of MeshGrid was demonstrated by VUB representatives. Some bugs still occur.The encoder was committed on the SVN
Resolution Continue the work on standalone version of the MG decoder
20.6 Promotions
20.6.1 Web Site
Title Status of www.mpeg-3dgc.com Authors
Summary The web site is not more maintained by OrangeLab.
ResolutionAction Point: Transfer the web-site to other location and call for volunteers for maintanace.
139
20.7 Future
20.7.1 MPEG-V - Information Exchange with Virtual Worlds (formally Metaverse)
Title Joint meeting with Systems on MPEG-V, Avatar CharacteristicsAuthors Blagica Jovanova, Marius Preda, Françoise Preteux
SummaryA rich schema for avatar metadata is proposed and all the elements are documented.
ResolutionThe XSD is accepted as it is. Update the WD of MPEG-V Avatar Characteristics accordingly.
Title Avatar Characteristics updates Authors Jeong-Hwan
Summary
Resolution
1. Add FaceControlFeatureType as the set of outlines (HeadOutline, EyeOutline, LeftEarOutline, RightEarOutline, NoseOutline, UpperMouthLipOutline, LowerMouthLipOutline)
2. Define a new type OutlineType as three 3D points (left, middle, right)
3. Create BodyControlFeatureType based on SkeletonType.
4. Group BodyControlFeatureType and FaceControlFeatureType in AvatarControlType.
5. Add in element Moves the value FreeDirection (Move to arbitrary direction)
6. Add in type Appearance a new element called PhysicalCondition with two elements (BodyStrength from -3 to 3 and Flexibility with three levels (Low, medium, high)
7. Add in new element type in AvatarType called AvatarModel (1: Human, 2: Animal with 4 legs, 3: Bird, 4: Robot with Wheels, 5-255: undefined)
20.7.2 MXM
Title Integrated API for 3D GraphicsAuthors Ivica Arsov, Marius Preda
SummaryA new version of the Graphics3D API, integrated in MediaFrameworkAPI is presented
Resolution Accepted.
140
20.7.3 Future directions of 3D Graphics Compression
Title Joint meeting with video on RVC
AuthorsSeungwook Lee, Bonki Koo, Kyoungsoo Son, Daiyong Kim, Mingxiao Chen, Euee S. Jang
Summary
A study on RVC based Graphics codecThe main idea is to create a codec built on the graphics primitives' levels and not on node definition. A preliminary list of graphics primitives was proposed.
Resolution
- start from the graphics primitives, identify the codecs we have already, identify the FU for each codec.- mandate in 3DGC AhG- a new Exploration on identifying the functional units to be documented in the CE document
Title IndexedRegionSet: Efficient Representation of Meshes with Multiple TexturesAuthors Sergio Arnaldo, Francisco Morán, Marcos Avilés
Summary A new representation of meshes is proposed
Resolution Continue the exploration to identify the added value of the new representation
Title Scalable Intra-band Mesh coding (SIMc)Authors Leon Denis
Summary
Resolution Continue the exploration for providing evidences of better compression results.
21 3D Graphics Compression Model (14496-25) activities
21.1 Software and conformance
Title Comparison result of conformance test on the SVNAuthors Daiyong Kim, Kyoungsoo Son, Seungwook Lee, Bonki Koo, Euee S. Jang
SummaryAll the bitstreams are classified with respect to their availability and corrected-ness with respect to the last version of the RefSoft
Resolution Contact the providers of the broken bitstreams and ask for new version.
141
22 Liaison
TitleAuthors
SummaryResolution
23 Output documents and Resolutions of 3DGC
23.1 Part 16 Animation Framework eXtension (AFX)
23.1.1 The 3DG subgroup recommends approval of the following documents
No. Title TBP Available14496-16 Animation Framework eXtension (AFX)
10528 Study Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh Compression)
No 09/04/24
10529 Description of AFX CE and explorations No 09/04/2410530 ISO/IEC 14496-16 3rd Edition Yes 09/05/2410532 MPEG 3D Graphics FAQ v22 Yes 09/04/24
23.2 Promotion
23.2.1 The 3DG subgroup recommends approval of the following documents
No. Title TBP AvailablePromotion
10531 MPEG 3D Graphics Vision Yes 09/04/24
23.3 Establishment of 3DGC Ad-Hoc Groups10533 AHG on 3DGC documents, software maintenance and core experimentsMandate: 1. Conduct the experiments in Scalable Complexity Mesh Compression
2. Coordinate 3DGC related conformance and reference software3. Maintain and edit 3DGC documents 4. Coordinate editing of the www.mpeg-3dgc.com web site 5. Coordinate exploration activities related to RGC
Chairman: Francisco Morán Burgos Duration: Until 89th MeetingMeetings Sunday before 89th meetingReflector: mpeg-3dgc AT gti. ssr. upm. es
142
Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
24 Closing of the Meeting
See you in London.
143