ISO/IEC 14496-2:2004
Information technology — Coding of audio-visual objects — Part 2: Visual
ISO/IEC 14496-2:2004 provides the following elements related to the encoded representation of visual information:
- Specification of video coding tools, object types and profiles, including capability to encode rectangular-based and arbitrary-shaped video objects, capability to define scalable bitstreams and error-resilient encoding tools;
- Specification of coding tools, object types and profiles for mapping of still textures into visual scenes;
- Specification of coding tools, object types and profiles for human face and body animation based on face/body models and additional semantic parameters; and
- Specification of coding tools, object types and profiles for animation of 2D warping grids with uniform and irregular topology.
The Visual specification contains definitions of the bitstream syntax, bitstream semantics and the related decoding process. It does not specify the encoders, which can be optimized in different implementations.
Technologies de l'information — Codage des objets audiovisuels — Partie 2: Codage visuel
Standards Content (Sample)
INTERNATIONAL STANDARD ISO/IEC 14496-2
Third edition
2004-06-01
Information technology — Coding of audio-visual objects — Part 2: Visual
Technologies de l'information — Codage des objets audiovisuels — Partie 2: Codage visuel
Reference number
© ISO/IEC 2004
PDF disclaimer
This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but
shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In
downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat
accepts no liability in this area.
Adobe is a trademark of Adobe Systems Incorporated.
Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation
parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In
the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.
This CD-ROM contains the publication ISO/IEC 14496-2:2004 in portable document format (PDF), which can
be viewed using Adobe® Acrobat® Reader.
Adobe and Acrobat are trademarks of Adobe Systems Incorporated.
© ISO/IEC 2004
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland
Contents
1 Scope
2 Normative references
3 Terms and definitions
4 Abbreviations and symbols
4.1 Arithmetic operators
4.2 Logical operators
4.3 Relational operators
4.4 Bitwise operators
4.5 Conditional operators
4.6 Assignment
4.7 Mnemonics
4.8 Constants
5 Conventions
5.1 Method of describing bitstream syntax
5.2 Definition of functions
5.3 Reserved, forbidden and marker_bit
5.4 Arithmetic precision
6 Visual bitstream syntax and semantics
6.1 Structure of coded visual data
6.2 Visual bitstream syntax
6.3 Visual bitstream semantics
7 The visual decoding process
7.1 Video decoding process
7.2 Higher syntactic structures
7.3 VOP reconstruction
7.4 Texture decoding
7.5 Shape decoding
7.6 Motion compensation decoding
7.7 Interlaced video decoding
7.8 Sprite decoding
7.9 Generalized scalable decoding
7.10 Still texture object decoding
7.11 Mesh object decoding
7.12 FBA object decoding
7.13 3D Mesh Object Decoding
7.14 NEWPRED mode decoding
7.15 Output of the decoding process
7.16 Video object decoding for the studio profile
7.17 The FGS decoding process
8 Visual-Systems Composition Issues
8.1 Temporal Scalability Composition
8.2 Sprite Composition
8.3 Mesh Object Composition
8.4 Spatial Scalability composition
9 Profiles and Levels
9.1 Visual Object Types
9.2 Visual Profiles
9.3 Visual Profiles@Levels
Annex A (normative) Coding transforms
Annex B (normative) Variable length codes and arithmetic decoding
Annex C (normative) Face and body object decoding tables and definitions
Annex D (normative) Video buffering verifier
Annex E (informative) Features supported by the algorithm
Annex F (informative) Preprocessing and postprocessing
Annex G (normative) Profile and level indication and restrictions
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission)
form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC
participate in the development of International Standards through technical committees established by the
respective organization to deal with particular fields of technical activity. ISO and IEC technical committees
collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in
liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have
established a joint technical committee, ISO/IEC JTC 1.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of the joint technical committee is to prepare International Standards. Draft International Standards
adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International
Standard requires approval by at least 75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights.
ISO and IEC shall not be held responsible for identifying any or all such patent rights.
ISO/IEC 14496-2 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information.
This third edition cancels and replaces the second edition (ISO/IEC 14496-2:2001), which has been technically
revised. It also incorporates the Amendments ISO/IEC 14496-2:2001/Amd. 1:2002,
ISO/IEC 14496-2:2001/Amd. 2:2002 and ISO/IEC 14496-2:2001/Amd. 3:2003.
ISO/IEC 14496 consists of the following parts, under the general title Information technology — Coding of audio-visual objects:
— Part 1: Systems
— Part 2: Visual
— Part 3: Audio
— Part 4: Conformance testing
— Part 5: Reference software
— Part 6: Delivery Multimedia Integration Framework (DMIF)
— Part 7: Optimized reference software for coding of audio-visual objects
— Part 8: Carriage of ISO/IEC 14496 content over IP networks
— Part 9: Reference hardware description
— Part 10: Advanced video coding
— Part 11: Scene description and application engine
— Part 12: ISO base media file format
— Part 13: Intellectual Property Management and Protection (IPMP) extensions
— Part 14: MP4 file format
— Part 15: Advanced Video Coding (AVC) file format
— Part 16: Animation framework extension (AFX)
— Part 17: Streaming text format
— Part 18: Font compression and streaming
— Part 19: Synthesized texture stream
Introduction
Purpose
This part of ISO/IEC 14496 was developed in response to the growing need for a coding method that can facilitate
access to visual objects in natural and synthetic moving pictures and associated natural or synthetic sound for
various applications such as digital storage media, internet, various forms of wired or wireless communication, etc.
The use of ISO/IEC 14496 means that motion video can be manipulated as a form of computer data and can be
stored on various storage media, transmitted and received over existing and future networks and distributed on
existing and future broadcast channels.
Application
The applications of ISO/IEC 14496 cover, but are not limited to, such areas as listed below:
IMM Internet Multimedia
IVG Interactive Video Games
IPC Interpersonal Communications (videoconferencing, videophone, etc.)
ISM Interactive Storage Media (optical disks, etc.)
MMM Multimedia Mailing
NDB Networked Database Services (via ATM, etc.)
RES Remote Emergency Systems
RVS Remote Video Surveillance
WMM Wireless Multimedia
Profiles and levels
ISO/IEC 14496 is intended to be generic in the sense that it serves a wide range of applications, bitrates,
resolutions, qualities and services. Furthermore, it allows a number o
...