ISO/IEC 14496-10:2008
(Main)Information technology — Coding of audio-visual objects — Part 10: Advanced Video Coding
Information technology — Coding of audio-visual objects — Part 10: Advanced Video Coding
ISO/IEC 14496-10:2008 was developed jointly with the ITU-T in response to the growing need for higher compression of moving pictures for various applications such as digital storage media, television broadcasting, Internet streaming, and real-time audiovisual communication. It is also designed to enable the use of the coded video representation in a flexible manner for a wide variety of network environments. It is designed to be generic in the sense that it serves a wide range of applications, bit rates, resolutions, qualities and services. The use of ISO/IEC 14496-10:2008 allows motion video to be manipulated as a form of computer data and to be stored on various storage media, transmitted and received over existing and future networks and distributed on existing and future broadcasting channels. In the course of creating ISO/IEC 14496-10:2008, requirements from a wide variety of applications have been considered, necessary algorithmic elements have been developed, and these have been integrated into a single syntax. Hence, ISO/IEC 14496-10:2008 will facilitate video data interchange among different applications. The coded representation specified in the syntax is designed to enable a high compression capability with minimal degradation of image quality. The algorithm is not ordinarily lossless, as the exact source sample values are typically not preserved through the encoding and decoding processes. A number of syntactical features with associated decoding processes are defined that can be used to achieve highly efficient compression, and individual selected regions can be sent without loss. The expected encoding algorithm (not specified in ISO/IEC 14496-10:2008) can select between inter and intra coding for block-shaped regions of each picture. Inter coding uses motion vectors for block-based inter-picture prediction to exploit temporal statistical dependencies between different pictures. Intra coding uses spatial prediction modes to exploit spatial statistical dependencies in the source signal within a single picture. Motion vectors and intra prediction modes may be associated with a variety of block sizes in a picture. The residual signal remaining after intra or inter prediction is then processed using a spatial transform to remove spatial correlation within each transform block. The transformed blocks are then quantised. Quantisation is an irreversible process that forms an approximation that can be represented using a reduced number of bits while incurring some loss of fidelity. Finally, the motion vectors or intra prediction modes are combined with the quantised transform coefficient information and encoded using either context-adaptive variable length codes or context-adaptive binary arithmetic coding. Annexes A through E and G contain normative requirements and are an integral part of ISO/IEC 14496-10:2008. Annex A defines eleven profiles (Baseline, Main, Extended, High, High 10, High 4:2:2, High 4:4:4 Predictive, High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra, and CAVLC 4:4:4 Intra), each being tailored to a group of application domains, and also defines levels of capability within each of these profiles. Annex B specifies the syntax and semantics of a byte stream format for delivery of the coded video as an ordered stream of bytes or bits. Annex C specifies the Hypothetical Reference Decoder and its use to check bitstream and decoder conformance. Annex D specifies syntax and semantics for Supplemental Enhancement Information message payloads. Annex E specifies syntax and semantics of the Video Usability Information parameters of coded video sequences. Annex G specifies scalable video coding in three additional profiles (Scalable Baseline, Scalable High, and Scalable High Intra) which enable a coded video bitstream to be structured into layers, such that layered subsets of the bitstream can be independently decodable to provide video quality commensurate with the quantity of data that remains in each smaller b
Technologies de l'information — Codage des objets audiovisuels — Partie 10: Codage visuel avancé
General Information
Relations
Buy Standard
Standards Content (Sample)
INTERNATIONAL ISO/IEC
STANDARD 14496-10
Fourth edition
2008-09-15
Information technology — Coding of
audio-visual objects —
Part 10:
Advanced video coding
Technologies de l'information — Codage des objets audiovisuels —
Partie 10: Codage visuel avancé
Reference number
©
ISO/IEC 2008
PDF disclaimer
PDF files may contain embedded typefaces. In accordance with Adobe's licensing policy, such files may be printed or viewed but shall
not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In
downloading a PDF file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat
accepts no liability in this area.
Adobe is a trademark of Adobe Systems Incorporated.
Details of the software products used to create the PDF file(s) constituting this document can be found in the General Info relative to
the file(s); the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the files are suitable for
use by ISO member bodies. In the unlikely event that a problem relating to them is found, please inform the Central Secretariat at the
address given below.
This CD-ROM contains the publication ISO/IEC 14496-10:2008 in portable document format (PDF), which
can be viewed using Adobe® Acrobat® Reader.
Adobe and Acrobat are trademarks of Adobe Systems Incorporated.
This fourth edition cancels and replaces the third edition (ISO/IEC 14496-10:2005), which has been
technically revised.
© ISO/IEC 2008
All rights reserved. Unless required for installation or otherwise specified, no part of this CD-ROM may be reproduced, stored in a retrieval
system or transmitted in any form or by any means without prior permission from ISO. Requests for permission to reproduce this product
should be addressed to
ISO copyright office • Case pos
...
INTERNATIONAL
ISO/IEC
STANDARD
14496-10
Fourth edition
2008-09-15
Information technology — Coding of
audio-visual objects —
Part 10:
Advanced video coding
Technologies de l'information — Codage des objets audiovisuels —
Partie 10: Codage visuel avancé
Reference number
©
ISO/IEC 2008
PDF disclaimer
This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but
shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In
downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat
accepts no liability in this area.
Adobe is a trademark of Adobe Systems Incorporated.
Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation
parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In
the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.
© ISO/IEC 2008
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland
ii
© ISO/IEC 2008 – All rights reserved
Contents Page
Foreword.xiii
0 Introduction.xv
0.1 Prologue.xv
0.2 Purpose .xv
0.3 Applications .xv
0.4 Publication and versions of this specification.xv
0.5 Profiles and levels.xvi
0.6 Overview of the design characteristics .xvi
0.6.1 Predictive coding.xvii
0.6.2 Coding of progressive and interlaced video .xvii
0.6.3 Picture partitioning into macroblocks and smaller partitions .xvii
0.6.4 Spatial redundancy reduction .xviii
0.7 How to read this specification.xviii
1 Scope.1
2 Normative references.1
3 Terms and definitions .1
4 Abbreviations .9
5 Conventions .10
5.1 Arithmetic operators .10
5.2 Logical operators.10
5.3 Relational operators.11
5.4 Bit-wise operators .11
5.5 Assignment operators.11
5.6 Range notation .11
5.7 Mathematical functions.12
5.8 Variables, syntax elements, and tables.13
5.9 Text description of logical operations.13
5.10 Processes.14
6 Source, coded, decoded and output data formats, scanning processes, and neighbouring relationships.15
6.1 Bitstream formats.15
6.2 Source, decoded, and output picture formats.15
6.3 Spatial subdivision of pictures and slices.20
6.4 Inverse scanning processes and derivation processes for neighbours.21
6.4.1 Inverse macroblock scanning process .21
6.4.2 Inverse macroblock partition and sub-macroblock partition scanning process.22
6.4.3 Inverse 4x4 luma block scanning process .23
6.4.4 Inverse 4x4 Cb or Cr block scanning process for ChromaArrayType equal to 3.23
6.4.5 Inverse 8x8 luma block scanning process .24
6.4.6 Inverse 8x8 Cb or Cr block scanning process for ChromaArrayType equal to 3.24
6.4.7 Derivation process of the availability for macroblock addresses .24
6.4.8 Derivation process for neighbouring macroblock addresses and their availability .24
6.4.9 Derivation process for neighbouring macroblock addresses and their availability in MBAFF frames.25
6.4.10 Derivation processes for neighbouring macroblocks, blocks, and partitions .26
6.4.11 Derivation process for neighbouring locations.31
7 Syntax and semantics.34
7.1 Method of specifying syntax in tabular form.34
7.2 Specification of syntax functions, categories, and descriptors.35
7.3 Syntax in tabular form.36
7.3.1 NAL unit syntax.36
7.3.2 Raw byte sequence payloads and RBSP trailing bits syntax.37
7.3.3 Slice header syntax.46
© ISO/IEC 2008 – All rights reserved
iii
7.3.4 Slice data syntax . 50
7.3.5 Macroblock layer syntax. 51
7.4 Semantics.58
7.4.1 NAL unit semantics . 58
7.4.2 Raw byte sequence payloads and RBSP trailing bits semantics . 67
7.4.3 Slice header semantics. 81
7.4.4 Slice data semantics. 91
7.4.5 Macroblock layer semantics . 92
8 Decoding process. 104
8.1 NAL unit decoding process.105
8.2 Slice decoding process.106
8.2.1 Decoding process for picture order count. 106
8.2.2 Decoding process for macroblock to slice group map . 110
8.2.3 Decoding process for slice data partitioning. 114
8.2.4 Decoding process for reference picture lists construction . 114
8.2.5 Decoded reference picture marking process . 121
8.3 Intra prediction process.125
8.3.1 Intra_4x4 prediction process for luma samples . 126
8.3.2 Intra_8x8 prediction process for luma samples . 132
8.3.3 Intra_16x16 prediction process for luma samples . 140
8.3.4 Intra prediction process for chroma samples . 142
8.3.5 Sample construction process for I_PCM macroblocks . 147
8.4 Inter prediction process.147
8.4.1 Derivation process for motion vector components and reference indices. 150
8.4.2 Decoding process for Inter prediction samples. 163
8.4.3 Derivation process for prediction weights . 172
8.5 Transform coefficient decoding process and picture construction process prior to deblocking filter process 174
8.5.1 Specification of transform decoding process for 4x4 luma residual blocks. 174
8.5.2 Specification of transform decoding process for luma samples of Intra_16x16 macroblock prediction
mode 175
8.5.3 Specification of transform decoding process for 8x8 luma residual blocks. 176
8.5.4 Specification of transform decoding process for chroma samples. 177
8.5.5 Specification of transform decoding process for chroma samples with ChromaArrayType equal to 3. 179
8.5.6 Inverse scanning process for transform coefficients.
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.