Information technology — Coding of audio-visual objects — Part 2: Visual

Technologies de l'information — Codage des objets audiovisuels — Partie 2: Codage visuel

General Information

Status
Withdrawn
Publication Date
15-Dec-1999
Withdrawal Date
15-Dec-1999
Current Stage
9599 - Withdrawal of International Standard
Completion Date
06-Dec-2001
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 14496-2:1999 - Information technology -- Coding of audio-visual objects
English language
330 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 14496-2
First edition
1999-12-01
Information technology — Coding of
audio-visual objects —
Part 2:
Visual
Technologies de l'information— Codage des objets audiovisuels —
Partie 2: Codage visuel
Reference number
ISO/IEC 14496-2:1999(E)
©
ISO/IEC 1999

---------------------- Page: 1 ----------------------
ISO/IEC 14496-2:1999(E) © ISO/IEC
Contents
1 Scope.1
2 Normative references.1
3 Definitions.2
4 Abbreviations and symbols .8
4.1 Arithmetic operators.9
4.2 Logical operators .9
4.3 Relational operators.9
4.4 Bitwise operators .10
4.5 Conditional operators.10
4.6 Assignment.10
4.7 Mnemonics.10
4.8 Constants.10
5 Conventions.10
5.1 Method of describing bitstream syntax .10
5.2 Definition of functions .12
5.2.1 Definition of next_bits() function.12
5.2.2 Definition of bytealigned() function.12
5.2.3 Definition of nextbits_bytealigned() function.12
5.2.4 Definition of next_start_code() function.12
5.2.5 Definition of next_resync_marker() function.12
5.2.6 Definition of transparent_mb() function .13
5.2.7 Definition of transparent_block() function .13
© ISO/IEC 1999
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic
or mechanical, including photocopying and microfilm, without permission in writing from the publisher.
ISO/IEC Copyright Office � Case postale 56 � CH-1211 Genève 20 � Switzerland
Printed in Switzerland
ii

---------------------- Page: 2 ----------------------
© ISO/IEC ISO/IEC 14496-2:1999(E)
5.3 Reserved, forbidden and marker_bit.13
5.4 Arithmetic precision .13
6 Visual bitstream syntax and semantics.13
6.1 Structure of coded visual data.13
6.1.1 Visual object sequence .14
6.1.2 Visual object .14
6.1.3 Video object.14
6.1.4 Mesh object.19
6.1.5 Face object.20
6.2 Visual bitstream syntax .24
6.2.1 Start codes.24
6.2.2 Visual Object Sequence and Visual Object .27
6.2.3 Video Object Layer.29
6.2.4 Group of Video Object Plane .34
6.2.5 Video Object Plane and Video Plane with Short Header.34
6.2.6 Macroblock .48
6.2.7 Block.54
6.2.8 Still Texture Object .55
6.2.9 Mesh Object.64
6.2.10 Face Object.67
6.3 Visual bitstream semantics.77
6.3.1 Semantic rules for higher syntactic structures.77
6.3.2 Visual Object Sequence and Visual Object .77
6.3.3 Video Object Layer.83
6.3.4 Group of Video Object Plane .91
6.3.5 Video Object Plane and Video Plane with Short Header.91
6.3.6 Macroblock related.101
6.3.7 Block related.104
6.3.8 Still texture object .104
6.3.9 Mesh object.109
6.3.10 Face object.112
iii

---------------------- Page: 3 ----------------------
ISO/IEC 14496-2:1999(E) © ISO/IEC
7 The visual decoding process.117
7.1 Video decoding process.117
7.2 Higher syntactic structures.118
7.3 VOP reconstruction.118
7.4 Texture decoding .119
7.4.1 Variable length decoding.119
7.4.2 Inverse scan.120
7.4.3 Intra dc and ac prediction for intra macroblocks.121
7.4.4 Inverse quantisation .123
7.4.5 Inverse DCT .126
7.5 Shape decoding.126
7.5.1 Higher syntactic structures.127
7.5.2 Macroblock decoding .127
7.5.3 Arithmetic decoding.136
7.5.4 Grayscale Shape Decoding.138
7.6 Motion compensation decoding .140
7.6.1 Padding process .141
7.6.2 Half sample interpolation .144
7.6.3 General motion vector decoding process .144
7.6.4 Unrestricted motion compensation.146
7.6.5 Vector decoding processing and motion-compensation in progressive P-VOP.146
7.6.6 Overlapped motion compensation .148
7.6.7 Temporal prediction structure .150
7.6.8 Vector decoding process of non-scalable progressive B-VOPs.150
7.6.9 Motion compensation in non-scalable progressive B-VOPs.151
7.7 Interlaced video decoding.155
7.7.1 Field DCT and DC and AC Prediction.155
7.7.2 Motion compensation .155
7.8 Sprite decoding .162
7.8.1 Higher syntactic structures.163
7.8.2 Sprite Reconstruction.163
iv

---------------------- Page: 4 ----------------------
© ISO/IEC ISO/IEC 14496-2:1999(E)
7.8.3 Low-latency sprite reconstruction .164
7.8.4 Sprite reference point decoding.165
7.8.5 Warping.165
7.8.6 Sample reconstruction .167
7.9 Generalized scalable decoding.167
7.9.1 Temporal scalability.169
7.9.2 Spatial scalability .172
7.10 Still texture object decoding.175
7.10.1 Decoding of the DC subband.175
7.10.2 ZeroTree Decoding of the Higher Bands .176
7.10.3 Inverse Quantization.181
7.11 Mesh object decoding .188
7.11.1 Mesh geometry decoding.188
7.11.2 Decoding of mesh motion vectors .191
7.12 Face object decoding .193
7.12.1 Frame based face object decoding .193
7.12.2 DCT based face object decoding.194
7.12.3 Decoding of the viseme parameter fap 1.195
7.12.4 Decoding of the viseme parameter fap 2.196
7.12.5 Fap masking .196
7.13 Output of the decoding process.196
7.13.1 Video data .197
7.13.2 2D Mesh data .197
7.13.3 Face animation parameter data .197
8 Visual-Systems Composition Issues .197
8.1 Temporal Scalability Composition .197
8.2 Sprite Composition .198
8.3 Mesh Object Composition.199
9 Profiles and Levels.199
9.1 Visual Object Types .200
9.2 Visual Profiles.202
9.3 Visual Profiles@Levels.202
v

---------------------- Page: 5 ----------------------
ISO/IEC 14496-2:1999(E) © ISO/IEC
9.3.1 Natural Visual .202
9.3.2 Synthetic Visual.202
9.3.3 Synthetic/Natural Hybrid Visual.203
Annex A (normative) Coding transforms .205
A.1 Discrete cosine transform for video texture.205
A.2 Discrete wavelet transform for still texture .205
A.2.1 Adding the mean .205
A.2.2 Wavelet filter .206
A.2.3 Symmetric extension .206
A.2.4 Decomposition level .207
A.2.5 Shape adaptive wavelet filtering and symmetric extension .207
Annex B (normative) Variable length codes and arithmetic decoding .209
B.1 Variable length codes .209
B.1.1 Macroblock type .209
B.1.2 Macroblock pattern .210
B.1.3 Motion vector.212
B.1.4 DCT coefficients.214
B.1.5 Shape Coding .227
B.1.6 Sprite Coding.233
B.1.7 DCT based facial object decoding.234
B.2 Arithmetic Decoding .246
B.2.1 Aritmetic decoding for still texture object .246
B.2.2 Arithmetic decoding for shape decoding .251
B.2.3 Face Object Decoding.254
Annex C (normative) Face object decoding tables and definitions .256
Annex D (normative) Video buffering verifier.269
D.1 Introduction .269
D.2 Video Rate Buffer Model Definition .269
D.3 Comparison between ISO/IEC 14496-2 VBV and the ISO/IEC 13818-2 VBV (Informative).272
D.4 Video Complexity Model Definition .273
D.5 Video Reference Memory Model Definition .274
vi

---------------------- Page: 6 ----------------------
© ISO/IEC ISO/IEC 14496-2:1999(E)
D.6 Interaction between VBV, VCV and VMV (informative).274
D.7 Video Presentation Model Definition (informative).275
Annex E (informative) Features supported by the algorithm.277
E.1 Error resilience.277
E.1.1 Resynchronization .277
E.1.2 Data Partitioning .278
E.1.3 Reversible VLC.278
E.1.4 Decoder Operation.279
E.1.5 Adaptive Intra Refresh (AIR) Method .282
E.2 Complexity Estimation .284
E.3 Resynchronization in Case of Unknown Video Header Format .284
Annex F (informative) Preprocessing and postprocessing .286
F.1 Segmentation for VOP Generation.286
F.1.1 Introduction .286
F.1.2 Description of a combined temporal and spatial segmentation framework .286
F.1.3 References.288
F.2 Bounding Rectangle of VOP Formation .289
F.3 Postprocessing for Coding Noise Reduction .290
F.3.1 Deblocking filter .290
F.3.2 Deringing filter.292
F.3.3 Further issues.294
F.4 Chrominance Decimation and Interpolation Filtering for Interlaced Object Coding.294
Annex G (normative) Profile and level indication and restrictions .296
Annex H (informative) Patent statements .298
H.1 Patent statements .298
Annex I (informative) Bibliography.300
Annex J (normative) View dependent object scalability .301
J.1 Introduction .301
J.2 Decoding Process of a View-Dependent Object .301
J.2.1 General Decoding Scheme.301
J.2.2 Computation of the View-Dependent Scalability parameters.303
J.2.3 VD mask computation .304
vii

---------------------- Page: 7 ----------------------
ISO/IEC 14496-2:1999(E) © ISO/IEC
J.2.4 Differential mask computation.305
J.2.5 DCT coefficients decoding.305
J.2.6 Texture update.305
J.2.7 IDCT .306
Annex K (normative) Decoder configuration information.307
K.1 Introduction .307
K.2 Description of the set up of a visual decoder (informative).307
K.2.1 Processing of decoder configuration information.308
K.3 Specification of decoder configuration information.309
K.3.1 VideoObject .309
K.3.2 StillTextureObject.309
K.3.3 MeshObject .309
K.3.4 FaceObject .310
Annex L (informative) Rate control.311
L.1 Frame Rate Control.311
L.1.1 Introduction .311
L.1.2 Description.311
L.1.3 Summary .
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.