SIST EN 62356-2:2006
(Main)Video recording - 12,65 mm type D-11 format -- Part 2: Picture compression and data stream
Video recording - 12,65 mm type D-11 format -- Part 2: Picture compression and data stream
This International Standard specifies the compression of a high-definition source format to a dual-channel packetized data stream format which is suitable for recording on disc and tape storage devices including the Type D-11 tape recorder. The specification includes a number of basic packetizing operations including the shuffling of the source data prior to compression, both to aid compression performance and to allow error concealment processing in the decoder. The standard also includes the processes required to decode the compressed Type D-11 packetized data format into a high-definition output signal.
This standard supports high-definition source formats using 1 920 × 1 080 pixels and the sampling structures as specified in SMPTE 274M and RP 211 at the following picture rates:
• 24/1,001/PsF;
• 24/PsF;
• 25/PsF;
• 30/1,001/PsF;
• 50/I;
• 60/1,001/I
where 'PsF' indicates Progressive segmented Frame and 'I' indicates Interlaced. The data packet format specified by this standard is used as the source data stream for the associated document which maps this Type D-11 packetized data-stream format together with AES3 data over SDTI.
Videoaufzeichnung - D-11-Format mit 12,65 mm -- Teil 2: Bildkompression und Datenstrom
Enregistrement Vidéo - Format 12,65 mm de type D11 -- Partie 2: Flux de données et compression d'image
specifies the compression of a high-definition source format to a dual-channel packetized data stream format which is suitable for recording on disc and tape storage devices including the Type D-11 tape recorder
Video snemanje – Format vrste D-11 z magnetnim trakom s širino 12,65 mm – 2. del: Komprimiranje slike in podatkovni tok (IEC 62356-2:2003)
General Information
Standards Content (Sample)
SLOVENSKI SIST EN 62356-2:2006
STANDARD
januar 2006
Video snemanje – Format vrste D-11 z magnetnim trakom s širino 12,65 mm –
2. del: Komprimiranje slike in podatkovni tok (IEC 62356-2:2003)
(istoveten EN 62356-2:2004)
Video recording – 12,65 mm type D-11 format – Part 2: Picture compression and
data stream (IEC 62356-2:2003)
ICS 33.160.40 Referenčna številka
SIST EN 62356-2:2006(en)
© Standard je založil in izdal Slovenski inštitut za standardizacijo. Razmnoževanje ali kopiranje celote ali delov tega dokumenta ni dovoljeno
---------------------- Page: 1 ----------------------
EUROPEAN STANDARD EN 62356-2
NORME EUROPÉENNE
EUROPÄISCHE NORM October 2004
ICS 33.160.40
English version
Video recording –
12,65 mm type D-11 format
Part 2: Picture compression and data stream
(IEC 62356-2:2003)
Enregistrement Vidéo – Videoaufzeichnung –
Format 12,65 mm de type D11 D-11-Format mit 12,65 mm
Partie 2: Flux de données et compression Teil 2: Bildkompression und Datenstrom
d'image (IEC 62356-2:2003)
(CEI 62356-2:2003)
This European Standard was approved by CENELEC on 2004-09-01. CENELEC members are bound to
comply with the CEN/CENELEC Internal Regulations which stipulate the conditions for giving this European
Standard the status of a national standard without any alteration.
Up-to-date lists and bibliographical references concerning such national standards may be obtained on
application to the Central Secretariat or to any CENELEC member.
This European Standard exists in three official versions (English, French, German). A version in any other
language made by translation under the responsibility of a CENELEC member into its own language and
notified to the Central Secretariat has the same status as the official versions.
CENELEC members are the national electrotechnical committees of Austria, Belgium, Cyprus, Czech
Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia,
Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Slovakia, Slovenia, Spain, Sweden,
Switzerland and United Kingdom.
CENELEC
European Committee for Electrotechnical Standardization
Comité Européen de Normalisation Electrotechnique
Europäisches Komitee für Elektrotechnische Normung
Central Secretariat: rue de Stassart 35, B - 1050 Brussels
© 2004 CENELEC - All rights of exploitation in any form and by any means reserved worldwide for CENELEC members.
Ref. No. EN 62356-2:2004 E
---------------------- Page: 2 ----------------------
EN 62356-2:2004 - 2 -
Foreword
The text of the International Standard IEC 62356-2:2003, prepared by IEC TC 100, Audio, video and
multimedia systems and equipment, was submitted to the formal vote and was approved by
CENELEC as EN 62356-2 on 2004-09-01 without any modification.
The following dates were fixed:
– latest date by which the EN has to be implemented
at national level by publication of an identical
national standard or by endorsement (dop) 2005-09-01
– latest date by which the national standards conflicting
with the EN have to be withdrawn (dow) 2007-09-01
Annex ZA has been added by CENELEC.
__________
Endorsement notice
The text of the International Standard IEC 62356-2:2003 was approved by CENELEC as a European
Standard without any modification.
__________
---------------------- Page: 3 ----------------------
- 3 - EN 62356-2:2004
Annex ZA
(normative)
Normative references to international publications
with their corresponding European publications
The following referenced documents are indispensable for the application of this document. For dated
references, only the edition cited applies. For undated references, the latest edition of the referenced
document (including any amendments) applies.
NOTE Where an international publication has been modified by common modifications, indicated by (mod), the relevant
EN/HD applies.
Publication Year Title EN/HD Year
SMPTE 292M 1998 BIT-serial digital interface for high- - -
definition television systems
SMPTE 274M 1998 Television - 1920 x 1080 scanning and - -
analog and parallel digital interfaces for
multiple picture rates
SMPTE RP 211 2000 Implementation of 24P, 25P and 30P - -
segmented frames for 1920 x 1080
production format
SMPTE 12M 1999 Television, audio and film - Time and - -
control code
SMPTE RP 188 1999 Transmission of time code and control - -
code in the ancillary data space of a
digital television data stream
---------------------- Page: 4 ----------------------
INTERNATIONAL IEC
STANDARD
62356-2
First edition
2003-11
Video recording –
12,65 mm type D-11 format –
Part 2:
Picture compression and data stream
IEC 2003 Copyright - all rights reserved
No part of this publication may be reproduced or utilized in any form or by any means, electronic or
mechanical, including photocopying and microfilm, without permission in writing from the publisher.
International Electrotechnical Commission, 3, rue de Varembé, PO Box 131, CH-1211 Geneva 20, Switzerland
Telephone: +41 22 919 02 11 Telefax: +41 22 919 03 00 E-mail: inmail@iec.ch Web: www.iec.ch
PRICE CODE
Commission Electrotechnique Internationale
XA
International Electrotechnical Commission
Международная Электротехническая Комиссия
For price, see current catalogue
---------------------- Page: 5 ----------------------
– 2 – 62356-2 IEC:2003(E)
SOMMAIRE
FOREWORD . 4
1 Scope . 6
2 Normative references. 6
3 Introduction. 7
4 Encoding . 7
4.1 Overview . 7
4.2 Pre-processing . 9
4.3 Shuffling .13
4.4 Field-frame decision .15
4.5 Discrete Cosine Transform (DCT) .17
4.6 Rate control.18
4.7 Quantization .19
4.8 Entropy coding .19
4.9 Picture data packing .23
4.10 Auxiliary data.29
5 Decoding .32
5.1 Overview .32
5.2 Unpacking .32
5.3 Entropy decoding.32
5.4 Inverse quantization .32
5.5 Inverse DCT .33
5.6 De-shuffling .33
5.7 Post-processing.33
Annex A (normative) Subsampling filter.34
Annex B (normative) Channel shuffling.36
Annex C (normative) .39
Annex D (normative) VLC tables .42
Bibliography.55
Figure 1 – Encoding block diagram . 9
Figure 2 – Sampling relationships for 1 080/I and 1 080/PsF source and subsampled
systems .11
Figure 3 – Channel division of subsampled 1 080/I and 1 080/PsF signals .12
Figure 4 – Channel distribution.13
Figure 5 – Code blocks and basic blocks in channel.14
Figure 6 – Shuffle block format .14
Figure 7 – Shuffle block header byte descriptions .15
Figure 8 – Frame-mode chrominance DCT block reformat.16
Figure 9 – Field-mode DCT block reformat .17
Figure 10 – DCT coefficient encoding example.22
---------------------- Page: 6 ----------------------
62356-2 IEC:2003(E) – 3 –
Figure 11 – Basic block format .23
Figure 12 – Frame-mode luminance and chrominance cells .23
Figure 13 – Field-mode luminance and chrominance cells.24
Figure 14 – Framemode placement for Offset Mode and Offset Index bits .26
Figure 15 – Fieldmode placement for Offset Mode and Offset Index bits .26
Figure 16 – Packing when quantizer base = 61 or less .27
Figure 17 – Packing when quantizer base = 63 .28
Figure 18 – Auxiliary basic block format .29
Figure 19 – Auxiliary data words .31
Figure 20 – Decoding block diagram .32
Figure A.1 – Template for insertion-loss frequency characteristic (Y) .34
Figure A.2 – Passband ripple tolerance (Y) .34
Figure A.3 – Template for insertion-loss frequency characteristic (C ,C ).35
B R
Figure A.4 – Passband ripple tolerance (CB,CR).35
Figure B.1 – 8*8 block segmentation in each channel.36
Figure B.2 –Block allocation within a segment.37
Table 1 – Data rates associated with source picture rates . 7
Table 2 – Definition of signal sampling parameters .10
Table 3 – Data representation .17
Table 4 – DC quantization divisors .19
Table 5 – AC quantization divisors .19
Table 6 – Offset mode and offset index .20
Table 7 – DC coefficient fixed precision .20
Table 8 – Example luminance a.c. coefficient encoding.21
Table 9 – Auxiliary basic block data .29
Table 10 – MSB inversion .33
Table B.1 – Equation for TMP1 .37
Table B.2 – Values of START_OFFSET for luminance planes .38
Table B.3 – Values of START_OFFSET for chrominance planes .38
Table C.1 – Dynamic range of coefficients .40
Table C.2 – Coefficients for d.c. only transforms .40
Table C.3 – 8 × 8 zigzag scan .40
H V
Table C.4 – 4 × 8 zigzag scan .41
H V
Table C.5 – 8 × 4 zigzag scan .41
H V
---------------------- Page: 7 ----------------------
– 4 – 62356-2 IEC:2003(E)
INTERNATIONAL ELECTROTECHNICAL COMMISSION
____________
VIDEO RECORDING – 12,65 MM TYPE D-11 FORMAT −−−−
Part 2: Picture compression and data stream
FOREWORD
1) The International Electrotechnical Commission (IEC) is a worldwide organization for standardization comprising
all national electrotechnical committees (IEC National Committees). The object of IEC is to promote
international co-operation on all questions concerning standardization in the electrical and electronic fields. To
this end and in addition to other activities, IEC publishes International Standards, Technical Specifications,
Technical Reports, Publicly Available Specifications (PAS) and Guides (hereafter referred to as “IEC
Publication(s)”). Their preparation is entrusted to technical committees; any IEC National Committee interested
in the subject dealt with may participate in this preparatory work. International, governmental and non-
governmental organizations liaising with the IEC also participate in this preparation. IEC collaborates closely
with the International Organization for Standardization (ISO) in accordance with conditions determined by
agreement between the two organizations.
2) The formal decisions or agreements of IEC on technical matters express, as nearly as possible, an international
consensus of opinion on the relevant subjects since each technical committee has representation from all
interested IEC National Committees.
3) IEC Publications have the form of recommendations for international use and are accepted by IEC National
Committees in that sense. While all reasonable efforts are made to ensure that the technical content of IEC
Publications is accurate, IEC cannot be held responsible for the way in which they are used or for any
misinterpretation by any end user.
4) In order to promote international uniformity, IEC National Committees undertake to apply IEC Publications
transparently to the maximum extent possible in their national and regional publications. Any divergence
between any IEC Publication and the corresponding national or regional publication shall be clearly indicated in
the latter.
5) IEC provides no marking procedure to indicate its approval and cannot be rendered responsible for any
equipment declared to be in conformity with an IEC Publication.
6) All users should ensure that they have the latest edition of this publication.
7) No liability shall attach to IEC or its directors, employees, servants or agents including individual experts and
members of its technical committees and IEC National Committees for any personal injury, property damage or
other damage of any nature whatsoever, whether direct or indirect, or for costs (including legal fees) and
expenses arising out of the publication, use of, or reliance upon, this IEC Publication or any other IEC
Publications.
8) Attention is drawn to the Normative references cited in this publication. Use of the referenced publications is
indispensable for the correct application of this publication.
9) Attention is drawn to the possibility that some of the elements of this IEC Publication may be the subject of
patent rights. IEC shall not be held responsible for identifying any or all such patent rights.
International Standard IEC 62356-2 has been prepared by IEC technical committee 100:
Audio, video and multimedia systems and equipment.
It was submitted to the national committees for voting under the Fast Track Procedure as the
following documents:
CDV Report on voting
100/630/CDV 100/700/RVC
Full information on the voting for the approval of this standard can be found in the report on
voting indicated in the above table.
This publication has been drafted in accordance with the ISO/IEC Directives, Part 2.
---------------------- Page: 8 ----------------------
62356-2 IEC:2003(E) – 5 –
The committee has decided that the contents of this publication will remain unchanged until
2008-11. At this date, the publication will be
• reconfirmed;
• withdrawn;
• replaced by a revised edition, or
• amended.
---------------------- Page: 9 ----------------------
– 6 – 62356-2 IEC:2003(E)
VIDEO RECORDING – 12,65 MM TYPE D-11 FORMAT −−
−−
Part 2: Picture compression and data stream
1 Scope
This International Standard specifies the compression of a high-definition source format to a
dual-channel packetized data stream format which is suitable for recording on disc and tape
storage devices including the Type D-11 tape recorder. The specification includes a number
of basic packetizing operations including the shuffling of the source data prior to compression,
both to aid compression performance and to allow error concealment processing in the
decoder. The standard also includes the processes required to decode the compressed Type
D-11 packetized data format into a high-definition output signal.
This standard supports high-definition source formats using 1 920 × 1 080 pixels and the
sampling structures as specified in SMPTE 274M and RP 211 at the following picture rates:
• 24/1,001/PsF;
• 24/PsF;
• 25/PsF;
• 30/1,001/PsF;
• 50/I;
• 60/1,001/I
where 'PsF' indicates Progressive segmented Frame and 'I' indicates Interlaced.
The data packet format specified by this standard is used as the source data stream for the
associated document which maps this Type D-11 packetized data-stream format together with
AES3 data over SDTI.
2 Normative references
The following referenced documents are indispensable for the application of this document.
For dated references, only the edition cited applies. For undated references, the latest edition
of the referenced document (including any amendments) applies.
SMPTE 292M:1998, Television – Bit-Serial Digital Interface for High-Definition Television
Systems
SMPTE 274M:1998, Television – 1920 × 1080 Scanning and Analog and Parallel Digital
Interfaces for Multiple Picture Rates
SMPTE RP 211:2000, Implementation of 24P, 25P and 30P Segmented Frames for 1920 ×
1080 Production Format
SMPTE 12M:1999, Television, Audio and Film-Time and Control Code
SMPTE RP 188:1999, Transmission of Time Code and Control Code in the Ancillary Data
Space of a Digital Television Data Stream
---------------------- Page: 10 ----------------------
62356-2 IEC:2003(E) – 7 –
3 Introduction
This standard specifies the encoding and decoding of high-definition source formats via
compression into a bit rate in the range 112~140Mb/s for recording on a Type D-11 digital
tape recorder. The recorded bit rate is related to the source picture rate according to Table 1.
Table 1 – Data rates associated with source picture rates
Base data rate
Picture rate
Mb/s
24/1,001/PsF 111,863
24/PsF 111,975
25/PsF 116,640
30/1,001/PsF 139,828
50/I 116,640
60/1,001/I 139,828
In common with other compression systems, the Type D-11 encoding process uses intra-
frame coding (i.e. the coding is bound by the frame period) using the Discrete Cosine
Transform (DCT) to provide the data de-correlation required for efficient compression. The
coefficients are quantized and variable length coded (VLC) to produce the basic output data
format.
The source pictures are subsampled prior to compression coding. This reduces the number of
coded pixels and allows the number of bits-per-pixel value to be raised in proportion. The
luminance source sampling grid of 1 920 × 1 080 pixels is reduced to 1 440 × 1 080 pixels.
For each chrominance channel, the source sampling grid of 960 × 1 080 pixels is reduced to
480 × 1 080 pixels. In the decoder, the output pixel sample grid is restored back to the source
format of 1 920 × 1 080 pixels by interpolation following the compression decoding process.
The compressed data format specified by the output of the compression encoder is of a form
which allows direct mapping into the basic block structure as defined in the Type D-11 digital
recorder document.
4 Encoding
4.1 Overview
Type D-11 source data for compression shall comprise only the production aperture area as
defined by SMPTE 274M.
NOTE DCT coding uses a data block size which allows exactly 1 080 lines to be coded.
The source formats comprise luminance (Y) and chrominance (C , C ) component signals as
B R
defined by SMPTE 274M and SMPTE RP 211.
Type D-11 source picture rates for compression shall be constrained to the following values:
• 24/1 001 frames per second in the segmented format as defined by SMPTE RP 211;
• 24 frames per second in the segmented format as defined by SMPTE RP 211;
• 25 frames per second in the segmented format as defined by SMPTE RP 211;
• 30/1 001 frames per second in the segmented format as defined by SMPTE RP 211;
• 50 fields per second in the interlaced format (a.k.a. 50/I) as defined by SMPTE 274M;
---------------------- Page: 11 ----------------------
– 8 – 62356-2 IEC:2003(E)
• 60/1 001 fields per second in the interlaced format (a.k.a. 60/I) as defined by SMPTE
274M.
The active picture data for compression shall be pre-filtered and then subsampled from a
source representation to a subsampled representation.
The reduced active data shall then be split into two identical channels for processing as
shown in Figure 1 and Table 2.
The total picture data in each channel shall be divided into 20 250 8*8 blocks, each formed
from eight samples of eight consecutive lines in a frame.
The 8*8 blocks for each channel shall then be shuffled within the frame boundary to produce
270 code blocks each comprising 45 luminance (Y) 8*8 blocks and 30 chrominance 8*8 blocks
(15C and 15 C ).
B R
The picture data in each code block shall be compressed by the application of the DCT,
quantization and VLC encoding. Each code block shall be separately encoded, and there
shall be no data-sharing between code blocks. The data from the compression output shall be
packed into the code block space of 1 080 bytes.
Each code block shall be segmented into five basic blocks each comprising 216 compressed
data bytes. Each basic block nominally contains the compressed data for nine luminance 8*8
blocks and six chrominance 8*8 blocks (3 C and 3 C ). Data overflow from one basic block
B R
can be shared with other basic blocks in the same code block.
NOTE The 8*8 blocks may be coded by a single 8*8 DCT block, by two 8*4 DCT blocks or by two 4*8 DCT blocks
depending on the mode of operation (see 4.4).
The 270 code blocks for each channel shall be divided into six equal segments of 45 code
blocks per segment. Each segment shall contain one auxiliary basic block prior to the
compressed data basic blocks. All auxiliary basic blocks in one channel shall be identical with
the exception of the segment identification number. The auxiliary basic block shall contain
utility data for the segment. The distribution of a channel into code blocks and basic blocks is
illustrated in Figure 5.
All basic blocks shall have a total length of 219 bytes. The data for the basic blocks in a code
block shall be 216 bytes in length, allowing 3 bytes for the basic block header. The data for
the auxiliary basic block in each segment shall be 217 bytes in length, allowing 2 bytes for the
basic block header.
NOTE The ‘*’ symbol is used to denote multiplication.
---------------------- Page: 12 ----------------------
62356-2 IEC:2003(E) – 9 –
AUX
data
Channel 0: even samples
Entropy
Field-
DCT Quantize Pack
coding
frame
decision
Shuffle
Channel 0: encoded picture
and AUX data
Rate Control
AUX
data
Sub-
sample
Source
picture
Entropy
DCT Quantize Pack
coding
Channel 1: encoded picture
Shuffle
and AUX data
Field-
frame Rate control
decision
Channel 1: odd samples
Figure 1 – Encoding block diagram
4.2 Pre-processing
4.2.1 Source picture
The source picture shall be the production aperture as defined in SMPTE 274M having a
luminance structure of 1 920*1 080 pixels and a multiplexed chrominance structure of 960*1
080 pixels for each chrominance component.
The source interface has a sample resolution of 10 bits which shall be reduced to 8 bits after
the horizontal subsampling process.
4.2.2 Vertical sampling process
For 1 080/I systems, 540 lines for Y, C , C signals from each interlaced field shall be
B R
processed. The coding lines for each interlaced field are illustrated in Figure 2.
For 1 080/PsF systems, 1 080 lines for Y, C , C signals from each whole frame shall be
B R
processed. The coding lines for the segmented frame are illustrated in Figure 2.
4.2.3 Horizontal subsampling process
For the luminance component, all the 1 920 active samples per line shall be subsampled to 1
440 samples per line after a bandwidth limitation filtering process.
For each of the two chrominance components, all the 960 active samples per line shall be
subsampled to 480 samples per line after a bandwidth limitation filtering process.
The basic sample parameters for luminance (Y) and the two chrominance signals (C , C ) of
B R
the source and sub-sampled component signals are described in Table 2.
---------------------- Page: 13 ----------------------
– 10 – 62356-2 IEC:2003(E)
Figure 2 depicts the re-sampled spatial positions of the subsampled components for 1 080/I
and 1 080/PsF line-scanning systems.
The subsampled data in each frame shall be divided into two identical channels: an even
sample channel and an odd sample channel as illustrated in Figure 3.
Let r be the horizontal sample position number in the subsampled Y, C , C source.
B R
For Y samples r = 0, 1, 2, 3, …. , 1 439
For C , C samples r = 0, 1, 2, 3, …. , 479
B R
Those samples that have 'r' as an even number, depicted as a white circle in Figure 3, shall
be distributed to channel 0.
Those samples that have 'r' as an odd number, depicted as a grey circle in Figure 3, shall be
distributed to channel 1.
Each luminance (Y) sample channel has rectangular area of 720 samples by 1 080 lines.
Each chrominance (C , C ) sample channel has a rectangular area of 240 samples by 1 080
B R
lines respectively as illustrated in Figure 4.
Figure 3 shows the overall structure of the subsampling process.
To avoid alias artifacts, the source format shall be pre-filtered with a filter operating in the
horizontal dimension only. The templates for the overall filtering characteristics of the sub-
sampling process are defined in Annex A .
NOTE The filtering and subsampling processes are implemented as one combined operation.
Table 2 – Definition of signal sampling parameters
Parameter Source sampling Subsampling Channel division
Y 1 920 1 440 720
Number of samples per
line
C , C 960 480 240
B R
Number of active lines
1 080 1 080 1 080
per frame
Quantization 10-bit (0.1 023) 8-bit (0.255) 8-bit (0.255)
Peak
4 to 1 019 1 to 254 1 to 254
range
Peak white level: 940 Peak white level: 235 Peak white level: 235
Sample levels
Y Black level: 64 Black level: 16 Black level: 16
Total levels: 877 Total levels: 220 Total levels: 220
Signal level: 512 ± 448 Signal level: 128 ± 112 Signal level: 128 ¦ 112
C , C
B R
Total levels: 897 Total levels: 225 Total levels: 225
---------------------- Page: 14 ----------------------
62356-2 IEC:2003(E) – 11 –
T/3
T
Source luminance (Y)
Line 21
st
1 Line field 1
Line 584
st
1 Line field 2
Line 22
Line 585
Sub-sampled
4T/3
luminance (Y)
1st Line field 1 Line 21
1st Line field 2 Line 584
Line 22
Line 585
Source
2T
chrominance (C , C )
B R
1st Line field 1 Line 21
1st Line field 2 Line 584
Line 22
Line 585
Sub-sampled
4T
chrominance (C , C )
B R
1st Line field 1
Line 21
1st Line field 2
Line 584
Line 22
First pixel in active
Line
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.