European character repertoires and their coding - 8-bit single-byte coding

This Technical Specificationspecifies the graphic char-ac-ter repertoires and their single-byte coding, which are available for use for information inter-change between information processing systems and for use within such systems, in the scripts that are commonly used by the members of CEN/CENELEC and the Institutions of the European Union and the European Free Trade Association.
This Technical Specificationdoes not specify the interchange of information using a telematic service. The character repertoire and the coding used by a telematic service are defined by the specification of that service. The transmission of information based on the specifications of this Technical Specificationusing a telematic service may necessitate an adaptation of the number of characters of a repertoire (repertoire transformation function) or a change to the coding (code transformation function).

Informationstechnik - Europäische Zeichenvorräte und deren Codierung - 8-Bit-Einzelbyte-Codierung

Diese Technische Spezifikation legt die Schriftzeichenvorräte sowie deren Einzelbyte-Codierungen der Sprachen fest, die von den CEN/CENELEC-Mitgliedern und den Institutionen der Europäischen Union und der Europäischen Freihandelszone bevorzugt verwendet werden und die für den Informationsaustausch zwischen Informationsverarbeitungssystemen und für die Anwendung innerhalb dieser Systeme zur Verfügung stehen.
Diese Technische Spezifikation trifft keine Festlegungen hinsichtlich des Austausches von Informationen in oder mit Telematikdiensten. Der in einem solchen Dienst verwendete Zeichenvorrat und dessen Codierung sind in den Spezifikationen des Telematikdienstes festgelegt. Werden dieser Technischen Spezifikation entsprechende Informationen mit Hilfe eines Telematikdienstes übermittelt, kann es nötig werden, die Anzahl der Zeichen in einem Zeichenvorrat anzupassen (Zeichenvorrats-Umsetzungsfunktion) oder die Codierung zu ändern (Code-Umsetzungsfunktion).

Nabori evropskih znakov in njihovo kodiranje – kodiranje v 8-bitne besede

General Information

Status
Published
Publication Date
13-May-2003
Current Stage
6060 - Definitive text made available (DAV) - Publishing
Start Date
14-May-2003
Completion Date
14-May-2003

Buy Standard

Technical specification
TS CEN/TS 1923:2003
English language
50 pages
sale 10% off
Preview
sale 10% off
Preview
e-Library read for
1 day

Standards Content (Sample)


SLOVENSKI STANDARD
01-oktober-2003
Nabori evropskih znakov in njihovo kodiranje – kodiranje v 8-bitne besede
European character repertoires and their coding - 8-bit single-byte coding
Informationstechnik - Europäische Zeichenvorräte und deren Codierung - 8-Bit-
Einzelbyte-Codierung
Ta slovenski standard je istoveten z: CEN/TS 1923:2003
ICS:
35.040 Nabori znakov in kodiranje Character sets and
informacij information coding
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.

TECHNICAL SPECIFICATION
CEN/TS 1923
SPÉCIFICATION TECHNIQUE
TECHNISCHE SPEZIFIKATION
May 2003
ICS 35.040
Supersedes EN 1923:1998
English version
European character repertoires and their coding - 8-bit single-
byte coding
This Technical Specification (CEN/TS) was approved by CEN on 16 October 2002 for provisional application.
The period of validity of this CEN/TS is limited initially to three years. After two years the members of CEN will be requested to submit their
comments, particularly on the question whether the CEN/TS can be converted into a European Standard.
CEN members are required to announce the existence of this CEN/TS in the same way as for an EN and to make the CEN/TS available. It
is permissible to keep conflicting national standards in force (in parallel to the CEN/TS) until the final decision about the possible
conversion of the CEN/TS into an EN is reached.
CEN members are the national standards bodies of Austria, Belgium, Czech Republic, Denmark, Finland, France, Germany, Greece,
Hungary, Iceland, Ireland, Italy, Luxembourg, Malta, Netherlands, Norway, Portugal, Slovakia, Spain, Sweden, Switzerland and United
Kingdom.
EUROPEAN COMMITTEE FOR STANDARDIZATION
COMITÉ EUROPÉEN DE NORMALISATION
EUROPÄISCHES KOMITEE FÜR NORMUNG
Management Centre: rue de Stassart, 36  B-1050 Brussels
© 2003 CEN All rights of exploitation in any form and by any means reserved Ref. No. CEN/TS 1923:2003 E
worldwide for CEN national Members.

Contents
Foreword.3
1 Scope .4
2 Normative references .4
3 Terms and definitions.4
4 Conformance.5
4.1 Conformance for information interchange.5
4.2 Conformance of devices .5
4.2.1 General.5
4.2.2 Device description .5
4.2.3 Originating devices.5
4.2.4 Receiving devices.5
5 Scenario description .5
5.1 Repertoires .5
5.2 Combinations of repertoires and their coding.5
6 Repertoire descriptions.6
6.1 Latin script.6
6.2 Greek script .6
6.3 Cyrillic script .6
6.4 The symbols repertoire .6
7 Coding methods applicable.7
7.1 8-bit single-byte coding.7
7.2 Formation of G-sets.7
7.2.1 Invariant-Latin repertoire .7
7.2.2 Initial-Latin repertoire .7
7.2.3 Basic-Latin-a repertoire.7
7.2.4 Basic-Latin-b repertoire .7
7.2.5 Basic-Latin-c repertoire.7
7.2.6 Large-Latin-8-a repertoire .8
7.2.7 Large-Latin-8-b repertoire.8
7.2.8 Celtic repertoire.8
7.2.9 Romanian repertoire.8
7.2.10 Basic-Greek repertoire .8
7.2.11 Basic-Cyrillic repertoire .8
7.2.12 Symbols repertoire .8
8 Identification of options .8
Annex A (informative) Specifications of referenced ISO-IR code tables.10
Annex B (informative) CEN/TS 1923 options compared to ISO/IEC 7/8-bit standards.21
Annex C (informative) Code table illustrations .22
Foreword
This document (CEN/TS 1923:2003) has been prepared by Technical Committee CEN/TC 304, "Information and
communications technology - European localization requirements", the secretariat of which is held by SIS.
According to the CEN/CENELEC Internal Regulations, the national standards organizations of the following coun-
tries are bound to announce this European Standard: Austria, Belgium, Czech Republic, Denmark, Finland, France,
Germany, Greece, Hungary, Iceland, Ireland, Italy, Luxembourg, Malta, Netherlands, Norway, Portugal, Slovakia,
Spain, Sweden, Switzerland and the United Kingdom.
This Technical Specification is a revision of the European Standard EN 1923:1998, which it cancels and replaces.
The main purpose of the revision is to include, and thereby to publicize the availability of, 8-bit code tables devel-
oped after the publication of EN 1923:1998; in particular the code table of ISO/IEC 8859-15 and the tables of other
additions to the ISO/IEC 8859 series. Although CEN/TC 304 decided that a revision of the contents of EN
1923:1998 was necessary, some uncertainty existed whether the standard as such is needed by the data commu-
nity in the present-day direction towards multi-octet coding schemes. The committee therefore decided to classify
the revised document as a Technical Specification. Its usefulness will thereby become evaluated.
The contents of this document differs from that of EN 1923:1998 in the following respects:
– Extensive editorial changes have been made to the text for conformance with present CEN/CENELEC drafting
rules.
– Additional coding scheme options have been introduced, corresponding to ISO/IEC 8859 parts 14, 15 and 16
(Latin-8, Latin-9 and Latin-10), and also to ISO-IR 204 ("Latin-1 alternative with Euro").
– For consistency, the definitions of all options now refer to registrations according to ISO 2375:1985 in the ISO
"International register of coded character sets to be used with escape sequences". Relationships to ISO/IEC
10646-1:2000 specifications are also given, to the extent applicable.
– An informative Annex A has been added, containing ISO/IEC 10646-1:2000 identifications for all characters in
the options character sets.
– An informative Annex B has been added, listing relationships to ISO/IEC 7/8-bit coding standards.
– An informative Annex C has been added, illustrating the code tables for all options.
3.2
byte
1 Scope
bit string that is operated upon as a unit
This Technical Specification specifies the graphic
3.3
character repertoires and their single-byte coding,
character
which are available for use for information interchange
member of a set of elements used for the organiza-
between information processing systems and for use
tion, control, or representation of data
within such systems, in the scripts that are commonly
used by the members of CEN/CENELEC and the In-
3.4
stitutions of the European Union and the European
coded-character-data-element
Free Trade Association.
CC-data-element
element of interchanged information that is specified
This Technical Specification does not specify the in-
to consist of a sequence of coded representations of
terchange of information using a telematic service.
characters, in accordance with one or more identified
The character repertoire and the coding used by a
standards for coded character sets
telematic service are defined by the specification of
that service. The transmission of information based on
3.5
the specifications of this Technical Specification using
coded character set
a telematic service may necessitate an adaptation of
code
the number of characters of a repertoire (repertoire
set of unambiguous rules that establishes a character
transformation function) or a change to the coding
set and the one-to-one relationship between the char-
(code transformation function).
acters of the set and their bit combinations
3.6
2 Normative references
code extension
techniques for the encoding of characters that are not
This Technical Specification incorporates by dated or
included in the character set of a given code
undated reference, provisions from other publications.
These normative references are cited at the appropri-
3.7
ate places in the text and the publications are listed
code table
hereafter. For dated references, subsequent amend-
table showing the characters allocated to each bit
ments to or revisions of any of these publications ap-
combination in a code
ply to this Technical Specification only when incorpo-
rated in it by amendment or revision. For undated ref-
3.8
erences the latest edition of the publication referred to
control character
applies.
control function the coded representation of which
consists of a single bit combination
ISO/IEC 2022:1994, Information technology – Char-
acter code structure and extension techniques.
3.9
control function
ISO 2375:1985, Data processing – Procedure for
action that affects the recording, processing, trans-
registration of escape sequences
mission or interpretation of data, and that has a coded
representation consisting of one or more bit combina-
ISO/IEC 4873:1991, Information technology – ISO
tions
8-bit code for information interchange – Structure and
rules for implementation.
3.10
to designate
to identify a set of characters that are to be repre-
3 Terms and definitions sented, in some cases immediately and in others on
the occurrence of a further control function, in a pre-
scribed manner
For the purposes of this Technical Specification, the
following terms and definitions apply:
3.11
device
component of information processing equipment
3.1
which can transmit and/or receive coded in-formation
bit combination
within CC-data-elements; it may be an input/output
ordered set of bits used for the representation of
device in the conventional sense, or a process such
characters
as an application program or gateway function
3.12 4.2.2 Device description
escape sequence
string of bit combinations that is used for control pur- A device that conforms to this Technical Specification
poses in code extension procedures; the first of these
shall be the subject of a description that identifies the
bit combinations represents the control function ES- means by which the user may supply characters to
CAPE the device, or may recognize them whe
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.