Information technology — 8-bit single-byte coded graphic character sets — Part 9: Latin alphabet No. 5

This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 5. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Faroese, Finnish, French (with restrictions, see Annex A.1, Notes), Frisian, Galician, German, Greenlandic, Irish Gaelic (new orthography), Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish and Turkish. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.

Technologies de l'information — Jeux de caractères graphiques codés sur un seul octet — Partie 9: Alphabet latin no 5

General Information

Status
Published
Publication Date
20-Jan-1999
Current Stage
9093 - International Standard confirmed
Completion Date
27-Aug-2020
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 8859-9:1999 - Information technology -- 8-bit single-byte coded graphic character sets
English language
10 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 8859-9
Second edition
1999-01-15
Information technology — 8-bit single-byte
coded graphic character sets —
Part 9:
Latin alphabet No. 5
Technologies de l'information — Jeux de caractères graphiques codés sur
un seul octet —
o
Partie 9: Alphabet latin n 5
Reference number
B C
ISO/IEC 8859-9:1999(E)

---------------------- Page: 1 ----------------------
ISO/IEC 8859-9:1999 (E)
Contents
Page
Foreword . . iii
Introduction . . iv
1 Scope . 1
2 Conformance . 1
3 Normative references . 1
4 Definitions . 2
5 Notation, code table and names . 2
6 Specification of the coded character set . 3
7 Identification of the character set . 6
Annex A: Coverage of languages by parts 1 to 10 of
ISO/IEC 8859 . 7
Annex B: Main differences between the first edition and
this second edition of this part of ISO/IEC 8859 . 9
Annex C: Bibliography . 10
© ISO/IEC 1999
All rights reserved. Unless otherwise specified, no part of this publication may be
reproduced or utilized in any form or by any means, electronic or mechanical,
including photocopying and microfilm, without permission in writing from the publisher.
ISO/IEC Copyright Office Case Postale 56 CH-1211 Genève 20 Switzerland
• • •
Printed in Switzerland
ii

---------------------- Page: 2 ----------------------
© ISO/IEC ISO/IEC 8859-9:1999 (E)
Foreword
ISO (the International Organization for Standardization) and IEC (the
International Electrotechnical Commission) form the specialized
system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of
International Standards through technical committees established by
the respective organization to deal with particular fields of technical
activity. ISO and IEC technical committees collaborate in fields of
mutual interest. Other international organizations, governmental and
nongovernmental, in liaison with ISO and IEC, also take part in the
work.
In the field of information technology, ISO and IEC have established
a joint technical committee, ISO/IEC JTC1. Draft International
Standards adopted by the joint technical committee are circulated to
national bodies for voting. Publication as an International Standard
requires approval by at least 75% of the national bodies casting a
vote.
International Standard ISO/IEC 8859-9 was prepared by Joint
Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 2, Coded character sets.
This second edition cancels and replaces the first edition
(ISO/IEC 8859-9:1989) which has been technically revised.
ISO/IEC 8859 consists of the following parts, under the general title
Information technology – 8-bit single-byte coded graphic character
sets:
– Part 1: Latin alphabet No. 1
– Part 2: Latin alphabet No. 2
– Part 3: Latin alphabet No. 3
– Part 4: Latin alphabet No. 4
– Part 5: Latin/Cyrillic alphabet
– Part 6: Latin/Arabic alphabet
– Part 7: Latin/Greek alphabet
– Part 8: Latin/Hebrew alphabet
– Part 9: Latin alphabet No. 5
– Part 10: Latin alphabet No. 6
Annexes A to C of this part of ISO/IEC 8859 are for information only.
iii

---------------------- Page: 3 ----------------------
ISO/IEC 8859-9:1999 (E) © ISO/IEC
Introduction
ISO/IEC 8859 consists of several parts. Each part specifies a set of
up to 191 graphic characters and the coded representation of these
characters by means of a single 8-bit byte. Each set is intended for
use for a particular group of languages.
iv

---------------------- Page: 4 ----------------------
INTERNATIONAL STANDARD © ISO/IEC ISO/IEC 8859-9:1999 (E)
Information technology –
8-bit single-byte coded graphic character sets –
Part 9: Latin alphabet No. 5
coded representations of graphic characters within
1 Scope
that CC-data-element conform to the requirements
This part of ISO/IEC 8859 specifies a set of 191
of clause 6.
coded graphic characters identified as Latin
alphabet No. 5.
2.2 Conformance of devices
This set of coded graphic characters is intended for
A device is in conformance with this part of
use in data and text processing applications and
ISO/IEC 8859 if it conforms to the requirements of
also for information interchange.
2.2.1, and either or both of 2.2.2 and 2.2.3. A claim
of conformance shall identify the document which
The set contains graphic characters used for
contains the description specified in 2.2.1.
general purpose applications in typical office
environments in at least the following languages:
2.2.1 Device description
Albanian, Basque, Breton, Catalan, Danish, Dutch, A device that conforms to this part of ISO/IEC 8859
English, Faroese, Finnish, French (with restrictions, shall be the subject of a description that identifies
see Annex A.1, Notes), Frisian, Galician, German, the means by which the user may supply characters
Greenlandic, Irish Gaelic (new orthography), Italian, to the device, or may recognize them when they are
Latin, Luxemburgish, Norwegian, Portuguese, made available to him, as specified respectively in
Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish 2.2.2 and 2.2.3.
and Turkish.
2.2.2 Originating devices
This set of coded graphic characters may be
An originating device shall allow its user to supply
regarded as a version of an 8-bit code according to
any sequence of characters from those specified in
ISO/IEC 2022 or ISO/IEC 4873 at level 1.
clause 6, and shall be capable of transmitting their
This part of ISO/IEC 8859 may not be used in
coded representations within a CC-data-element.
conjunction with any other parts of ISO/IEC 8859.
If coded characters from more than one part are to 2.2.3 Receiving devices
be used together, by means of code extension
A receiving device shall be capable of receiving and
techniques, the equivalent coded character sets
interpreting any coded representations of characters
from ISO/IEC 10367 should be used instead within
that are within a CC-data-element, and that conform
a version of ISO/IEC 4873 at level 2 or level 3.
to clause 6, and shall make the corresponding
characters available to its user in such a way that
The coded characters in this set may be used in
the user can identify them from among those
conjunction with coded control functions selected
specified there, and can distinguish them from each
from ISO/IEC 6429. However, control functions are
other.
not used to create composite graphic symbols from
two or more graphic characters (see clause 6).
3 Normative references
NOTE – ISO/IEC 8859 is not intended for use with
Telematic services defined by ITU-T. If information coded
The following standards contain provisions which,
according to ISO/IEC 8859 is to be transferred to such
through reference in this text, constitute provisions
services, it will have to conform to the requirements of
of this part of ISO/IEC 8859. At the time of publica-
those services at the access-point.
tion, the editions indicated were valid. All standards
are subject to revision, and parties to agreements
2 Conformance
based on this part of ISO/IEC 8859 are encouraged
to investigate the possibility of applying the most
2.1 Conformance of information interchange
recent editions of the standards indicated below.
A coded-character-data-element (CC-data-element)
Members of IEC and ISO maintain registers of
within coded information for interchange is in
currently valid International Standards.
conformance with this part of ISO/IEC 8859 if all the
1

---------------------- Page: 5 ----------------------
ISO/IEC 8859-9:1999 (E) © ISO/IEC
ISO/IEC 2022:1994, Information technology – The bit combinations may be interpreted to
Character code structure and extension techniques. represent numbers in binary notation by attributing
the following weights to the individual bits:
ISO/IEC 4873:1991, Information technology –
Bit b b b b b b b b
ISO 8-bit code for information interchange –
8 7 6 5 4 3 2 1
Structure and rules for implementation.
Weight 128 64 32 16 8 4 2 1
ISO/IEC 8824-1:1995, Information technology –
Using these weights, the bit combinations are
Abstract Syntax Notation One (ASN.1): Specifica-
identified by notations of the form xx/yy, where xx
tion of basic notation.
and yy are numbers in the range 00 to 15. The
correspondence between the notations of the form
4 Definitions
xx/yy and the bit combinations consisting of the bits
For the purposes of this part of ISO/IEC 8859 the
b to b is as follows:
8 1
following definitions apply.
– xx is the number represented by b ,b ,b and
8 7 6
4.1 bit combination: An ordered set of bits used
b where these bits are given the weights 8, 4, 2,
5
for the representation of characters.
and 1 respectively.
4.2 byte: A bit string that is operated upon asa unit.
– yy is the number represented by b ,b ,b and
4 3 2
4.3 character: A member of a set of elements b where these bits are given the weights 8, 4, 2,
1
used for the organization, control, or representation and 1 respectively.
of data.
The bit combinations are also identified by notations
4.4 code table: A table showing the characters
of the form hk, where h and k are numbers in the
allocated to each bit combination in a code.
range 0 to F in hexadecimal notation. The number
h is the same as the number xx described above,
4.5 coded character set; code: A set of
and the number k the same as the number yy
unambiguous rules that establishes a character set
described above.
and the one-to-one relationship between the
characters of the set and their bit combinations.
5.2 Layout of the code table
4.6 coded-character-data-element (CC-data-
An 8-bit code table consists of 256 positions
element): An element of interchanged information
arranged in 16 columns and 16 rows. The columns
that is specified to consist of a sequence of coded
and the rows are numbered 00 to 15. In hexa-
representations of characters, in accordance with
decimal notation the columns and the rows are
one or more identified standards for coded
numbered 0 to F.
character sets.
The code table positions are identified by notations
4.7 graphic character: A character, other than a
of the form xx/yy, where xx is the column number
control function, that has a visual representation
and yy is the row number. The column and row
normally handwritten, printed or displayed, and that
numbers are shown at the top and left edges of the
has a coded representation consisting of one or
table respectively. The code table positions are
more bit combinations.
also identified by notations of the form hk, where h
NOTE – In ISO/IEC 8859a single bit combination is used
is the column number and k is the row number in
to represent each character.
hexadecimal notation. The column and row
4.8 graphic symbol: A visual representation of a
numbers are shown at the bottom and right edges of
graphic character or of a control function.
the table respectively.
4.9 position: That part of a code table identified
The positions of the code table are in one-to-one
by its column and row coordinates.
correspondence with the bit combinations of the
code. The notation of a code table position, of the
5 Notation, code table and names
form xx/yy, or of the form hk, is the same as that of
5.1 Notation
the corresponding bit combination.
The bits of the bit combinations of the 8-bit code are
5.3 Names and meanings
identified by b ,b ,b ,b ,b ,b ,b , and b , where
8 7 6 5 4 3 2 1
b is the highest-order, or most-significant bit and b
This part of ISO/IEC 8859 assigns a unique name
8 1
is the lowest-order, or least-significant bit.
and a unique identifier to each graphic character.
These names and identifiers have been taken from
2

---------------------- Page: 6 ----------------------
© ISO/IEC ISO/IEC 8859-9:1999 (E)
ISO/IEC 10646-1 (E). This part of ISO/IEC 8859 Table1 – Character set, coded representation
also specifies an acronym for each of the characters
Bit
SPACE, NO-BREAK SPACE and SOFT HYPHEN.
combi- Hex Identifier Name
nation
For acronyms only Latin capital letters A to Z are
used. It is intended that the acronyms be retained in
02/00 20 U+0020 SPACE
02/01 21 U+0021 EXCLAMATION MARK
all translations of the text.
02/02 22 U+0022 QUOTATION MARK
02/03 23 U+0023 NUMBER SIGN
Except for SPACE (SP), NO-BREAK SPACE
02/04 24 U+0024 DOLLAR SIGN
(NBSP) and SOFT HYPHEN (SHY), this part of
02/05 25 U+0025 PERCENT SIGN
ISO/IEC 8859 does not define and does not restrict
02/06 26 U+0026 AMPERSAND
the meanings of graphic characters.
02/07 27 U+0027 APOSTROPHE
02/08 28 U+0028 LEFT PARENTHESIS
This part of ISO/IEC 8859 specifies a graphic
02/09 29 U+0029 RIGHT PARENTHESIS
symbol for each graphic character. This symbol is 02/10 2A U+002A ASTERISK
02/11 2B U+002B PLUS SIGN
shown in the corresponding position of the code
02/12 2C U+002C COMMA
table. However, this part, or any other part, of
02/13 2D U+002D HYPHEN-MINUS
ISO/IEC 8859 does not specify a particular style or
02/14 2E U+002E FULL STOP
font design for imaging graphic characters. Annex 02/15 2F U+002F SOLIDUS
03/00 30 U+0030 DIGIT ZERO
B of ISO/IEC 10367 gives further information on this
03/01 31 U+0031 DIGIT ONE
subject.
03/02 32 U+0032 DIGIT TWO
03/03 33 U+0033 DIGIT THREE
5.3.1 SPACE (SP)
03/04 34 U+0034 DIGIT FOUR
03/05 35 U+0035 DIGIT FIVE
A graphic character the visual representation of
03/06 36 U+0036 DIGIT SIX
which consists of the absence of a graphic symbol.
03/07 37 U+0037 DIGIT SEVEN
03/08 38 U+0038 DIGIT EIGHT
5.3.2 NO-BREAK SPACE (NBSP)
03/09 39 U+0039 DIGIT NINE
03/10 3A U+003A COLON
A graphic character the visual representation of
03/11 3B U+003B SEMICOLON
which consists of the absence of a graphic symbol,
03/12 3C U+003C LESS-THAN SIGN
03/13 3D U+003D EQUALS SIGN
for use when a line break is to be prevented in the
03/14 3E U+003E GREATER-THAN SIGN
text as presented.
03/15 3F U+003F QUESTION MARK
04/00 40 U+0040 COMMERCIAL AT
5.3.3 SOFT HYPHEN (SHY)
04/01 41 U+0041 LATIN CAPITAL LETTER A
04/02 42 U+0042 LATIN CAPITAL LETTER B
A graphic character that is imaged by a graphic
04/03 43 U+0043 LATIN CAPITAL LETTER C
symbol identical with, or similar to, that representing
04/04 44 U+0044 LATIN CAPITAL LETTER D
HYPHEN, for use when a line break has been
04/05 45 U+0045 LATIN CAPITAL LETTER E
04/06 46 U+0046 LATIN CAPITAL LETTER F
established within a word.
04/07 47 U+0047 LATIN CAPITAL LETTER G
04/08 48 U+0048 LATIN CAPITAL LETTER H
6 Specification of the coded character set
04/09 49 U+0049 LATIN CAPITAL
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.