ISO/IEC 8859-4:1998
(Main)Information technology — 8-bit single-byte coded graphic character sets — Part 4: Latin alphabet No. 4
Information technology — 8-bit single-byte coded graphic character sets — Part 4: Latin alphabet No. 4
Technologies de l'information — Jeux de caractères graphiques codés sur un seul octet — Partie 4: Alphabet latin no 4
General Information
Relations
Standards Content (Sample)
INTERNATIONAL ISOAEC
8859-4
STANDARD
First edition
1998-07-01
Information technology - 8-bit single-byte
coded graphic character sets -
Part 4:
Latin alphabet No. 4
- Jeux de caractkres graphiques cod& sur
Technologies de I’informa tion
un seul octet -
Partie 4: Alphabet latin no 4
Reference number
q m ’
ISOh EC 8859-4: 1998(E)
---------------------- Page: 1 ----------------------
ISOAEC 8859-4: 1998 (E)
Contents
Page
Foreword . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Introduction . . . . . . . . . . . . . I . . . . . . . . . . . . . . . . . . . . . . . . iv
1 Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Conformance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Normative references . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
3
Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
4
Notation, code table and names . . . . . . . . . . . . . . . . . . . 2
5
Specification of the coded character set . . . . . . . . . . . . . 3
6
7 Identification of the character set . . . . . . . . . . . . . . . . . . 6
Annex A: Coverage of languages by parts 1 to 10 of
lSO/IEC 8859 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Annex B: Main differences between IS0 8859-4:1988 and
this first edition of this part of lSO/IEC 8859 . . . . 9
Annex C: Bibliography . . . . . . . . . . . . . . . . . . . . . . . . I . . 10
0 ISO/IEC 1998
Unless otherwise specified, no part of this publication may be
All rights reserved.
reproduced or utilized in any form or by any means, electronic or mechanical,
including photocopying and microfilm, without permission in writing from the publisher.
ISO/IEC Copyright Office l Case Postale 56 l CH-121 1 Geneve 20 l Switzerland
Printed in Switzerland
ii
---------------------- Page: 2 ----------------------
o ISO/IEC ISOAEC 8859-4:1998 (E)
Foreword
IS0 (the International Organization for Standardization) and IEC (the
International Electrotechnical Commission) form the specialized
system for worldwide standardization. National bodies that are
members of IS0 or IEC participate in the development of
International Standards through technical committees established by
the respective organization to deal with particular fields of technical
activity. IS0 and IEC technical committees collaborate in fields of
mutual interest. Other international organizations, governmental and
nongovernmental, in liaison with IS0 and IEC, also take part in the
work.
In the field of information technology, IS0 and IEC have established
a joint technical committee, ISO/IEC JTCI. Draft International
Standards adopted by the joint technical committee are circulated to
national bodies for voting. Publication as an International Standard
requires approval by at least 75% of the national bodies casting a
vote.
International Standard ISO/IEC 8859-4 was prepared by Joint
Technical Committee ISO/IEC JTC 1, information technology,
Subcommittee SC 2, Character sets and information coding.
This edition cancels and replaces IS0 8859-4:1988 which has been
technically revised.
ISO/IEC 8859 consists of the following parts, under the general title
lnforma tion technology - 8-bit single-byte coded graphic character
sets:
- Part 1: Latin alphabet No. I
-
Part 2: Latin alphabet No. 2
-
Part 3: Latin alphabet No. 3
-
Part 4: Latin alphabet No. 4
-
Part 5: Latin/Cyrillic alphabet
-
Part 6: Latin/Arabic alphabet
-
Part 7: Latin/Greek alphabet
-
Part 8: Latin/Hebrew alphabet
-
Part 9: Latin alphabet No. 5
-
Part IO: Latin alphabet No. 6
Annexes A to C of this part of ISO/IEC 8859 are for information only.
. . .
III
---------------------- Page: 3 ----------------------
ISOAEC 8859-4: 1998 (E)
0 ISOAEC
Iintroduction
ISOAEC 8859 consists of several parts. Each part specifies a set of
up to 191 graphic characters and the coded representation of these
characters by means of a single B-bit byte. Each set is intended for
use for a particular group of languages.
iv
---------------------- Page: 4 ----------------------
ISOAEC 8859-4:1998 (E)
INTERNATIONAL STANDARD o ISO/IEC
Information technology -
8-bit single-byte coded graphic character sets -
Part 4: Latin alphabet No. 4
that CC-data-element conform to the requirements
1 Scope
of clause 6.
This part of lSO/IEC 8859 specifies a set of 191
characters identified as Latin
coded graphic
2.2 Conformance of devices
alphabet No. 4.
A device is in conformance with this part of
This set of coded graphic characters is intended for ISO/IEC 8859 if it conforms to the requirements of
use in data and text processing applications and 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim
also for information interchange. of conformance shall identify the document which
contains the description specified in 2.2.1.
The set contains graphic characters used for
general purpose applications in typical office
2.2.1 Device description
environments in at least the following languages:
A device that conforms to this part of ISO/IEC 8859
Danish, English, Estonian, Finnish, German, shall be the subject of a description that identifies
Greenlandic, Latin, Latvian, Lithuanian, Norwegian, the means by which the user may supply characters
Sami (but see Annex A.1, Notes), Slovene and to the device, or may recognize them when they are
Swedish. made available to him, as specified respectively in
2.2.2 and 2.2.3.
This set of coded graphic characters may be
regarded as a version of an 8-bit code according to
2.2.2 Originating devices
ISO/IEC 2022 or ISO/IEC 4873 at level 1.
An originating device shall allow its user to supply
This part of ISO/IEC 8859 may not be used in any sequence of characters from those specified in
conjunction with any other parts of ISO/IEC 8859. clause 6, and shall be capable of transmitting their
If coded characters from more than one part are to coded representations within a CC-data-element.
be used together, by means of code extension
2.2.3 Receiving devices
techniques, the equivalent coded character sets
from ISO/IEC 10367 should be used instead within
A receiving device shall be capable of receiving and
a version of ISO/IEC 4873 at level 2 or level 3.
interpreting any coded representations of characters
that are within a CC-data-element, and that conform
The coded characters in this set may be used in
to clause 6, and shall make the corresponding
conjunction with coded control functions selected
characters available to its user in such a way that
from ISO/IEC 6429. However, control functions are
the user can identify them from among those
not used to create composite graphic symbols from
specified there, and can distinguish them from each
two or more graphic characters (see clause 6).
other.
NOTE - ISOAEC 8859 is not intended for use with
Telematic services defined by ITU-T. If information coded
3 Normative references
according to ISOAEC 8859 is to be transferred to such
services, it will have to conform to the requirements of
The following standards contain provisions which,
those services at the access-point.
through reference in this text, constitute provisions
of this part of ISO/IEC 8859. At the time of publica-
2 Conformance
tion, the editions indicated were valid. All standards
are subject to revision, and parties to agreements
2.1 Conformance of information interchange
based on this part of ISO/IEC 8859 are encouraged
A coded-character-data-element (CC-data-element)
to investigate the possibility of applying the most
within coded information for interchange is in
recent editions of the standards indicated below.
conformance with this part of ISO/IEC 8859 if all the
Members of IEC and IS0 maintain registers of
coded representations of graphic characters within
currently valid International Standards.
1
---------------------- Page: 5 ----------------------
ISOAEC 885994:1998 (E) 0 ISO/IEC
ISO/IEC 2022:1994,
lnforma tion technology - The bit combinations may be interpreted to
Character code structure and extension techniques. represent numbers in binary notation by attributing
the following weights to the individual bits:
lSO/IEC 4873:1991, lnforma tion technology -
Bit
IS0 a-bit code for information interchange -
b, b, b, b, b4 b3 b2 bl
Structure and rules for implementation.
Weight 128 64 32 16 8 4
2 1
lSO/IEC 8824-l :I 995, lnforma tion technology -
Using these weights, the bit combinations are
Abstract Syntax Notation One (ASN. I): Specifica-
identified by notations of the form xx/yy, where xx
tion of basic notation.
and yy are numbers in the range 00 to 15. The
correspondence between the notations of the form
4 Definitions
xx/yy and the bit combinations consisting of the bits
For the purposes of this part of ISO/IEC 8859 the
b, to b, is as follows:
following definitions apply:
xx is the number represented by b,, b,, b, and
-
4.1 bit combination: An ordered set of bits used
b, where these bits are given the weights 8, 4, 2,
for the representation of characters.
and 1 respectively.
4.2 byte: A bit string that is operated upon as a unit.
- yy is the number represented by b,, b,, b, and
4.3 character: A member of a set of elements b, where these bits are given the weights 8, 4, 2,
used for the organization, control, or representation and 1 respectively.
of data.
The bit combinations are also identified by notations
4.4 code table: A table showing the characters
of the form hk, where h and k are numbers in the
allocated to each bit combination in a code.
range 0 to F in hexadecimal notation. The number
h is the same as the number xx described above,
4.5 coded character set; code:
A set of
and the number k the same as the number yy
unambiguous rules that establishes a character set
described above.
and the one-to-one relationship between the
characters of the set and their bit combinations.
5.2 Layout of the code table
4.6 coded-character-data-element (CC-data-
An 8-bit code table consists of 256 positions
element): An element of interchanged information
arranged in 16 columns and 16 rows. The columns
that is specified to consist of a sequence of coded
and the rows are numbered 00 to 15. In hexa-
representations of characters, in accordance with
decimal notation the columns and the rows are
one or more identified standards for coded
numbered 0 to F.
character sets.
4.7 graphic character: The code table positions are identified by notations
A character, other than a
control function, that has a visual representation of the form xx/yy, where xx is the column number
normally handwritten, printed or displayed, and that and yy is the row number. The column and row
has a coded representation consisting of one or numbers are shown at the top and left edges of the
more bit combinations. table respectively. The code table positions are
also identified by notations of the form hk, where h
NOTE - In lSO/IEC 8859 a single bit combination is used
is the column number and k is the row number in
to represent each character.
hexadecimal notation. The column and row
4.8 graphic symbol: A visual representation of a
numbers are shown at the bottom and right edges of
graphic character or of a control function.
the table respectively.
4.9 position:
That part of a code table identified
The positions of the code table are in one-to-one
by its column and row coordinates.
correspondence with the bit combinations of the
code. The notation of a code table position, of the
5 Notation, code table and names
form xx/yy, or of the form hk, is the same as that of
5.1 Notation
the corresponding bit combination.
The bits of the bit combinations of the 8-bit code are
5.3 Names and meanings
identified by b,, b,, b,, b,, b,, b,, b,, and b,, where
b, is the highest-order, or most-significant bit and b,
This part of ISOAEC 8859 assigns a unique name
is the lowest-order, or least-significant bit.
and a unique identifier to each graphic character.
These names and identifiers have been taken from
2
---------------------- Page: 6 ----------------------
0 ISO/IEC ISOAEC 8859-4: 1998 (E)
ISO/IEC 10646-I (E). This part of lSO/IEC 8859 - Character set, coded representation
Table 1
also specifies an acronym for each of the characters
I I
r
3il
SPACE, NO-BREAK SPACE and SOFT HYPHEN.
:om bi- 1 Hex1 Identifier! Name
lation
For acronyms only Latin capital letters A to Z are
used. It is intended that the acronyms be retained in
02/00 20 u+oo20 SPACE
02/01 21 u+oo21 EXCLAMATION MARK
all translations of the text.
02102 22 QUOTATION MARK
u+oo22
02/03 23 U+OO23 NUMBER SIGN
Except for SPACE (SP), NO-BREAK SPACE
02/04 24 U+OO24 DOLLAR SIGN
(NBSP) and SOFT HYPHEN (SHY), this part of
02/05 25 UtO025 PERCENT SIGN
ISO/IEC 8859 does not define and does not restrict
02lO6 26 UtO026 AMPERSAND
the meanings of graphic characters.
02107 27 UtO027 APOSTROPHE
28 LEFT PARENTHESIS
02108 UtO028
This part of ISO/IEC 8859 specifies a graphic
02109 29 utoo29 RIGHT PARENTHESIS
02110 2A Ut002A ASTERISK
symbol for each graphic character. This symbol is
28 Ut002B PLUS SIGN
02111
shown in the corresponding position of the code
02/12 2c utoo2c COMMA
table. However, this part, or any other part, of
02/13 2D Ut002D HYPHEN-MINUS
ISO/IEC 8859 does not specify a particular style or
2E Ut002E FULL STOP
02/14
02l15 2F UtOO2F SOLIDUS
font design for imaging graphic characters. Annex
03100 30 utoo30 DIGIT ZERO
B of lSO/IEC 10367 gives further information on this
31 utoo31 DIGIT ONE
0310 1
subject.
03102 32 UtO032 DIGIT TWO
03103 33 utoo33 DIGIT THREE
5.3.1 SPACE (SP)
03104 34 utoo34 DIGIT FOUR
03105 35 utoo35 ~ DIGIT FIVE
A graphic character the visual representation of
03106 36 UtO036 DIGIT SIX
which consists of the absence of a graphic symbol.
37 DIGIT SEVEN
03107 utoo37
03108 38 UtO038 DIGIT EIGHT
39 DIGIT NINE
5.3.2 NO-BREAK SPACE (NBSP) 03/09 utoo39
03/l 0 3A Ut003A COLON
A graphic character the visual representation of
03111 38 U+003B SEMICOLON
which consists of the absence of a graphic symbol, LESS-THAN SIGN
03112 3c utoo3c
03/l 3 30 Ut003D EQUALS SIGN
for use when a line break is to be prevented in the
GREATER-THAN SIGN
03114 3E Ut003E
text as presented.
3F UtOO3F QUESTION MARK
03115
04/00 40 utoo40 COMMERCIAL AT
5.3.3 SOFT HYPHEN (SHY)
LATIN CAPITAL LETTER A
0410 1 41 utoo41
04/02 42 UtO042 LATIN CAPITAL LEllER B
A graphic character that is imaged by a graphic
04/03 43 utoo43 LATIN CAPITAL LETTER C
symbol identical with, or similar to, that representing
44 LATIN CAPITAL LETTER D
04104 utoo44
HYPHEN, for use when a line break has been
04/05 45 utoo45 LATIN CAPITAL LETTER E
04/06 46 UtO046 LATIN CAPITAL LETTER F
established within a word.
47 LATIN CAPITAL LETTER G
04107 utoo47
04/08 48 UtO048 LATIN CAPITAL LEll-ER H
6 Specification of the coded character set
04/09 49 utoo49 LATIN CAPITAL LETTER I
04/l 0 4A Ut004A LATIN CAPITAL LETTER J
This part of lSO/IEC 8859 specifies 191 characters
04/l 1 4B Ut004B LATIN CAPITAL LETTER K
LATIN CAPITAL LETTER L
allocated to the bit combinations of the code table 04/l 2 4c utoo4c
04113 40 Ut004D LATIN CAPITAL LETTER M
(table 2). None of these characters are combining
04/l 4 4E Ut004E LATIN CAPITAL LETTER N
characters.
04/l 5 4F UtOO4F LATIN CAPITAL LETTER 0
05/00 50 utoo50 LATIN CAPITAL LETTER P
NOTE - Combining characters are described in ISOAEC
05101 51 utoo51 LATIN CAPITAL LE-l-l-ER Q
2022:1994 subclause 6.3.3.
05/02 52 U+OO52 LATIN CAPITAL LETTER R
LATIN CAPITAL LETTER S
05/03 53 utoo53
such as BACKSPACE or
Control functions,
05104 54 utoo54 LATIN CAPITAL LETTER T
CARRIAGE RETURN, shall not be used to create
05/05 55 utoo55 LATIN CAPITAL LETTER U
compos
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.