Information technology — 8-bit single-byte coded graphic character sets — Part 8: Latin/Hebrew alphabet

Technologies de l'information — Jeux de caractères graphiques codés sur un seul octet — Partie 8: Alphabet latin/hébreu

General Information

Status
Published
Publication Date
20-Jan-1999
Current Stage
9093 - International Standard confirmed
Completion Date
27-Aug-2020
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 8859-8:1999 - Information technology -- 8-bit single-byte coded graphic character sets
English language
12 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 8859-8
First edition
1999-01-15
Information technology — 8-bit single-byte
coded graphic character sets —
Part 8:
Latin/Hebrew alphabet
Technologies de l'information — Jeux de caractères graphiques codés sur
un seul octet —
Partie 8: Alphabet latin/hébreu
Reference number
B C
ISO/IEC 8859-8:1999(E)

---------------------- Page: 1 ----------------------
ISO/IEC 8859-8:1999 (E)
Contents
Page
Foreword . . iii
Introduction . . iv
1 Scope . 1
2 Conformance . 1
3 Normative references . 1
4 Definitions . 2
5 Notation, code table and names . 2
6 Specification of the coded character set . 3
7 Identification of the character set . 7
Annex A: Coverage of languages by parts 1 to 10 of
ISO/IEC 8859 . 8
Annex B: Main differences between ISO 8859-8:1988 and
this first edition of this part of ISO/IEC 8859 . 10
Annex C: Bi-directional text support . 11
Annex D: Bibliography . 12
© ISO/IEC 1999
All rights reserved. Unless otherwise specified, no part of this publication may be
reproduced or utilized in any form or by any means, electronic or mechanical,
including photocopying and microfilm, without permission in writing from the publisher.
ISO/IEC Copyright Office Case Postale 56 CH-1211 Genève 20 Switzerland
• • •
Printed in Switzerland
ii

---------------------- Page: 2 ----------------------
© ISO/IEC ISO/IEC 8859-8:1999 (E)
Foreword
ISO (the International Organization for Standardization) and IEC (the
International Electrotechnical Commission) form the specialized
system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of
International Standards through technical committees established by
the respective organization to deal with particular fields of technical
activity. ISO and IEC technical committees collaborate in fields of
mutual interest. Other international organizations, governmental and
nongovernmental, in liaison with ISO and IEC, also take part in the
work.
In the field of information technology, ISO and IEC have established
a joint technical committee, ISO/IEC JTC1. Draft International
Standards adopted by the joint technical committee are circulated to
national bodies for voting. Publication as an International Standard
requires approval by at least 75% of the national bodies casting a
vote.
International Standard ISO/IEC 8859-8 was prepared by Joint
Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 2, Coded character sets.
This edition cancels and replaces ISO 8859-8:1988 which has been
technically revised.
ISO/IEC 8859 consists of the following parts, under the general title
Information technology – 8-bit single-byte coded graphic character
sets:
– Part 1: Latin alphabet No. 1
– Part 2: Latin alphabet No. 2
– Part 3: Latin alphabet No. 3
– Part 4: Latin alphabet No. 4
– Part 5: Latin/Cyrillic alphabet
– Part 6: Latin/Arabic alphabet
– Part 7: Latin/Greek alphabet
– Part 8: Latin/Hebrew alphabet
– Part 9: Latin alphabet No. 5
– Part 10: Latin alphabet No. 6
Annexes A to D of this part of ISO/IEC 8859 are for information only.
iii

---------------------- Page: 3 ----------------------
ISO/IEC 8859-8:1999 (E) © ISO/IEC
Introduction
ISO/IEC 8859 consists of several parts. Each part specifies a set of
up to 191 graphic characters and the coded representation of these
characters by means of a single 8-bit byte. Each set is intended for
use for a particular group of languages.
iv

---------------------- Page: 4 ----------------------
INTERNATIONAL STANDARD © ISO/IEC ISO/IEC 8859-8:1999 (E)
Information technology –
8-bit single-byte coded graphic character sets –
Part 8: Latin/Hebrew alphabet
2.2 Conformance of devices
1 Scope
A device is in conformance with this part of
This part of ISO/IEC 8859 specifies a set of 155
ISO/IEC 8859 if it conforms to the requirements of
coded graphic characters identified as Latin/Hebrew
2.2.1, and either or both of 2.2.2 and 2.2.3. A claim
alphabet.
of conformance shall identify the document which
This set of coded graphic characters is intended for
contains the description specified in 2.2.1.
use in data and text processing applications and
also for information interchange.
2.2.1 Device description
A device that conforms to this part of ISO/IEC 8859
The set contains graphic characters used for
shall be the subject of a description that identifies
general purpose applications in typical office
the means by which the user may supply characters
environments in at least the following languages:
to the device, or may recognize them when they are
English, Hebrew, Latin.
made available to him, as specified respectively in
2.2.2 and 2.2.3.
It is not intended for pointed Hebrew.
2.2.2 Originating devices
This set of coded graphic characters may be
regarded as a version of an 8-bit code according to
An originating device shall allow its user to supply
ISO/IEC 2022 or ISO/IEC 4873 at level 1.
any sequence of characters from those specified in
clause 6, and shall be capable of transmitting their
This part of ISO/IEC 8859 may not be used in
coded representations within a CC-data-element.
conjunction with any other parts of ISO/IEC 8859.
If coded characters from more than one part are to
2.2.3 Receiving devices
be used together, by means of code extension
A receiving device shall be capable of receiving and
techniques, the equivalent coded character sets
interpreting any coded representations of characters
from ISO/IEC 10367 should be used instead within
that are withina CC-data-element, and that conform
a version of ISO/IEC 4873 at level 2 or level 3.
to clause 6, and shall make the corresponding
The coded characters in this set may be used in
characters available to its user in such a way that
conjunction with coded control functions selected
the user can identify them from among those
from ISO/IEC 6429. However, control functions are
specified there, and can distinguish them from each
not used to create composite graphic symbols from
other.
two or more graphic characters (see clause 6).
NOTE – ISO/IEC 8859 is not intended for use with
3 Normative references
Telematic services defined by ITU-T. If information coded
The following standards contain provisions which,
according to ISO/IEC 8859 is to be transferred to such
services, it will have to conform to the requirements of
through reference in this text, constitute provisions
those services at the access-point.
of this part of ISO/IEC 8859. At the time of publica-
tion, the editions indicated were valid. All standards
2 Conformance are subject to revision, and parties to agreements
based on this part of ISO/IEC 8859 are encouraged
2.1 Conformance of information interchange
to investigate the possibility of applying the most
A coded-character-data-element (CC-data-element) recent editions of the standards indicated below.
within coded information for interchange is in Members of IEC and ISO maintain registers of
conformance with this part of ISO/IEC 8859 if all the currently valid International Standards.
coded representations of graphic characters within
that CC-data-element conform to the requirements
of clause 6.
1

---------------------- Page: 5 ----------------------
ISO/IEC 8859-8:1999 (E) © ISO/IEC
ISO/IEC 2022:1994, Information technology – 4.10 graphic symbol: A visual representation of
Character code structure and extension techniques. a graphic character or of a control function.
4.11 implicit directionality: A text presentation
ISO/IEC 4873:1991, Information technology –
method in which the direction is determined by an
ISO 8-bit code for information interchange –
algorithm. The algorithm is based on the directional
Structure and rules for implementation.
character properties of the character, its position
ISO/IEC 8824-1:1995, Information technology – relative to the preceding and following character and
Abstract Syntax Notation One (ASN.1): Specifica- to the primary direction.
tion of basic notation.
4.12 left-to-right character: A character specific
to a script written from left to right like the Latin
4 Definitions
script or the Greek script. Typical examples are the
letters A–Z.
For the purposes of this part of ISO/IEC 8859 the
following definitions apply:
4.13 position: That part of a code table identified
by its column and row coordinates.
4.1 bi-directional text: A text which may contain
strings of characters with left-to-right and right-to- 4.14 right-to-left character: A character specific
left directions. to a script written from right to left like the Arabic
script or the Hebrew script. Typical examples are
4.2 bit combination: An ordered set of bits used
the letters of the Hebrew alphabet.
for the representation of characters.
4.3 byte: A bit string that is operated upon asa unit. 5 Notation, code table and names
4.4 character: A member of a set of elements 5.1 Notation
used for the organization, control, or representation
The bits of the bit combinations of the 8-bit code are
of data.
identified by b ,b ,b ,b ,b ,b ,b , and b , where
8 7 6 5 4 3 2 1
b is the highest-order, or most-significant bit and b
4.5 code table: A table showing the characters
8 1
is the lowest-order, or least-significant bit.
allocated to each bit combination in a code.
4.6 coded character set; code: A set of The bit combinations may be interpreted to
unambiguous rules that establishes a character set represent numbers in binary notation by attributing
and the one-to-one relationship between the the following weights to the individual bits:
characters of the set and their bit combinations.
Bit b b b b b b b b
8 7 6 5 4 3 2 1
4.7 coded-character-data-element (CC-data-
Weight 128 64 32 16 8 4 2 1
element): An element of interchanged information
that is specified to consist of a sequence of coded
Using these weights, the bit combinations are
representations of characters, in accordance with
identified by notations of the form xx/yy, where xx
one or more identified standards for coded
and yy are numbers in the range 00 to 15. The
character sets.
correspondence between the notations of the form
4.8 directional character properties: A set of
xx/yy and the bit combinations consisting of the bits
mutually exclusive properties which may qualify the
b to b is as follows:
8 1
members of a character set. These properties are
used by algorithms which transform text from – xx is the number represented by b ,b ,b and
8 7 6
processing sequence into presentation sequence. b where these bits are given the weights 8, 4, 2,
5
Examples of values for directional character and 1 respectively.
properties are "right-to-left", "left-to-right", "digit",
– yy is the number represented by b ,b ,b and
4 3 2
"numeric separator", "neutral".
b where these bits are given the weights 8, 4, 2,
1
4.9 graphic character: A character, other than a
and 1 respectively.
control function, that has a visual representation
The bit combinations are also identified by notations
normally handwritten, printed or displayed, and that
of the form hk, where h and k are numbers in the
has a coded representation consisting of one or
range 0 to F in hexadecimal notation. The number
more bit combinations.
h is the same as the number xx described above,
NOTE – In ISO/IEC 8859a single bit combination is used
and the number k the same as the number yy
to represent each character.
described above.
2

---------------------- Page: 6 ----------------------
© ISO/IEC ISO/IEC 8859-8:1999 (E)
5.3.1 SPACE (SP)
5.2 Layout of the code table
A graphic character the visual representation of
An 8-bit code table consists of 256 positions
which consists of the absence of a graphic symbol.
arranged in 16 columns and 16 rows. The columns
and the rows are numbered 00 to 15. In hexa-
5.3.2 NO-BREAK SPACE (NBSP)
decimal notation the columns and the rows are
numbered 0 to F.
A graphic character the visual representation of
which consists of the absence of a graphic symbol,
The code table positions are identified by notations
for use when a line break is to be prevented in the
of the form xx/yy, where xx is the column number
text as presented.
and yy is the row number. The column and row
numbers are shown at the top and left edges of the
5.3.3 SOFT HYPHEN (SHY)
table respectively. The code table positions are
A graphic character that is imaged by a graphic
also identified by notations of the form hk, where h
symbol identical with, or similar to, that representing
is the column number and k is the row number in
HYPHEN, for use when a line break has been
hexadecimal notation. The column and row
established within a word.
numbers are shown at the bottom and right edges of
the table respectively.
5.3.4 LEFT-TO-RIGHT MARK (LRM)
The positions of the code table are in one-to-one
A graphic character the visual representation of
correspondence with the bit combinations of the
which consists of the absence of a graphic symbol,
code. The notation of a code table position, of the
which acts like a left-to-right character in a bi-
form xx/yy, or of the form hk, is the same as that of
directional text (such as LATIN SMALL LETTER A).
the corresponding bit combination.
5.3.5 RIGHT-TO-LEFT MARK (RLM)
5.3 Names and meanings
A graphic character the visual representation of
This part of ISO/IEC 8859 assigns a unique name
which consists of the absence of a graphic symbol,
and a unique identifier to each graphic character.
which acts like a right-to-left character in a bi-
These names and identifiers have been taken from
directional text (such as HEBREW LETTER ALEF).
ISO/IEC 10646-1 (E). This part of ISO/IEC 8859
also specifies an acronym for each of the characters
6 Specification of the coded character set
SPACE, NO-BREAK SPACE, SOFT HYPHEN,
This part of ISO/IEC 8859 specifies 155 characters
LEFT-TO-RIGHT MARK and RIGHT-TO-LEFT
allocated to the bit combinations of the code table
MARK. For acronyms only Latin capital letters A to
(table 2).
Z are used. It is intended that the acronyms be
retained in all translations of the text.
Control functions, such as BACKSPACE or
CARRIAGE RETURN, shall not be used to create
Except for SPACE (SP), NO-BREAK SPACE
composite graphic symbols, which are made up
(NBSP), SOFT HYPHEN (SHY), LEFT-TO-RIGHT
from the graphic representations of two or more
MARK (LRM) and RIGHT-TO-LEFT MARK (RLM),
characters.
this part of ISO/IEC 8859 does not define and does
not restrict the meanings of graphic characters.
6.1 Characters of the set and their coded
This part of ISO/IEC 8859 specifies a graphic
representation
symbol for each graphic character. This symbol is
See table 1.
shown in the corresponding position of the code
table. However, this part, or any other part, of
ISO/IEC 8859 does not specify a particular style or
font design for imaging graphic characters. Annex
B of ISO/IEC 10367 gives further information on this
subject.
3

---------------------- Page: 7 ----------------------
ISO/IEC 8859-8:1999 (E) © ISO/IEC
Table1 – Character set, coded representation Table 1 (continued)
Bit Bit
combi- Hex Identifier Name combi- Hex Identifier Name
nation nation
02/00 20 U+0020 SPACE 05/00 50 U+0050 LATIN CAPITAL LETTER P
02/01 21 U+0021 EXCLAMATION MARK 05/01 51 U+0051 LATIN CAPITAL LETTER Q
02/02 22 U+0022 QUOTATION MARK 05/02 52 U+0052 LATIN CAPITAL LETTER R
02/03 23 U+0023 NUMBER SIGN 05/03 53 U+0053 LATIN CAPITAL LETTER S
02/04 24 U+0024 DOLLAR SIGN 05/04 54 U+0054 LATIN CAPITAL LETTER T
02/05 25 U+0025 PERCENT SIGN 05/05 55 U+0055 LATIN CAPITAL LETTER U
02/06 26 U+0026 AMPERSAND 05/06 56 U+0056 LATIN CAPITAL LETTER V
02/07 27 U+0027 APOSTROPHE 05/07 57 U+0057 LATIN CAPITAL LETTER W
02/08 28 U+0028 LEFT PARENTHESIS 05/08 58 U+0058 LATIN CAPITAL LETTER X
02/09 29 U+0029 RIGHT PARENTHESIS 05/09 59 U+0059 LATIN CAPITAL LETTER Y
02/10 2A U+002A ASTERISK 05/10 5A U+005A LATIN CAPITAL LETTER Z
02/11 2B U+002B PLUS SIGN 05/11 5B U+005B LEFT SQUARE BRACKET
02/12 2C
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.