ISO/IEC 8859-11:2001
(Main)Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet
Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet
This part of ISO/IEC 8859 specifies a set of 183 coded graphic characters identified as Latin/Thai alphabet. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Thai, English and Latin. Some of the characters in this set are combining characters (see clause 6). This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 or their corresponding G1 sets from ?ISO International Register of Coded Character Sets to be used with escape sequences', should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). NOTE ? ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
Technologies de l'information — Jeux de caractères graphiques codés sur un seul octet — Partie 11: Alphabet latin/thaï
General Information
Standards Content (Sample)
INTERNATIONAL ISO/IEC
STANDARD 8859-11
First edition
2001-12-15
Information technology — 8-bit single-byte
coded graphic character sets —
Part 11:
Latin/Thai alphabet
Technologies de l'information — Jeux de caractères graphiques codés sur
un seul octet —
Partie 11: Alphabet latin/thaï
Reference number
ISO/IEC 8859-11:2001(E)
©
ISO/IEC 2001
---------------------- Page: 1 ----------------------
ISO/IEC 8859-11:2001(E)
PDF disclaimer
This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not
be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this
file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this
area.
Adobe is a trademark of Adobe Systems Incorporated.
Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters
were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event
that a problem relating to it is found, please inform the Central Secretariat at the address given below.
© ISO/IEC 2001
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic
or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body
in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.ch
Web www.iso.ch
Printed in Switzerland
ii © ISO/IEC 2001 – All rights reserved
---------------------- Page: 2 ----------------------
ISO/IEC 8859-11:2001(E)
Contents
Page
Foreword .………………………………………. iv
Introduction .……………………………………… v
1 Scope .………………………………………… 1
2 Conformance .…………………………………… 1
3 Normative references .……………………………….1
4 Terms and definitions .………………………. 2
5 Notation, code table and names .………………………. 2
6 Specification of the coded character set .…………………. 3
7 Identification of the character set .……………………… 7
Annex A Coverage of languages by parts 1 to 10 and
13 to 16 of ISO/IEC 8859 .……………………… 8
Bibliography .…………………………….……………. 10
© ISO/IEC 2001 – All rights reserved iii
---------------------- Page: 3 ----------------------
ISO/IEC 8859-11:2001(E)
Foreword
ISO (the International Organization for Standardization) and IEC (the
International Electrotechnical Commission) form the specialized
system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International
Standards through technical committees established by the respective
organization to deal with particular fields of technical activity. ISO and
IEC technical committees collaborate in fields of mutual interest. Other
international organizations, governmental and non-governmental, in
liaison with ISO and IEC, also take part in the work. In the field of
information technology, ISO and IEC have established a joint
technical committee, ISO/IEC JTC 1.
International Standards are drafted in accordance with the rules given
in the ISO/IEC Directives, Part 3.
The main task of the joint technical committee is to prepare
International Standards. Draft International Standards adopted by the
joint technical committee are circulated to national bodies for voting.
Publication as an International Standard requires approval by at least
75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this
part of ISO/IEC 8859 may be the subject of patent rights. ISO and IEC
shall not be held responsible for identifying any or all such patent
rights.
ISO/IEC 8859-11 was prepared by Joint Technical Committee
ISO/IEC JTC 1, Information technology, Subcommittee SC 2, Coded
character sets.
ISO/IEC 8859 consists of the following parts, under the general title
Information technology — 8-bit single-byte coded graphic character
sets:
Part 1: Latin alphabet No. 1
Part 2: Latin alphabet No. 2
Part 3: Latin alphabet No. 3
Part 4: Latin alphabet No. 4
Part 5: Latin/Cyrillic alphabet
Part 6: Latin/Arabic alphabet
Part 7: Latin/Greek alphabet
Part 8: Latin/Hebrew alphabet
Part 9: Latin alphabet No. 5
Part 10: Latin alphabet No. 6
Part 11: Latin/Thai alphabet
Part 13: Latin alphabet No. 7
Part 14: Latin alphabet No. 8 (Celtic)
Part 15: Latin alphabet No. 9
Part 16: Latin alphabet No. 10
Annex A of this part of ISO/IEC 8859 is for information only.
iv © ISO/IEC 2001 – All rights reserved
---------------------- Page: 4 ----------------------
ISO/IEC 8859-11:2001(E)
Introduction
ISO/IEC 8859 consists of several parts. Each part specifies a set of
up to 191 graphic characters and the coded representation of these
characters by means of a single 8-bit byte. Each set is intended for
use for a particular group of languages.
© ISO/IEC 2001 – All rights reserved v
---------------------- Page: 5 ----------------------
INTERNATIONAL STANDARD ISO/IEC 8859-11:2001(E)
Information technology –
8-bit single-byte coded graphic character sets –
Part 11: Latin/Thai alphabet
2.2 Conformance of devices
1 Scope
A device is in conformance with this part of ISO/IEC
This part of ISO/IEC 8859 specifies a set of 183
8859 if it conforms to the requirements of 2.2.1, and
coded graphic characters identified as Latin/Thai
either or both of 2.2.2 and 2.2.3. A claim of
alphabet.
conformance shall identify the document which
This set of coded graphic characters is intended for
contains the description specified in 2.2.1.
use in data and text processing applications and
2.2.1 Device description
also for information interchange.
A device that conforms to this part of ISO/IEC 8859
The set contains graphic characters used for general
shall be the subject of a description that identifies
purpose applications in typical office environments in
the means by which the user may supply characters
at least the following languages:
to the device, or may recognize them when they are
Thai, English and Latin.
made available to him, as specified respectively in
2.2.2 and 2.2.3.
Some of the characters in this set are combining
characters (see clause 6).
2.2.2 Originating devices
This set of coded graphic characters may be
An originating device shall allow its user to supply
regarded as a version of an 8-bit code according to
any sequence of characters from those specified in
ISO/IEC 2022 or ISO/IEC 4873 at level 1.
clause 6, and shall be capable of transmitting their
coded representations within a CC-data-element.
This part of ISO/IEC 8859 may not be used in
conjunction with any other parts of ISO/IEC 8859. If
2.2.3 Receiving devices
coded characters from more than one part are to be
A receiving device shall be capable of receiving and
used together, by means of code extension
interpreting any coded representations of characters
techniques, the equivalent coded character sets
that are within a CC-data-element, and that conform
from ISO/IEC 10367 or their corresponding G1 sets
to clause 6, and shall make the corresponding
from ‘ISO International Register of Coded Character
characters available to its user in such a way that
Sets to be used with escape sequences’, should be
the user can identify them from among those
used instead within a version of ISO/IEC 4873 at
specified there, and can distinguish them from each
level 2 or level 3.
other.
The coded characters in this set may be used in
3 Normative references
conjunction with coded control functions selected
from ISO/IEC 6429. However, control functions are
The following normative documents contain
not used to create composite graphic symbols from
provisions which, through reference in this text,
two or more graphic characters (see clause 6).
constitute provisions of this part of ISO/IEC 8859.
NOTE – ISO/IEC 8859 is not intended for use with For dated references, subsequent amendments to,
Telematic services defined by ITU-T. If information coded
or revisions of, any of these publications do not
according to ISO/IEC 8859 is to be transferred to such
apply. However, parties to agreements based on this
services, it will have to conform to the requirements of
part of ISO/IEC 8859 are encouraged to investigate
those services at the access-point.
the possibility of applying the most recent editions of
the normative documents indicated below. For
2 Conformance
undated references, the latest edition of the
normative document referred to applies. Members of
2.1 Conformance of information interchange
ISO and IEC maintain registers of currently valid
A coded-character-data-element (CC-data-element)
International Standards.
within coded information for interchange is in
conformance with this part of ISO/IEC 8859 if all the
coded representations of graphic characters within
that CC-data-element conform to the requirements
of clause 6.
© ISO/IEC 2001 – All rights reserved 1
---------------------- Page: 6 ----------------------
ISO/IEC 8859-11:2001(E)
ISO/IEC 2022:1994, Information technology – Thebitcombinationsmaybeinterpretedto
Character code structure and extension techniques representnumbersinbinarynotationbyattributing
thefollowingweightstotheindividualbits:
ISO/IEC 4873:1991, Information technology –
ISO 8-bit code for information interchange –
Bit bbbbbbbb
87654321
Structure and rules for implementation
Weight1286432168421
ISO/IEC 8824-1:1998, Information technology –
Abstract Syntax Notation One (ASN.1): Specifica-
Usingtheseweights,thebitcombinationsare
tion of basic notation
identifiedbynotationsoftheformxx/yy,wherexx
andyyarenumbersintherange00to15.The
correspondencebetweenthenotationsoftheform
4 Termsaandddefinitions
xx/yyandthebitcombinationsconsistingofthebits
For the purposes of this part of ISO/IEC 8859, the
btobisasfollows:
81
following terms and definitions apply.
–xxisthenumberrepresentedbyb,b,band
876
4.1 bit combination: An ordered set of bits used
bwherethesebitsaregiventheweights8,4,2,
5
for the representation of characters.
and1respectively.
4.2 byte: Abitstringthatisoperateduponasaunit.
–yyisthenumberrepresentedbyb,b,band
432
bwherethesebitsaregiventheweights8,4,2,
4.3 character: A member of a set of elements
1
and1respectively.
usedfortheorganization, control, orrepresentation
ofdata.
Thebitcombinationsarealsoidentifiedbynotations
oftheformhk,wherehandkarenumbersinthe
4.4 code table: A table showing the characters
range0toFinhexadecimalnotation.Thenumber
allocated toeachbitcombination inacode.
histhesameasthenumberxxdescribedabove,
4.5 coded character set; code: A set of
andthenumberkthesameasthenumberyy
unambiguousrulesthatestablishesacharacterset
describedabove.
and the one-to-one relationship between the
charactersofthesetandtheirbitcombinations. 5.2Layoutofthecodetable
4.6 coded-character-data-element (CC-data-
An8-bitcodetableconsistsof256positions
element): Anelementofinterchangedinformation
arrangedin16columnsand16rows.Thecolumns
thatisspecifiedtoconsistofasequenceofcoded
andtherowsarenumbered00to15.Inhexa-
representationsofcharacters,inaccordancewith
decimalnotationthecolumnsandtherowsare
one or more identified standards for coded
numbered0toF.
charactersets.
Thecodetablepositionsareidentifiedbynotations
4.7graphiccharacter:Acharacter,otherthana
oftheformxx/yy,wherexxisthecolumnnumber
controlfunction,thathasavisualrepresentation
andyyistherownumber.Thecolumnandrow
normallyhandwritten,printedordisplayed,andthat
numbersareshownatthetopandleftedgesofthe
hasacodedrepresentationconsistingofoneor
tablerespectively.Thecodetablepositionsare
morebitcombinations.
alsoidentifiedbynotationsoftheformhk,whereh
NOTE–InISO/IEC8859asinglebitcombinationisusedisthecolumnnumberandkistherownumberin
torepresenteachcharacter.
hexadecimalnotation.Thecolumnandrow
numbersareshownatthebottomandrightedgesof
4.8graphicsymbol:Avisualrepresentationofa
thetablerespectively.
graphiccharacterorofacontrolfunction.
Thepositionsofthecodetableareinone-to-one
4.9position:Thatpartofacodetableidentified
correspondencewiththebitcombinationsofthe
byitscolumnandrowcoordinates.
code.Thenotationofacodetableposition,ofthe
formxx/yy,oroftheformhk,isthesameasthatof
5Notation,codetableandnames
thecorrespondingbitcombination.
5.1Notation
5.3Namesandmeanings
Thebitsofthebitcombinationsofthe8-bitcodeare
identifiedbyb,b,b,b,b,b,b,andb,where
8765432 1 ThispartofISO/IEC8859assignsauniquename
bisthehighest-order,ormost-significantbitandb
8 1andauniqueidentifiertoeachgraphiccharacter.
isthelowest-order,orleast-significantbit.
Thesenamesandidentifiershavebeentakenfrom
ISO/IEC10646-1(E).ThispartofISO/IEC8859
2 © ISO/IEC 2001 – All rights reserved
---------------------- Page: 7 ----------------------
ISO/IEC 8859-11:2001(E)
also specifies an acronym for each of the characters 5.3.3 SOFT HYPHEN (SHY)
SPACE, NO-BREAK SPACE and SOFT HYPHEN.
A graphic character that is imaged by a graphic
For acronyms only Latin capital letters A to Z are
symbol identical with, or similar to, that representing
used. It is intended that the acronyms be retained in
HYPHEN, for use when a line break has been
all translations of the text.
established within a word.
Except for SPACE (SP), NO-BREAK SPACE (NBSP)
and SOFT HYPHEN (SHY), this part of ISO/IEC 8859
6 Specification of the coded character set
does not define and does not restrict the meanings of
This part of ISO/IEC 8859 specifies 183 characters
graphic characters.
allocated to the bit combinations of the code table
This part of ISO/IEC 8859 specifies a graphic symbol
(table 2).
for each graphic character. This symbol is shown in
Some of these characters are combining characters.
the corresponding position of the code table.
They are identified in table 1 as such.
However, this part, or any other part, of ISO/IEC 8859
does not specify a particular style or font design for
NOTE – Combining characters are described in ISO/IEC
imaging graphic characters. Annex B of ISO/IEC
2022:1994 subclause 6.3.3.
10367 gives further information on this subject.
Control functions, such as BACKSPACE or
5.3.1 SPACE (SP)
CARRIAGE RETURN, shall not be used to create
composite graphic symbols, which are made up from
A graphic character the visual representation of which
the graphic representations of two or more characters.
consists of the absence of a graphic symbol.
6.1 Characters of the set and their coded
5.3.2 NO-BREAK SPACE (NBSP)
representation
A graphic character the visual representation of which
See table 1.
consists of the absence of a graphic symbol, for use
when a line break is to be prevented in the text as
presented.
© ISO/IEC 2001 – All rights reserved 3
---------------------- Page: 8 ----------------------
ISO/IEC 8859-11:2001(E)
Table 1 – Character set, coded representation Table 1 (continued)
Bit Bit
combi- Hex Identifier Name combi- Hex Identifier Name
nation nation
02/00 20 U+0020 SPACE 05/00 50 U+0050 LATIN CAPITAL LETTER P
02/01 21 U+0021 EXCLAMATION MARK 05/01 51 U+0051 LATIN CAPITAL LETTER Q
02/02 22 U+0022 QUOTA
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.