Information technology — Universal Multiple-Octet Coded Character Set (UCS) — Part 1: Architecture and Basic Multilingual Plane — Amendment 1: Mathematical symbols and other characters

Technologies de l'information — Jeu universel de caractères codés sur plusieurs octets (JUC) — Partie 1: Architecture et plan multilingue de base — Amendement 1: Symboles mathématiques et autres caractères

General Information

Status
Withdrawn
Publication Date
17-Jul-2002
Withdrawal Date
17-Jul-2002
Current Stage
9599 - Withdrawal of International Standard
Completion Date
08-Apr-2004
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 10646-1:2000/Amd 1:2002 - Information technology — Universal Multiple-Octet Coded Character Set (UCS) — Part 1: Architecture and Basic Multilingual Plane — Amendment 1: Mathematical symbols and other characters Released:12/13/2002
English language
115 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 10646-1
Second edition
2000-09-15
AMENDMENT 1
2002-07-15
Corrected version
2002-12-15


Information technology — Universal
Multiple-Octet Coded Character Set
(UCS) —
Part 1:
Architecture and Basic Multilingual Plane
AMENDMENT 1: Mathematical symbols and
other characters
Technologies de l'information — Jeu universel de caractères codés sur
plusieurs octets (JUC) —
Partie 1: Architecture et plan multilingue de base
AMENDEMENT 1: Symboles mathématiques et autres caractères




Reference number
ISO/IEC 10646-1:2000/Amd.1:2002(E)
©
 ISO/IEC 2002

---------------------- Page: 1 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002(E)
PDF disclaimer
This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not
be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this
file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this
area.
Adobe is a trademark of Adobe Systems Incorporated.
Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters
were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event
that a problem relating to it is found, please inform the Central Secretariat at the address given below.


©  ISO/IEC 2002
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic
or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body
in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.ch
Web www.iso.ch
Printed in Switzerland

ii © ISO/IEC 2002 – All rights reserved

---------------------- Page: 2 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002(E)
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission)
form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC
participate in the development of International Standards through technical committees established by the
respective organization to deal with particular fields of technical activity. ISO and IEC technical committees
collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in
liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have
established a joint technical committee, ISO/IEC JTC 1.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 3.
The main task of the joint technical committee is to prepare International Standards. Draft International Standards
adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International
Standard requires approval by at least 75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this Amendment may be the subject of patent
rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Amendment 1 to International Standard ISO/IEC 10646-1:2000 was prepared by Joint Technical Committee
ISO/IEC JTC 1, Information technology, Subcommittee SC 2, Coded character sets.
This corrected version of ISO/IEC 10646-1:2000/Amd.1:2002 includes the electronic attachments that were omitted
in the original version.

© ISO/IEC 2002 – All rights reserved iii

---------------------- Page: 3 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)


Information technology — Universal Multiple-Octet Coded
Character Set (UCS) —

Part 1:
Architecture and Basic Multilingual Plane

AMENDMENT 1: Mathematical symbols and other characters
short identifier for that code position can be used to
Page v, Foreword
refer to the character allocated at that code position.
Replace:
NOTE 1 – For instance, U+DC00 identifies a code
— Part 2: Secondary Multilingual Plane for
position that is permanently reserved for UTF-16, and
scripts and symbols, Supplementary Plane for CJK
U+FFFF identifies a code position that is permanently
Ideographs, Special Purpose Plane
reserved. U+0025 identifies a code position to which
a character is allocated; U+0025 also identifies that
by:
character (named PERCENT SIGN).
— Part 2: Supplementary Planes
Rename the existing note to NOTE 2, and increment
Remove the text “Additional parts will specify other
all other note numbers of the sub-clause by one
planes”.
(NOTE 3 and 4).

In clause 6.5.b change “It is not defined if the first
Page 2, clause 3
four digits of the eight-digit form are not all zeros” to
Append the following references: “It is not defined if the eight-digit form is greater than
0010FFFF” and append at the end of the text “Lead-
ISO/IEC 10646-2:2001, Information Technology –
ing zeros beyond four digits are suppressed.”.
Universal Multiple-Octet Coded Character Set (UCS)
– Part 2: Supplementary Planes. Change “{+}xxxx” in the BNF form to “{+}(xxxx |
xxxxx | xxxxxx)”.
Unicode Standard Annex, UAX#9, The Unicode Bidi-
rectional Algorithm, Version 3.1.0, 2001-03-23.
Add a clause 6.5f:

f. For the 8 digit forms, the characters SPACE or
NO-BREAK SPACE may optionally be inserted
Page 6, figure 2
before the four last digits.
In the legend “Private use planes 0F, 10, E0-FF” de-

lete “E0-FF”, and delete the upward arrows pointing
Page 8, clause 6
to planes E0 to FF.
After clause 6.5, add a new clause 6.6 (UCS Se-

quence Identifiers) and a note:
Page 7, clause 6
“ISO/IEC 10646 defines an identifier for any se-
Change “four-digit form” and “4-digit form” to “four-
quence of code positions taken from the standard.
to-six-digit form” throughout.
Such an identifier is known as a UCS Sequence
Identifier (USI). For a sequence of n code positions it
In clause 6.5, replace title, first paragraph and add
has the following form:
note as follows:

6.5 Short identifiers for code positions (UIDs)
where UID1, UID2, etc. represent the short identifiers
ISO/IEC 10646 defines short identifiers for each
of the corresponding code positions, in the same
code position, including code positions that are re-
order as those code positions appear in the se-
served. A short identifier for any code position is dis-
quence. If each of the code positions in such a se-
tinct from a short identifier for any other code posi-
quence has a character allocated to it, the USI can
tion. If a character is allocated at a code position, a
© ISO/IEC 2002 – All rights reserved
1

---------------------- Page: 4 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

be used to identify the sequence of characters allo-
9.1 Planes reserved for future standardization
cated at those code positions. The syntax for UID1,
Planes 11 to FF in Group 00 and all planes in any
UID2, etc. is specified in clause 6.5. A COMMA
other groups (i.e. Planes 00 to FF in Groups 01 to
character (optionally followed by a SPACE character)
7F) are reserved for future standardization, and thus
separates the UIDs. The UCS Sequence Identifier
those code positions shall not be used for any other
shall include at least two UIDs; it shall begin with a
purpose.
LESS-THAN SIGN and be terminated by a
Code positions in these planes do not have a map-
GREATER-THAN SIGN.”
ping to the UTF-16 form (see Annex C).
NOTE – UCS Sequences Identifiers cannot be
NOTE – To ensure continued interoperability between
used for specification of subset and collection
the UTF-16 form and other coded representations of
content. They may be used outside this standard
the UCS, it is intended that no characters will be allo-
to identify: composite sequences for mapping
cated to code positions in Planes 11 to FF in Group 00
purposes, font repertoire, etc.
or any planes in any other groups.

9.2 Planes accessible by UTF-16
Page 8, clause 7
Each code position in Planes 01 to 10 of Group 00
Remove the note.
has a unique mapping to a four-octet sequence in
accordance with the UTF-16 form of coded repre-
sentation (see annex C). This form is compatible
Page 8, Clause 8
with the two-octet BMP form of UCS-2 (see 13.1).
Insert the following note after the first paragraph

which ends “UCS-2 (see 13.1)”.
Page 8, clause 10
NOTE 1 – Since UCS-2 only contains the repertoire of
the BMP it is not fully interoperable with UCS-4, UTF-8
Replace sub-clause 10.2 to read as follows:
and UTF-16.
10.2 Code positions for private use characters
Insert the following paragraph and note after the
The code positions of Plane 0F and Plane 10 of
second paragraph which ends “reserved for control
Group 00 shall be for private use.
characters.”
The 6400 code positions E000 to F8FF of the Basic
Code positions 0000 2060 to 0000 206F, 0000 FFF0
Multilingual Plane shall be for private use.
to 0000 FFFC, and 000E 0000 to 000E 0FFF are
The contents of these code positions are not speci-
reserved for Alternate Format Characters (see annex
fied in ISO/IEC 10646 (see 10.1).
F).

NOTE 2 – Unassigned code positions in those ranges
may be ignored in normal processing and display.
Page 9, clause 14

In the first paragraph, change the text “listed in an-
Replace the last paragraph as follows:
nex B” to “listed in annex B of each part of ISO/IEC
10646”.
Code positions 0000 FDD0 to 0000 FDEF,
0000 FFFE, and 0000 FFFF are permanently re-

served.
Page 9, clause 14.1
NOTE 3 – Code position 0000 FFFE is reserved for
Change the text “(see clause B.1)” to “(see clause
“signature” (see annex H). Code positions 0000 FDD0
B.1 of each part of ISO/IEC 10646)”.
to 0000 FDEF, and 0000 FFFF can be used for inter-
nal processing uses requiring numeric values which

are guaranteed not to be coded characters, such as in
Page 9, clause 14.2
terminating tables, or signaling end-of-text. Further-
more, since 0000 FFFF is the largest BMP value, it
Change the text “listed in clause B.2” to “listed in
may also be used as the final value in binary or se-
clause B.2 of each part of ISO/IEC 10646”.
quential searching index within the context of UCS-2

or UTF-16.”
NOTE 4 – A “permanently reserved” code position
Page 12, clause 19
cannot be changed by future amendments.
Add a new clause 19.1 at the end of clause 19.

19.1 Directional of bi-directional text
Page 8, clause 9
The Unicode Bidirectional Algorithm describes the
Replace clause 9 to read as follows:
algorithm used to determine the directionality for bidi-
9. Other planes rectional text.
2    © ISO/IEC 2002 – All rights reserved

---------------------- Page: 5 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

Page 13, Clause 22 Compatibility characters • 13-20 octet: Kanji J sources (J3-hhhh ),
(J4-hhhh ).
Add the following sub-clauses:
• 21-28 octet: Hanja K sources (K0-hhhh ).
22.1 Compatibility Ideographs
• 29-36 octet : Unicode U sources (U0-hhhh )
The CJK compatibility ideographs (characters part of
the CJK COMPATIBILITY IDEOGRAPHS-2001 col-
The format definition uses ‘h’ as a hexadecimal unit.
lection) are ideographs that should have been unified
Uppercase characters and all other symbols includ-
with one of the CJK unified ideographs (characters
ing the space character between parentheses ap-
part of the CJK UNIFIED IDEOGRAPHS-2001 col-
pear as shown.
lection), per the unification rule described in Annex S
of this International Standard.
Click on this highlighted text to access the reference
However, they are included in this International
file.
Standard as separate characters, because, based
NOTE 2 – The content is also available as a separate
on various national, cultural, or historical reason for
viewable file in the same file directory as this docu-
some specific country and region, some national and
ment. The file is named: “CJKC0SR.txt”.
regional standards assign separate code positions

for them.
Page 13, clause 24
NOTE – For this reason, compatibility ideographs
In the first paragraph, change the text “clause B.1” to
should only be used for maintaining and guarantee-
“clause B.1 of each part of ISO/IEC 10646”, and
ing a round trip conversion with the specific national,
regional, or other standard. Other usage is strongly change the text “clause B.2” to “clause B.2 of each
discouraged.
part of ISO/IEC 10646”.

22.2 Source references for CJK Compatibility
Ideographs
Page 14, clause 24
ISO/IEC 10646-1 provides the source reference for
A sub-clause 24.5 is added to the clause 24 Combin-
CJK Compatibility Ideographs included in the Basic
ing Characters as follows:
Multilingual Plane. ISO/IEC 10646-2 provides the
24.5 Variation selectors
source reference for CJK Compatibility Ideographs
included in the Supplementary Planes.
Variation selectors are combining characters follow-
ing immediately a specific base character to indicate
The source references for the CJK Compatibility
a specific variant form of graphic symbol for that
Ideographs included in the BMP are provided in a
character. Some variation selectors are specific to a
machine-readable format that is accessible as a link
script, such as the Mongolian free variation selec-
to this document. The content pointed to by this link
tors, others are used with various other base charac-
is also normative.
ters such as the mathematical symbols. Variations
selectors following other characters have no effect
NOTE 1 – The referenced file is only available to us-
ers who obtain their copy of the standard in a ma- on the selection of the graphic symbol for that char-
chine-readable form. However, the file format makes
acter.
it printable.
No sequences using characters from VARIATION
SELECTOR-2 to VARIATION SELECTOR-16 are
The Kanji J sources are:
defined at this time.
J3 JIS X 213:2000 level-3
The following table provides a description of the
J4 JIS X 213:2000 level-4
variant appearances corresponding to the use of
The Hanja K source is: appropriate variation selectors with all allowed base
mathematical symbols.
K0 KS C 5601-1987
NOTE 1 – The VARIATION SELECTOR-1 is the
The Unicode U source is:
only variation selector used with mathematical
symbols.
U0 The Unicode Standard 3.0-2000


The content linked to is a plain text file that specifies, Sequence Description of variant appearance
(UID notation)
after a 8-line header, as many lines as CJK Com-
<2229, FE00> INTERSECTION with serifs
patibility Ideographs; each containing the following
<222A, FE00> UNION with serifs
information organized in fixed width fields:
<2268, FE00> LESS-THAN BUT NOT EQUAL TO with
• 01-06 octet: BMP code value (0hhhh )
vertical stroke
• 07-12 octet: Corresponding CJK Unified Ideo-
<2269, FE00> GREATER-THAN AND NOT DOUBLE
graph (0hhhh ).
EQUAL with vertical stroke
© ISO/IEC 2002 – All rights reserved
3

---------------------- Page: 6 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

<2272, FE00> LESS-THAN OR EQUIVALENT TO fol- <1822, 180B> medial MONGOLIAN LETTER I
lowing the slant of the lower leg second form
<2273, FE00> GREATER-THAN OR EQUIVALENT TO <1823, 180B> medial, MONGOLIAN LETTER O
following the slant of the lower leg final second form
<2278, FE00> NEITHER LESS-THAN NOR GREATER- <1824, 180B> medial MONGOLIAN LETTER U
THAN with vertical stroke second form
<2279, FE00> NEITHER GREATER-THAN NOR LESS- <1825, 180B> medial, MONGOLIAN LETTER OE
THAN with vertical stroke final second form
<228A, FE00> SUBSET OF WITH NOT EQUAL TO with <1825, 180C> medial MONGOLIAN LETTER OE
stroke through bottom members third form
<228B, FE00> SUPERSET OF WITH NOT EQUAL TO <1826, 180B> isolate, MONGOLIAN LETTER UE
with stroke through bottom members medial, second form
<2293, FE00> SQUARE CAP with serifs final
<1826, 180C> medial MONGOLIAN LETTER UE
<2294, FE00> SQUARE CUP with serifs
third form
<2295, FE00> CIRCLED PLUS with white rim
<1828, 180B> initial, MONGOLIAN LETTER NA
<2297, FE00> CIRCLED TIMES with white rim
medial second form
<229C, FE00> CIRCLED EQUALS equal sign touching
<1828, 180C> medial MONGOLIAN LETTER NA
the circle
third form
<22DA, FE00> LESS-THAN EQUAL TO OR GREATER-
<1828, 180D> medial MONGOLIAN LETTER NA
THAN with slanted equal
separate form
<22DB, FE00> GREATER-THAN EQUAL TO OR LESS-
<182A, 180B> final MONGOLIAN LETTER BA
THAN with slanted equal
alternative form
<2A3C, FE00> INTERIOR PRODUCT tall variant with
<182C, 180B> initial, MONGOLIAN LETTER QA
narrow foot
medial second form
<2A3D, FE00> RIGHTHAND INTERIOR PRODUCT tall
<182C, 180B> isolate MONGOLIAN LETTER QA
variant with narrow foot
feminine second form
<2A9D, FE00> SIMILAR following the slant of the upper
<182C, 180C> medial MONGOLIAN LETTER QA
leg OR LESS-THAN
third form
<2A9E, FE00> SIMILAR following the slant of the upper
<182C, 180D> medial MONGOLIAN LETTER QA
leg OR GREATER-THAN
fourth form
<2AAC, FE00> SMALLER THAN OR EQUAL TO with
<182D, 180B> initial, MONGOLIAN LETTER GA
slanted equal
medial second form
<2AAD, FE00> LARGER THAN OR EQUAL TO with
<182D, 180B> final MONGOLIAN LETTER GA
slanted equal
feminine form
<2ACB, FE00> SUBSET OF ABOVE NOT EQUAL TO
<182D, 180C> medial MONGOLIAN LETTER GA
with stroke through bottom members
third form
<2ACC, FE00> SUPERSET OF ABOVE NOT EQUAL
<182D, 180D> medial MONGOLIAN LETTER GA
TO with stroke through bottom members
feminine form

<1830, 180B> final MONGOLIAN LETTER SA
The following table provides a description of the
second form
variant appearances corresponding to the use of
<1830, 180C> final MONGOLIAN LETTER SA
appropriate variation selectors with all allowed base
third form
Mongolian characters. Only some presentation forms
<1832, 180B> medial MONGOLIAN LETTER TA
of the base Mongolian characters used with the second form
Mongolian free variation selectors produce variant <1833, 180B> initial, MONGOLIAN LETTER DA
medial, second form
appearances. These combinations are described in
final
the following table.
<1835, 180B> final MONGOLIAN LETTER JA
NOTE 2 – The Mongolian characters have various
second form
presentation forms depending on their position in a
<1836, 180B> initial, MONGOLIAN LETTER YA
CC-data element. These presentations forms are
medial second form
called isolate, initial, medial and final.
<1836, 180C> medial MONGOLIAN LETTER YA

third form
Sequence position Description of variant ap- <1838, 180B> final MONGOLIAN LETTER WA
second form
(UID notation) pearance
<1844, 180B> medial MONGOLIAN LETTER
<1820, 180B> isolate, MONGOLIAN LETTER A
TODO E second form
medial, second form
<1845, 180B> medial MONGOLIAN LETTER
final
TODO I second form
<1820, 180C> medial MONGOLIAN LETTER A
third form <1846, 180B> medial MONGOLIAN LETTER
<1821, 180B> initial, MONGOLIAN LETTER E TODO O second form
final second form

4    © ISO/IEC 2002 – All rights reserved

---------------------- Page: 7 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

<1847, 180B> isolate, MONGOLIAN LETTER <1887, 180D> final MONGOLIAN LETTER ALI
medial, TODO U second form GALI A fourth form
final <1888, 180B> final MONGOLIAN LETTER ALI
<1847, 180C> medial MONGOLIAN LETTER GALI I second form
TODO U third form
<188A, 180B> initial, MONGOLIAN LETTER ALI
<1848, 180B> medial MONGOLIAN LETTER medial GALI NGA second form
TODO OE second form
<1849, 180B> isolate, MONGOLIAN LETTER NOTE 3 – The variation selector only selects a different
medial TODO UE second form appearance of an already encoded character. It is not
intended as a general code extension mechanism. Only
<184D, 180B> initial, MONGOLIAN LETTER
the sequences specifically defined in this annex are sanc-
medial TODO QA feminine form
tioned for standard use; all other sequences are unde-
<184E, 180B> medial MONGOLIAN LETTER
fined. No sequences containing combining characters or
TODO GA second form
composite characters will be defined.
<185D, 180B> medial, MONGOLIAN LETTER

final SIBE E second form
NOTE 4 – The exhaustive list of standardized variants is
<185E, 180B> medial, MONGOLIAN LETTER
also described as StandardizedVariants.html in the Uni-
final SIBE I second form
code character database.
<185E, 180C> medial, MONGOLIAN LETTER

final SIBE I third form
<1860, 180B> medial, MONGOLIAN LETTER
Page 880, Annex A.1
final SIBE UE second form
Add a ‘*’ (for fixed collection) to the following collec-
<1863, 180B> medial MONGOLIAN LETTER
tion:
SIBE KA second form
<1868, 180B> initial, MONGOLIAN LETTER
38 ARROWS
medial SIBE TA second form
39 MATHEMATICAL OPERATORS,
<1868, 180C> medial MONGOLIAN LETTER
45 BLOCK ELEMENTS,
SIBE TA third form
46 GEOMETRIC SHAPES,
<1869, 180B> initial, MONGOLIAN LETTER
49 CJK SYMBOLS AND PUNCTUATION
medial SIBE DA second form
51 KATAKANA
<186F, 180B> initial, MONGOLIAN LETTER
In the list of collection numbers and names, after
medial SIBE ZA second form
<1873, 180B> medial, MONGOLIAN LETTER 91 TIBETAN
final MANCHU I second form
insert new entries as follows:
<1873, 180C> medial, MONGOLIAN LETTER
92 CYRILLIC SUPPLEMENT 0500-052F
final MANCHU I third form
93 TAGALOG 1700-171F
<1873, 180D> medial MONGOLIAN LETTER
94 HANUNOO 1720-173F
MANCHU I fourth form
95 BUHID 1740-175F
<1874, 180B> medial MONGOLIAN LETTER
96 TAGBANWA 1760-177F
MANCHU KA second form
97 MISCELLANEOUS MATHEMATICAL
<1874, 180B> final MONGOLIAN LETTER
SYMBOLS-A 27C0-27EF
MANCHU KA feminine first
98 SUPPLEMENTAL ARROWS-A 27F0-27FF *
form
99 SUPPLEMENTAL ARROWS-B 2900-297F *
<1874, 180C> medial MONGOLIAN LETTER
100 MISCELLANEOUS MATHEMATICAL
MANCHU KA feminine first
SYMBOLS-B 2980-29FF *
form
101 SUPPLEMENTAL MATHEMATICAL
<1874, 180C> final MONGOLIAN LETTER
OPERATORS 2A00-2AFF *
MANCHU KA feminine sec-
102 KATAKANA PHONETIC
ond form
EXTENSIONS 31F0-31FF *
<1874, 180D> medial MONGOLIAN LETTER
103 VARIATION SELECTORS FE00-FE0F *
MANCHU KA feminine sec-

ond form
<1876, 180B> initial, MONGOLIAN LETTER
after
medial MANCHU FA second form
271 COMBINING CHARACTERS B-2
<1880, 180B> all MONGOLIAN LETTER ALI
GALI ANUSVARA ONE sec-
insert new entries as follows:
ond form
281 MES-1 SEE A.4.1 *
<1881, 180B> all MONGOLIAN LETTER ALI
282 MES-2 SEE A.4.2 *
GALI VISARGA ONE sec-

ond form
<1887, 180B> isolate, MONGOLIAN LETTER ALI
Page 881, annex A.1
final GALI A second form
In the list of collections numbers and names, after
<1887, 180C> final MONGOLIAN LETTER ALI
302 BMP SECOND EDITION See A3.3 *
GALI A third form
© ISO/IEC 2002 – All rights reserved
5

---------------------- Page: 8 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

insert the following text: Replace the Note 2 as follows:
NOTE 2 – Collections numbered 57, 58, and 59 were
The following collections contain characters both
specified in the First Edition of this International Standard
inside and outside the Basic Multilingual Plane.
but have now been deleted. Collections numbered 400
303 UNICODE 3.1 See A5.1 *
and 500 were specified in the First and Second Editions of
304 UNICODE 3.2 See A5.2 *
this Standard but have now been deleted.
380 CJK UNIFIED IDEOGRAPHS-2001 *
In the alphabetical list of keywords, add collections
3400-4DB5
4E00-9FA5 “98” and “99” to the entry “Arrows”;
FA0E-FA0F
Add collection “92” to the entry “Cyrillic”;
FA11
Add collection “102” to the entry “Katakana”;
FA13-FA14
FA1F
Add collection “101” to the entry “Mathematical op-
FA21
erators”;
FA23-FA24
FA27-FA29 Add collection “401” and remove “400 500” from the
20000-2A6D6
entry “Private use”;
381 CJK COMPATIBILITY IDEOGRAPHS-2001*
Add collections “97” and “100” to the entry “Sym-
F900-FA0D
bols”.
FA10
FA12

FA15-FA1E
In the alphabetical list of keywords, after
FA20
Braille patterns
FA22
insert
FA25-FA26
Buhid 95
FA2A-FA6A
2F800-2FA1D
After
10646 UNICODE 0000-FDCF
Hangul
FDF0-FFFD
insert
10000-1FFFD
Hanunoo 94
20000-2FFFD
30000-3FFFD
After
40000-4FFFD
Mathematical operators
50000-5FFFD
insert
60000-6FFFD
Mathematical symbols 97 and 100
70000-7FFFD
MES 281 282
80000-8FFFD
After
90000-9FFFD
A0000-AFFFD Syriac
B0000-BFFFD
insert
C0000-CFFFD
Tagalog 93
D0000-DFFFD
Tagbanwa 96
E0000-EFFFD
After
F0000-FFFFD
100000-10FFFD Tibetan

insert
NOTE – The UNICODE collection incorporates all
Unicode 303, 304, 10646
characters currently encoded in the standard.
Variation selectors 103

In the alphabetical list of keywords, add a tab be-
Page 881, annex A.1 tween “Yi” and the collection numbers “76 77”.
Under collection 400, change “PRIVATE USE
PLANES G=00, P=0F, 10, & E0-FF” to “(This collec-
Page 881, Annex A.2
tion number shall not be used, see Note 2.)”.
In the list of blocks in the BMP, after
Add the following collection after 400:
CYRILLIC
401 PRIVATE USE PLANES-0F-10 G=00, P=0F-10
insert a new entry as follows:
Under collection 500, change “PRIVATE USE
CYRILLIC SUPPLEMENT 0500-052F
GROUPS G=60-7F” to “(This collection number shall
after
not be used, see Note 2.)”.
RUNIC
Under Note 1, change the text “90, and 91” to “90,
insert new entries as follows:
91, 93, 94, 95, and 96”. TAGALOG 1700-171F
HANUNOO 1720-173F
6    © ISO/IEC 2002 – All rights reserved

---------------------- Page: 9 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

BUHID 1740-175F 3400-4DBF
TAGBANWA 1760-177F
is changed to:
after
3400-4DB5
DINGBATS

insert new entries as follows:
MISCELLANEOUS MATHEMATICAL
Page 884, annex A
SYMBOLS-A 27C0-27EF
At the end of Annex A add new clauses A.4 and A.5
SUPPLEMENTAL ARROWS-A 27F0-27FF
as follows.
after
A.4 Other collections within the BMP
BRAILLE PATTERNS
insert new entries as follows:
The collections specified within this clause are en-
SUPPLEMENTAL ARROWS-B 2900-297F
tirely within Plane 00.
MISCELLANEOUS MATHEMATICAL
NOTE – The acronym MES indicates Multilingual
SYMBOLS-B 2980-29FF
European Subset.
SUPPLEMENTAL MATHEMATICAL
OPERATORS 2A00-2AFF

after
A.4.1 281 MES-1
BOPOMOFO EXTENDED
281 MES-1 is specified by the following ranges of
insert a new entry as follows:
code positions as indicated for each row.
KATAKANA PHONETIC
EXTENSIONS 31F0-31FF Rows Positions (cells)
00 20-7E A0-FF
after
01 00-13 16-2B 2E-4D 50-7E
ARABIC PRESENTATION FORMS-B
02 C7 D8-DB DD
insert a new entry as follows:
20 15 18-19 1C-1D AC
VARIATION SELECTORS FE00-FE0F
21 22 26 5B-5E 90-93
26 6A


Page 882, annex A.3.1 301 BMP-AMD.7
A.4.2 282 MES-2
Within the row 0B entry, replace the following se-
quence:
282 MES-2 is specified by the following ranges of
.. 8E-90 92-25 99-9A ..
code positions as indicated for each row.
by:
Rows Positions (cells)
00 20-7E A0-FF
.. 8E-90 92-95 99-9A ..
01 00-7F 8F 92 B7 DE-EF FA-FF

02 18-1B 1E-1F 59 7C 92 BB-BD C6-C7 C9 D8-DD
EE
Page 883, annex A.3.3 302 BMP Second Edition
03 74-75 7A 7E 84-8A 8C 8E-A1 A3-CE D7 DA-E1
Row 02 entry::
04 00-5F 90-C4 C7-C8 CB-CC D0-EB EE-F5 F8-F9
1E 02-03 0A-0B 1E-1F 40-41 56-57 60-61 6A-6B 80-
00-33 50-AD B0-EE
85 9B F2-F3
is changed to:
1F 00-15 18-1D 20-45 48-4D 50-57 59 5B 5D 5F-7D
00-1F 22-33 50-AD B0-EE 80-B4 B6-C4 C6-D3 D6-DB DD-EF F2-F4 F6-FE
20 13-15 17-1E 20-22 26 30 32-33 39-3A 3C 3E 44
Row 07 entry:
4A 7F 82 A3-A4 A7 AC AF
00-0D 0F-2C 30-4A 80-BF
21 05 16 22 26 5B-5E 90-95 A8
22 00 02-03 06 08-09 0F 11-12 19-1A 1E-1F 27-2B
is changed to:
48 59 60-61 64-65 82-83 95 97
00-0D 0F-2C 30-4A 80-B0
23 02 10 20-21 29-2A
25 00 02 0C 10 14 18 1C 24 2C 34 3C 50-6C 80 84
Row 0B entry:
88 8C 90-93 A0 AC B2 BA BC C4 CA-CB D8-D9
.. 8E-90 92-25 99-9A ..
26 3A-3C 40 42 60 63 65-66 6A-6B
is changed to: FB 01-02
FF FD
.. 8E-90 92-95 99-9A ..

Row 12 entry:
20-26 28-46 .
is changed to:
00-06 08-46 .
Row 34-4D entry:
© ISO/IEC 2002 – All rights reserved
7

---------------------- Page: 10 ----------------------
ISO/IEC 10646-1:2000/Amd.1:2002 (E)

Collection number and name
A.5 Unicode collections
303 UNICODE 3.1
These collections correspond to Unicode 3.1 and

3.2. They include characters from the BMP as well
Plane 0
as Supplementary Planes.
Collection number and name
A.5.1 303 UNICODE 3.1
98 SUPPLEMENTAL ARROWS-A
303 The fixed collection UNICODE 3.1 consists
99 SUPPLEMENTAL ARROWS-B
of collections from A.3 above and several ranges of
100 MISCELLANEOUS MATHEMATICAL SYMBOLS-B
code positions. The collection list is arranged by
101 SUPPLEMENTAL MATHEMATICAL OPERATORS
planes as follows.
102 KATAKANA PHONETIC EXTENSIONS
103 VARIATION SELECTORS
Plane 0

Collection number and name
Rows Positions (cells)
302 BMP SECOND EDITION
02 20

03 4F 63-6F D8-D9 F6
Row Positions (cells)
04 8A-8B C5-C6 C9-CA CD-CE
03 F4-F5
05 00-0F
06 6E-6F
07 B1
Plane 1
10 F7-F8
Row Positions (cells) 17 00-0C 0E-14 20-36 40-53 60-6C 6E-70 72-73
03 00-1E 20-23 30-4A 20 47 4E-52 57 5F-63 71 B0-B1 E4-EA
04 00-25 28-4D 21 3D-4B F4-FF
D0 00-F5 22 F2-FF
D1 00-26 2A-DD 23 7C 9B-CE
D4 00-54 56-9C 9E-9F A2 A5-A6 A9-AC AE-B9 BB 24 EB-FE
BD-C0 C2-C3 C5-FF 25 96-9F F8-FF
D5 00-05 07-0A 0D-14 16-1C 1E-39 3B-3E 40-44 46 26 16-17 72-7D 80-89
4A-50 52-FF 27 68-75 D0-EB
D6 00-A3 A8-FF 30 3B-3D 95-96 9F-A0 FF
D7 00-C9 CE-FF 32 51-5F B1-BF
A4 A2-A3 B4 C1 C5

FA 30-6
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.