SIST-TP ISO/TR 21636-2:2024
(Main)Language coding - A framework for language varieties - Part 2: Description of the framework
Language coding - A framework for language varieties - Part 2: Description of the framework
This document, and the ISO 21636 series in general, provides the general principles for the identification and description of varieties of individual human languages. It, therefore, does not apply to:
- artificial means of communication with or between machines such as programming languages;
- those means of human communication which are not fully or largely equivalent to human language such as individual symbols or gestures that carry isolated meanings but cannot be freely combined into complex expressions.
This document together with the other parts of the ISO 21636 series establishes the dimensions of linguistic variation as well as core values necessary to identify individual varieties in these dimensions or sub-dimensions.
This document forms the basis for the other parts by outlining the general framework for language varieties.
Identification et description des variétés de langues — Partie 2: Description
Jezikovno kodiranje - Ogrodje za jezikovne različice - 2. del: Opis ogrodja
Ta dokument – in skupina standardov ISO 21636 na splošno – zagotavljata splošna načela za identifikacijo in opis različic posameznih človeških jezikov. Zato se ne uporablja za:
– sredstva umetne komunikacije, ki poteka s stroji ali med njimi, kot so programski jeziki;
– sredstva človeške komunikacije, ki niso v celoti oziroma v velikem delu enakovredna človeškemu jeziku, kot so posamezni simboli ali kretnje z ločenim pomenom, ki jih ni mogoče prosto kombinirati v zapletene izraze.
Ta dokument skupaj z drugimi deli skupine standardov ISO 21636 določa dimenzije jezikovne različice in temeljne vrednosti, potrebne za identifikacijo posameznih različic v teh dimenzijah ali poddimenzijah.
Ta dokument predstavlja osnovo za druge dele, in sicer opisuje splošno ogrodje za jezikovne različice.
General Information
- Status
- Published
- Publication Date
- 06-Oct-2024
- Technical Committee
- IDT - Information, documentation, language and terminology
- Current Stage
- 6060 - National Implementation/Publication (Adopted Project)
- Start Date
- 17-Sep-2024
- Due Date
- 22-Nov-2024
- Completion Date
- 07-Oct-2024
Overview
ISO/TR 21636-2:2023 - Language coding: A framework for language varieties - Part 2: Description of the framework - defines a conceptual framework for identifying and describing language varieties within individual human languages. It complements ISO 639 by extending language coding to internal linguistic variation and provides the general principles, dimensions and core values needed to describe varieties present in language resources (LRs). The report is intended for metadata exchange, archiving and interoperable use of LRs across research, industry and digital services.
Key topics and technical scope
- Scope and exclusions: Applies to human languages and their varieties; excludes programming languages and non-combinatorial human signals (isolated symbols or gestures).
- Framework foundation: Establishes general principles for the identification and description of language varieties and forms the basis for other parts of the ISO 21636 series.
- Dimensions of linguistic variation: Defines independent yet interacting dimensions used to characterize varieties, including:
- Space (geographical/dialectal variation)
- Time (historical and temporal change)
- Social group (sociolects)
- Medium (modality: written, spoken, signed, digital/social-media)
- Situation (register, context of use)
- Individual speaker (idiolect)
- Proficiency (competence level)
- Communicative functioning (purpose/genre)
- Metadata and values: Specifies metadata categories and candidate lists of values to describe LRs precisely and to support future granular coding of varieties.
- Principles for use: Emphasizes interoperable, durable metadata compatible with ISO/IEC 11179 and software/content development recommendations.
Practical applications
- Standardizing metadata for language resources (corpora, lexica, recordings) to improve discoverability, reuse and long-term preservation.
- Supporting development, testing and adaptation of language technologies (speech recognition, NLP, MT) by clearly indicating which language variety a resource represents.
- Enabling consistent description in library, archival and research metadata for linguistic documentation, digital humanities and cultural heritage.
- Informing policy and product decisions in eGovernment, eHealth, eLearning, accessibility and multilingual content delivery where variety-sensitive language support is required.
Who should use this standard
- Linguists, sociolinguists and language archivists
- Language technology developers and data engineers
- Libraries, archives and digital repositories
- Translators, interpreters and language training providers
- ICT and media companies working with multilingual/multimodal content
Related standards
- ISO 639 (language identifiers) - complementary for identifying individual languages
- ISO/IEC 11179 (metadata registries) - conceptual fit for metadata design
- Other parts of the ISO 21636 series - for implementation-specific guidance and coding schemes
Keywords: ISO/TR 21636-2, language coding, language varieties, linguistic variation, metadata, language resources, ISO 639, language technologies.
Frequently Asked Questions
SIST-TP ISO/TR 21636-2:2024 is a technical report published by the Slovenian Institute for Standardization (SIST). Its full title is "Language coding - A framework for language varieties - Part 2: Description of the framework". This standard covers: This document, and the ISO 21636 series in general, provides the general principles for the identification and description of varieties of individual human languages. It, therefore, does not apply to: - artificial means of communication with or between machines such as programming languages; - those means of human communication which are not fully or largely equivalent to human language such as individual symbols or gestures that carry isolated meanings but cannot be freely combined into complex expressions. This document together with the other parts of the ISO 21636 series establishes the dimensions of linguistic variation as well as core values necessary to identify individual varieties in these dimensions or sub-dimensions. This document forms the basis for the other parts by outlining the general framework for language varieties.
This document, and the ISO 21636 series in general, provides the general principles for the identification and description of varieties of individual human languages. It, therefore, does not apply to: - artificial means of communication with or between machines such as programming languages; - those means of human communication which are not fully or largely equivalent to human language such as individual symbols or gestures that carry isolated meanings but cannot be freely combined into complex expressions. This document together with the other parts of the ISO 21636 series establishes the dimensions of linguistic variation as well as core values necessary to identify individual varieties in these dimensions or sub-dimensions. This document forms the basis for the other parts by outlining the general framework for language varieties.
SIST-TP ISO/TR 21636-2:2024 is classified under the following ICS (International Classification for Standards) categories: 01.140.20 - Information sciences. The ICS classification helps identify the subject area and facilitates finding related standards.
You can purchase SIST-TP ISO/TR 21636-2:2024 directly from iTeh Standards. The document is available in PDF format and is delivered instantly after payment. Add the standard to your cart and complete the secure checkout process. iTeh Standards is an authorized distributor of SIST standards.
Standards Content (Sample)
SLOVENSKI STANDARD
01-november-2024
Jezikovno kodiranje - Ogrodje za jezikovne različice - 2. del: Opis ogrodja
Language coding — A framework for language varieties — Part 2: Description of the
framework
Identification et description des variétés de langues — Partie 2: Description
Ta slovenski standard je istoveten z: ISO/TR 21636-2:2023
ICS:
01.140.20 Informacijske vede Information sciences
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
TECHNICAL ISO/TR
REPORT 21636-2
First edition
2023-03
Language coding — A framework for
language varieties —
Part 2:
Description of the framework
Identification et description des variétés de langues —
Partie 2: Description
Reference number
© ISO 2023
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Linguistic variation and language varieties . 1
4.1 Linguistic variation . 1
4.2 Dimensions of linguistic variation . 3
4.3 The space dimension and its varieties . 4
4.4 The time dimension and its varieties . 5
4.5 The social group dimension and its varieties . 5
4.6 The medium dimension and its varieties . 6
4.7 The situation dimension and its varieties . 8
4.8 The individual speaker dimension and its varieties . 10
4.9 The proficiency dimension and its varieties . 10
4.10 The communicative functioning dimension and its varieties .12
Bibliography .14
iii
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO’s adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see
www.iso.org/iso/foreword.html.
This document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee SC 2, Terminology workflow and language coding.
A list of all parts in the ISO 21636 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.
iv
Introduction
More and more digital language resources (LRs) are being created (also by retro-digitization), archived,
processed and analysed. In this context, detailed and exact characterization of language varieties
present in a given language use event is quickly gaining importance. Here, language use includes all
modalities such as written, spoken or signed, and also new forms of language use supported by digital
technology (in social media and similar forms of digital communication). But this is just one way in
which languages vary internally. Others include, for instance, the well-known regional (dialectal) and
social variation.
While in the past a primary goal of working with LRs was the archiving and preservation of LRs, new
goals have emerged and are still emerging:
— institutions and individuals need to exchange metadata (that is, bibliographic description data
and other secondary information) for making the information on existing LRs widely available in a
harmonized form;
— researchers are looking for the primary data (that is, the LRs themselves) for many different
research purposes, including research on linguistic variation;
— researchers and developers need LRs for the development of more advanced language technologies
(LTs) and for testing purposes, as LTs, in particular speech recognition and language analysis, are
entering more and more dimensions of human communication.
In order to achieve the above-mentioned goals and purposes, along with others not outlined in this
document, a standardized set of metadata for the identification of language varieties is important
to guarantee frictionless exchange of secondary information. Well-organized metadata also help to
indicate the degree of interoperability (equalling re-usability and re-purposability of LRs), and the
applicability of LTs to different situations or LRs over time. These metadata are applicable in eBusiness,
eHealth, eGovernment, eInclusion, eLearning, smart environments, ambient assisted living (AAL) and
virtually all other applications which depend on information about LRs. A clear metadata approach is
also a prerequisite for the durability of language resource archiving (in particular in the case of cultural
heritage and scientific research data).
1)
The identification of different individual languages is the subject of ISO 639 , which identifies existing
(living, extinct and historical) individual languages, as well as language groups. This document, and
the ISO 21636 series in general, presupposes and complements ISO 639 by extending the language
code framework in order to allow for the identification of language varieties of different types (such
as geographical, social and modal varieties, among others). The identification of language varieties can
then be included in general, library and archival metadata for describing LRs (which can also include
technical information, time and location of recording, and similar general information, which are not
part of the ISO 21636 series).
The provisions of the ISO 21636 series cover:
— a general conceptual framework to deal coherently with language-internal linguistic variation;
— general rules for the identification and description of language varieties;
— a set of dimensions and open-ended or closed lists of values that can be assigned to each respective
dimension;
— a set of metadata categories and examples for the respective possible values, grouped according to
the most important aspects of the description of events of language use and resulting LRs, related
to linguistic variation.
The metadata categories and values addressed in this document can be candidates for a future highly
granular coding of language varieties based on these comprehensive principles. Thus, this document
1) Under preparation. Status at the time of publication: ISO/FDIS 639:2023.
v
(and the ISO 21636 series in general) conforms to the “recommendations on software and content
development principles 2010”, and fits within the general framework of the ISO/IEC 11179 series for
metadata.
Stakeholders include, but are not limited to:
— information and communication technologies (ICTs) industry (including LTs);
— libraries;
— the media industry (including entertainment);
— internet communities;
— people engaging in language documentation and preservation;
— language archivists;
— translators and interpreters;
— researchers (linguists, in particular sociolinguists, ethnologists, sociologists, etc.);
— people and institutions providing language training;
— emerging new user communities.
It is anticipated that these stakeholders need to refer not only to a certain individual language, but
also to a certain language variety, for instance for oral human-computer interaction, or for tailoring
a certain LR or tool to the needs and specific environment of a target user group. In order to identify
the dimension(s) of linguistic variation internal to individual languages involved, and the respective
relevant language varieties, a first step is to achieve the needed specificity. Adapting a conceptually
sound, uniform framework of reference as developed in this document is superior to the proliferation of
different individual ad hoc solutions.
vi
TECHNICAL REPORT ISO/TR 21636-2:2023(E)
Language coding — A framework for language varieties —
Part 2:
Description of the framework
1 Scope
This document, and the ISO 21636 series in general, provides the general principles for the identification
and description of varieties of individual human languages. It, therefore, does not apply to:
— artificial means of communication with or between machines such as programming languages;
— those means of human communication which are not fully or largely equivalent to human language
such as individual symbols or gestures that carry isolated meanings but cannot be freely combined
into complex expressions.
This document together with the other parts of the ISO 21636 series establishes the dimensions of
linguistic variation as well as core values necessary to identify individual varieties in these dimensions
or sub-dimensions.
This document forms the basis for the other parts by outlining the general framework for language
varieties.
2 Normative references
There are no normative references in this document.
3 Terms and definitions
No terms and definitions are listed in this document.
ISO and IEC maintain terminology databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at https:// www .electropedia .org/
4 Linguistic variation and language varieties
4.1 Linguistic variation
Individual human languages differ from one another, and variation exists within each one. Since
language variation is inherent, this variation is also present in LRs. This document covers the
description of LRs that represent instances of use of individual languages regarding their status with
respect to linguistic variation.
While individual languages originally emerged and were mainly used for communication between
humans, their use is increasingly supported by ICTs. Events of language use involving machines are also
covered by this document.
Within individual human languages, linguistic variation occurs in distinct dimensions (listed in 4.2
and described in detail in the remaining text), resulting in different kinds of language varieties. Each
of the dimensions is independent from the others, although mutual influences exist. Each linguistic
manifestation in a given individual language, such as a written text, an utterance, an entry in a lexical
database, etc., can, therefore, be characterized by its location in each of these dimensions of linguistic
variation.
Some language varieties can be further differentiated into sub-varieties, but this level of detail is not
the focus of this document.
Furthermore, the borders between individual languages can be hard to establish, especially when
different groups of people disagree in their assessment of the mutual intelligibility or the socio-political
situation so that the status of a given set of idiolects as an individual language, language variety or
group of related languages is disputed. This difficulty is not addressed in this document (see ISO 639).
Similarly, the borders of language varieties are sometimes hard to establish. In many cases, different
language varieties being distinguished on the same dimension of linguistic variation (for instance
different dialects) do overlap — in other words, there can be idiolects that fulfil all respective criteria
to belong to both varieties. This is particularly true for the time dimension (see 4.4). However, it does
not invalidate the establishment of varieties.
In addition, individual languages and their language varieties are under constant gradual, sometimes
rapid, change so that the following applies:
— A given language variety can over time gain the status of an individual human language (to receive
its own language identifier in ISO 639). For instance, different dialects of Vulgar Latin developed
into distinct Romance languages. Conversely, over time an individual language can develop into a
language variety of another individual language, as appears to have been the case with a postulated
Spanish-based creole language spoken in Cuba that would have developed into a variety of Cuban
Spanish. In this kind of case, the resulting variety would be subject to the framework described in
this document, in which case an existing ISO 639 language identifier assigned to the earlier form of
this language would come to designate an extinct individual language.
— Linguistic expressions and features belonging to a given language variety at one point can shift
along one or more dimensions of linguistic variation (for instance markedly informal expressions
can become acceptable even in formal contexts, or regional expressions can spread over large parts
of the geographical territory where the individual language is spoken).
This document focuses on a framework for the identification and description of linguistic variation
within a language, that is, of language varieties and sub-varieties. It does not attempt to document
all varieties of all individual languages, rather to list the types of descriptors needed to precisely and
fully characterize the linguistic variety represented by an utterance or LR. It does so by describing
the dimensions of variation within languages, and by indicating the major language variations that
typically occur within individual languages.
This document considers individual languages to be sets of idiolects, which in turn are sets of
expressions of human language with individual meanings. Each idiolect is characterized by a coherent
system of structural features, and is homogenous by definition. This understanding of “idiolect” should
not be confused with its use in the sense of “personal variety” — the total repertoire a speaker uses in
a language, which is heterogenous, varying according to different situations and/or media, and thus
unsuited to defining the smallest unit of linguistic variation.
In this framework, all language varieties are subsets of individual languages. Each individual language
is classified into different varieties according to different kinds of external criteria and, at the same
time, of structural criteria (see below). The resulting language varieties in each classification can
overlap, and they can be sub-classified into smaller language varieties, again according to different
criteria.
Each language variety is characterized by structural criteria, in particular, of the sound system
(phonetics and phonology) or its counterpart in other language modalities (the visual-spatial system,
or system of graphemes, etc.), the morphology and syntax, the lexicon and the semantic system. At the
same time, each language variety is also characterized by certain external properties; for example, to
be used in a certain region, at a certain time, in a formal situation. These criteria are organized into a
few major types and constitute the different dimensions of linguistic variation; there is one dimension
of linguistic variation for each relevant major type of external properties of idiolects (such as properties
referring to geographical space, properties referring to time, etc.).
This conceptual framework for language varieties respects the major approaches in linguistic literature
without copying any particular one, though in general orientation and details such as the role assigned
[17]
to idiolects, it is most similar to the work of Lieb .
4.2 Dimensions of linguistic variation
Linguistic manifestations such as LRs and events of language use are characterized by the following
2)
dimensions of linguistic variation (see ISO 21636-1: —, 3.3):
a) space dimension;
b) time dimension;
c) social group dimension;
d) medium dimension;
e) situation dimension;
f) individual speaker dimension;
g) proficiency dimension;
h) communicative functioning dimension.
This document provides a general description of each dimension and its respective varieties. Instructions
on how to identify and indicate the respective varieties belonging to each of these dimensions are given
3)
in ISO 21636-3 . The structure of this document and ISO 21636-3 is strictly parallel so that the reader
can easily compare the general discussion and the instructions.
Although there can be interferences between some dimensions, the dimensions of linguistic variation
are in principle independent of one another. A complete characterization of any given language use
event and any LR with regard to linguistic variation would identify the respective varieties to which the
event or resource belongs in all and each of these eight dimensions of linguistic variation. In practice, a
description can focus only on a few salient or relevant dimensions of linguistic variation, and (perhaps
tacitly) assume default values for other dimensions of linguistic variation.
For reasons of readability, “speaker” is always used generically in this document, covering also more
specific concepts such as “writer”, “signer”, etc. Equally, other comments referring to properties specific
to speaking always hold analogously for the other language modalities.
Language varieties of some dimensions of linguistic variation can have sub-varieties, usually of the
same dimension, such that a given language use event or a given LR can have more than one value with
respect to a dimension of linguistic variation. Where two varieties of the same dimension apply, usually
one variety is more specific and the other broader. For instance, a recording of a speaker using a Norfolk
dialect can also be characterized as belonging to the broader East Anglian dialect of English.
There are even examples for sub-varieties of different dimensions. For instance, the middle Bavarian
period of the Bavarian dialect of German is different from the Bavarian dialect combined with the
middle German period. Such cases are not further addressed in this document.
There can also be mixed events of language use or LRs that contain several events of language use that
belong to different varieties according to the same dimension of linguistic variation. For instance, a
dialogue between speakers uses different dialects, or a dictionary covers several dialects, sociolects,
2) Under preparation. Status at the time of publication: ISO/CD 21636-1:2023.
3) Under preparation. Status at the time of publication: ISO/CD 21636-1:2023.
etc. of an individual language. In such cases, all respective (groups of) language varieties will have
to be identified, and if possible, the respective parts of such an LR will have to be related to their
respective language varieties; for instance, according to the different participants and to the different
time segments of such a dialogue, or to the different entries in such a dictionary.
Another special case involves LRs where the language use is “non-native” in the sense that a speaker
is deliberately imitating a language variety different from his/her own variety. For instance, a speaker
of a dialect X imitates speech of another dialect Y, or a speaker imitates another speaker, or an author
makes use, for example as a stylistic device, of linguistic expressions as conceived as typical for a
certain historical language period, or of a certain social group to which the writer does not belong. In
such cases, it is advisable to state the fact that the variety in question is being imitated. (In general,
[11]
such cases can be dealt with in a way compatible with Extension T of IETF BCP 47 .)
4.3 The space dimension and its varieties
With respect to the space dimension, an individual language can be differentiated into dialects (and
these into sub-dialects) and sometimes also a supra-regional standard variety. This is often the most
complex and differentiated kind of linguistic variation.
EXAMPLE 1 (English dialect): East Anglian.
EXAMPLE 2 (supra-regional standard variety): United Kingdom Standard English.
When determining the dialect of a speaker, the factor that is mainly considered is the geographical
region of the socialization of the speaker, that is, where the speaker grew up and also where the
speaker’s parents grew up. If these factors are heterogeneous, for example due to migration or due to
parents from different regions, sociolinguists try to identify the major dialect which the language use
of the speaker most strongly resembles. In such cases, also minor dialectal influences can be indicated,
for instance from the region of (one of) the speaker’s parents, or of a region where the speaker moved
to in a later phase of his or her life. The geographical region of an individual language can cover several
countries and continents, giving rise to new dialects.
EXAMPLE 3 Western American English with influence from southern British English.
The names given to individual dialects are often traditional and usually refer to the geographical region
where the dialect is spoken. When appropriate, the ISO 3166 series can be applied for countries and
established regions, or ISO 6709 and geographic identifiers in an ISO 19112 reference system for spatial
referencing. How many and which dialects are to be distinguished on a given level of specificity is often
debated between specialists, and so are the names and the borders of the dialects.
Most major and also many smaller, well-researched individual languages are characterized by a finite
number of established major dialects. In the case of little known or disputed language varieties, the
identification of a dialect can be supported by a reference to some scholarly work where the dialect is
established or identified.
EXAMPLE 4 Cheshire English [cf. Leigh E. Introduction in: A Glossary of Words Used in the Dialect of Cheshire
(Hamilton, Adams, and Co./Minshull and Hughes; 1877)].
Dialect areas can overlap and often can best be defined by similarity with some prototypical core
variety. Hence, belonging to a certain dialect can be a question of degree for an idiolect and does not
necessarily exclude it from also being acceptable as a member of another dialect to a certain degree.
In individual languages used in a larger geographical area, there is often one variety, usually based
on one specific traditional dialect or a group of dialects, that is recognized as “standard” by most or
all speakers across the whole or a larger part of the geographical area of the individual language.
The standard variety is often characterized by a high degree of normalization and is used in official
communication. In such cases, many speakers can use both a local dialect and the standard variety.
Again, in the case of the standard variety, often the influence of a local dialect is still evident (for instance
as an “accent”), even with speakers who do not have strong mastery in the original local dialect.
EXAMPLE 5 High German with a Bavarian accent.
In the case of individual languages with a very broad geographical coverage, especially those used in
different countries (e.g. German) and continents (in particular Arabic, Chinese, Dutch, English, French,
Portuguese and Spanish) there can be more than one standard variety.
A special case is posed by diaspora varieties spoken in a geographical region where other individual
languages are more strongly present. Usually the influence of the dominant language is evident in
the individual language in such a situation. This again can influence the native individual language of
speakers who live for a while in such an area, even when they speak their original dialect.
EXAMPLE 6 Urdu spoken in London (with influence from English).
When speakers speak several dialects and/or the standard variety more or less fluently, it can be
necessary to determine which of the dialects or standard variety has been used in each language use
event. For some purposes, it can additionally be useful to indicate which other individual languages and
which other dialects of the individual language in question the speakers are able to speak, because this
can influence their linguistic behaviour, even when speaking another dialect or a standard variety.
4.4 The time dimension and its varieties
With respect to the time dimension, an individual language can be differentiated into language epochs
and historical language periods. They can be named after the eras of political organization, of rulers, or
of cultural, social or economic development. Language epochs can comprise distinct language periods.
For several well-studied individual languages, in particular individual languages with a long tradition
of writing, scholars have established the language epochs “old X”, “middle X” and “modern X”, where X
stands for the name of the individual language. These language epochs are then often sub-divided into
“early” and “late” language periods.
EXAMPLE 1 (epoch): Early Middle English.
EXAMPLE 2 (period): Victorian English.
Language epochs can vary in their temporal extension between individual languages. Sometimes they
vary even between different dialects within one individual language. This holds even more so for the
historical language periods.
The establishment of historical language periods can vary between different experts and depends on
their interest or purpose. The beginning and end of a language period are usually not exact points, so
that a period can be characterized by vague delineations or prototypes, for example: “the period around
the 1880s” or “the 16th and early 17th century”. The closer to the present moment, the shorter are the
periods of an individual language or language variety that can be distinguished due to more detailed
knowledge of the structural features of the individual language. Still, language periods typically span
some decades up to a few centuries.
4.5 The social group dimension and its varieties
With respect to the social group dimension, an individual language can be differentiated into sociolects.
Sociolects refer to the socialization of speakers as belonging to a certain social group, such as class,
milieu, professional group, age group, religious group, ethnic group (if not accounted for by dialects) or
gender.
The number and specificity of sociolects that need to be distinguished varies very much from individual
language to individual language and reflects the social structure and in particular the social segregation
of the society in which the individual language is used. In some small egalitarian societies there can
be little or no social group variation at all (that
...
TECHNICAL ISO/TR
REPORT 21636-2
First edition
2023-03
Language coding — A framework for
language varieties —
Part 2:
Description of the framework
Identification et description des variétés de langues —
Partie 2: Description
Reference number
© ISO 2023
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Linguistic variation and language varieties . 1
4.1 Linguistic variation . 1
4.2 Dimensions of linguistic variation . 3
4.3 The space dimension and its varieties . 4
4.4 The time dimension and its varieties . 5
4.5 The social group dimension and its varieties . 5
4.6 The medium dimension and its varieties . 6
4.7 The situation dimension and its varieties . 8
4.8 The individual speaker dimension and its varieties . 10
4.9 The proficiency dimension and its varieties . 10
4.10 The communicative functioning dimension and its varieties .12
Bibliography .14
iii
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO’s adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see
www.iso.org/iso/foreword.html.
This document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee SC 2, Terminology workflow and language coding.
A list of all parts in the ISO 21636 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.
iv
Introduction
More and more digital language resources (LRs) are being created (also by retro-digitization), archived,
processed and analysed. In this context, detailed and exact characterization of language varieties
present in a given language use event is quickly gaining importance. Here, language use includes all
modalities such as written, spoken or signed, and also new forms of language use supported by digital
technology (in social media and similar forms of digital communication). But this is just one way in
which languages vary internally. Others include, for instance, the well-known regional (dialectal) and
social variation.
While in the past a primary goal of working with LRs was the archiving and preservation of LRs, new
goals have emerged and are still emerging:
— institutions and individuals need to exchange metadata (that is, bibliographic description data
and other secondary information) for making the information on existing LRs widely available in a
harmonized form;
— researchers are looking for the primary data (that is, the LRs themselves) for many different
research purposes, including research on linguistic variation;
— researchers and developers need LRs for the development of more advanced language technologies
(LTs) and for testing purposes, as LTs, in particular speech recognition and language analysis, are
entering more and more dimensions of human communication.
In order to achieve the above-mentioned goals and purposes, along with others not outlined in this
document, a standardized set of metadata for the identification of language varieties is important
to guarantee frictionless exchange of secondary information. Well-organized metadata also help to
indicate the degree of interoperability (equalling re-usability and re-purposability of LRs), and the
applicability of LTs to different situations or LRs over time. These metadata are applicable in eBusiness,
eHealth, eGovernment, eInclusion, eLearning, smart environments, ambient assisted living (AAL) and
virtually all other applications which depend on information about LRs. A clear metadata approach is
also a prerequisite for the durability of language resource archiving (in particular in the case of cultural
heritage and scientific research data).
1)
The identification of different individual languages is the subject of ISO 639 , which identifies existing
(living, extinct and historical) individual languages, as well as language groups. This document, and
the ISO 21636 series in general, presupposes and complements ISO 639 by extending the language
code framework in order to allow for the identification of language varieties of different types (such
as geographical, social and modal varieties, among others). The identification of language varieties can
then be included in general, library and archival metadata for describing LRs (which can also include
technical information, time and location of recording, and similar general information, which are not
part of the ISO 21636 series).
The provisions of the ISO 21636 series cover:
— a general conceptual framework to deal coherently with language-internal linguistic variation;
— general rules for the identification and description of language varieties;
— a set of dimensions and open-ended or closed lists of values that can be assigned to each respective
dimension;
— a set of metadata categories and examples for the respective possible values, grouped according to
the most important aspects of the description of events of language use and resulting LRs, related
to linguistic variation.
The metadata categories and values addressed in this document can be candidates for a future highly
granular coding of language varieties based on these comprehensive principles. Thus, this document
1) Under preparation. Status at the time of publication: ISO/FDIS 639:2023.
v
(and the ISO 21636 series in general) conforms to the “recommendations on software and content
development principles 2010”, and fits within the general framework of the ISO/IEC 11179 series for
metadata.
Stakeholders include, but are not limited to:
— information and communication technologies (ICTs) industry (including LTs);
— libraries;
— the media industry (including entertainment);
— internet communities;
— people engaging in language documentation and preservation;
— language archivists;
— translators and interpreters;
— researchers (linguists, in particular sociolinguists, ethnologists, sociologists, etc.);
— people and institutions providing language training;
— emerging new user communities.
It is anticipated that these stakeholders need to refer not only to a certain individual language, but
also to a certain language variety, for instance for oral human-computer interaction, or for tailoring
a certain LR or tool to the needs and specific environment of a target user group. In order to identify
the dimension(s) of linguistic variation internal to individual languages involved, and the respective
relevant language varieties, a first step is to achieve the needed specificity. Adapting a conceptually
sound, uniform framework of reference as developed in this document is superior to the proliferation of
different individual ad hoc solutions.
vi
TECHNICAL REPORT ISO/TR 21636-2:2023(E)
Language coding — A framework for language varieties —
Part 2:
Description of the framework
1 Scope
This document, and the ISO 21636 series in general, provides the general principles for the identification
and description of varieties of individual human languages. It, therefore, does not apply to:
— artificial means of communication with or between machines such as programming languages;
— those means of human communication which are not fully or largely equivalent to human language
such as individual symbols or gestures that carry isolated meanings but cannot be freely combined
into complex expressions.
This document together with the other parts of the ISO 21636 series establishes the dimensions of
linguistic variation as well as core values necessary to identify individual varieties in these dimensions
or sub-dimensions.
This document forms the basis for the other parts by outlining the general framework for language
varieties.
2 Normative references
There are no normative references in this document.
3 Terms and definitions
No terms and definitions are listed in this document.
ISO and IEC maintain terminology databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at https:// www .electropedia .org/
4 Linguistic variation and language varieties
4.1 Linguistic variation
Individual human languages differ from one another, and variation exists within each one. Since
language variation is inherent, this variation is also present in LRs. This document covers the
description of LRs that represent instances of use of individual languages regarding their status with
respect to linguistic variation.
While individual languages originally emerged and were mainly used for communication between
humans, their use is increasingly supported by ICTs. Events of language use involving machines are also
covered by this document.
Within individual human languages, linguistic variation occurs in distinct dimensions (listed in 4.2
and described in detail in the remaining text), resulting in different kinds of language varieties. Each
of the dimensions is independent from the others, although mutual influences exist. Each linguistic
manifestation in a given individual language, such as a written text, an utterance, an entry in a lexical
database, etc., can, therefore, be characterized by its location in each of these dimensions of linguistic
variation.
Some language varieties can be further differentiated into sub-varieties, but this level of detail is not
the focus of this document.
Furthermore, the borders between individual languages can be hard to establish, especially when
different groups of people disagree in their assessment of the mutual intelligibility or the socio-political
situation so that the status of a given set of idiolects as an individual language, language variety or
group of related languages is disputed. This difficulty is not addressed in this document (see ISO 639).
Similarly, the borders of language varieties are sometimes hard to establish. In many cases, different
language varieties being distinguished on the same dimension of linguistic variation (for instance
different dialects) do overlap — in other words, there can be idiolects that fulfil all respective criteria
to belong to both varieties. This is particularly true for the time dimension (see 4.4). However, it does
not invalidate the establishment of varieties.
In addition, individual languages and their language varieties are under constant gradual, sometimes
rapid, change so that the following applies:
— A given language variety can over time gain the status of an individual human language (to receive
its own language identifier in ISO 639). For instance, different dialects of Vulgar Latin developed
into distinct Romance languages. Conversely, over time an individual language can develop into a
language variety of another individual language, as appears to have been the case with a postulated
Spanish-based creole language spoken in Cuba that would have developed into a variety of Cuban
Spanish. In this kind of case, the resulting variety would be subject to the framework described in
this document, in which case an existing ISO 639 language identifier assigned to the earlier form of
this language would come to designate an extinct individual language.
— Linguistic expressions and features belonging to a given language variety at one point can shift
along one or more dimensions of linguistic variation (for instance markedly informal expressions
can become acceptable even in formal contexts, or regional expressions can spread over large parts
of the geographical territory where the individual language is spoken).
This document focuses on a framework for the identification and description of linguistic variation
within a language, that is, of language varieties and sub-varieties. It does not attempt to document
all varieties of all individual languages, rather to list the types of descriptors needed to precisely and
fully characterize the linguistic variety represented by an utterance or LR. It does so by describing
the dimensions of variation within languages, and by indicating the major language variations that
typically occur within individual languages.
This document considers individual languages to be sets of idiolects, which in turn are sets of
expressions of human language with individual meanings. Each idiolect is characterized by a coherent
system of structural features, and is homogenous by definition. This understanding of “idiolect” should
not be confused with its use in the sense of “personal variety” — the total repertoire a speaker uses in
a language, which is heterogenous, varying according to different situations and/or media, and thus
unsuited to defining the smallest unit of linguistic variation.
In this framework, all language varieties are subsets of individual languages. Each individual language
is classified into different varieties according to different kinds of external criteria and, at the same
time, of structural criteria (see below). The resulting language varieties in each classification can
overlap, and they can be sub-classified into smaller language varieties, again according to different
criteria.
Each language variety is characterized by structural criteria, in particular, of the sound system
(phonetics and phonology) or its counterpart in other language modalities (the visual-spatial system,
or system of graphemes, etc.), the morphology and syntax, the lexicon and the semantic system. At the
same time, each language variety is also characterized by certain external properties; for example, to
be used in a certain region, at a certain time, in a formal situation. These criteria are organized into a
few major types and constitute the different dimensions of linguistic variation; there is one dimension
of linguistic variation for each relevant major type of external properties of idiolects (such as properties
referring to geographical space, properties referring to time, etc.).
This conceptual framework for language varieties respects the major approaches in linguistic literature
without copying any particular one, though in general orientation and details such as the role assigned
[17]
to idiolects, it is most similar to the work of Lieb .
4.2 Dimensions of linguistic variation
Linguistic manifestations such as LRs and events of language use are characterized by the following
2)
dimensions of linguistic variation (see ISO 21636-1: —, 3.3):
a) space dimension;
b) time dimension;
c) social group dimension;
d) medium dimension;
e) situation dimension;
f) individual speaker dimension;
g) proficiency dimension;
h) communicative functioning dimension.
This document provides a general description of each dimension and its respective varieties. Instructions
on how to identify and indicate the respective varieties belonging to each of these dimensions are given
3)
in ISO 21636-3 . The structure of this document and ISO 21636-3 is strictly parallel so that the reader
can easily compare the general discussion and the instructions.
Although there can be interferences between some dimensions, the dimensions of linguistic variation
are in principle independent of one another. A complete characterization of any given language use
event and any LR with regard to linguistic variation would identify the respective varieties to which the
event or resource belongs in all and each of these eight dimensions of linguistic variation. In practice, a
description can focus only on a few salient or relevant dimensions of linguistic variation, and (perhaps
tacitly) assume default values for other dimensions of linguistic variation.
For reasons of readability, “speaker” is always used generically in this document, covering also more
specific concepts such as “writer”, “signer”, etc. Equally, other comments referring to properties specific
to speaking always hold analogously for the other language modalities.
Language varieties of some dimensions of linguistic variation can have sub-varieties, usually of the
same dimension, such that a given language use event or a given LR can have more than one value with
respect to a dimension of linguistic variation. Where two varieties of the same dimension apply, usually
one variety is more specific and the other broader. For instance, a recording of a speaker using a Norfolk
dialect can also be characterized as belonging to the broader East Anglian dialect of English.
There are even examples for sub-varieties of different dimensions. For instance, the middle Bavarian
period of the Bavarian dialect of German is different from the Bavarian dialect combined with the
middle German period. Such cases are not further addressed in this document.
There can also be mixed events of language use or LRs that contain several events of language use that
belong to different varieties according to the same dimension of linguistic variation. For instance, a
dialogue between speakers uses different dialects, or a dictionary covers several dialects, sociolects,
2) Under preparation. Status at the time of publication: ISO/CD 21636-1:2023.
3) Under preparation. Status at the time of publication: ISO/CD 21636-1:2023.
etc. of an individual language. In such cases, all respective (groups of) language varieties will have
to be identified, and if possible, the respective parts of such an LR will have to be related to their
respective language varieties; for instance, according to the different participants and to the different
time segments of such a dialogue, or to the different entries in such a dictionary.
Another special case involves LRs where the language use is “non-native” in the sense that a speaker
is deliberately imitating a language variety different from his/her own variety. For instance, a speaker
of a dialect X imitates speech of another dialect Y, or a speaker imitates another speaker, or an author
makes use, for example as a stylistic device, of linguistic expressions as conceived as typical for a
certain historical language period, or of a certain social group to which the writer does not belong. In
such cases, it is advisable to state the fact that the variety in question is being imitated. (In general,
[11]
such cases can be dealt with in a way compatible with Extension T of IETF BCP 47 .)
4.3 The space dimension and its varieties
With respect to the space dimension, an individual language can be differentiated into dialects (and
these into sub-dialects) and sometimes also a supra-regional standard variety. This is often the most
complex and differentiated kind of linguistic variation.
EXAMPLE 1 (English dialect): East Anglian.
EXAMPLE 2 (supra-regional standard variety): United Kingdom Standard English.
When determining the dialect of a speaker, the factor that is mainly considered is the geographical
region of the socialization of the speaker, that is, where the speaker grew up and also where the
speaker’s parents grew up. If these factors are heterogeneous, for example due to migration or due to
parents from different regions, sociolinguists try to identify the major dialect which the language use
of the speaker most strongly resembles. In such cases, also minor dialectal influences can be indicated,
for instance from the region of (one of) the speaker’s parents, or of a region where the speaker moved
to in a later phase of his or her life. The geographical region of an individual language can cover several
countries and continents, giving rise to new dialects.
EXAMPLE 3 Western American English with influence from southern British English.
The names given to individual dialects are often traditional and usually refer to the geographical region
where the dialect is spoken. When appropriate, the ISO 3166 series can be applied for countries and
established regions, or ISO 6709 and geographic identifiers in an ISO 19112 reference system for spatial
referencing. How many and which dialects are to be distinguished on a given level of specificity is often
debated between specialists, and so are the names and the borders of the dialects.
Most major and also many smaller, well-researched individual languages are characterized by a finite
number of established major dialects. In the case of little known or disputed language varieties, the
identification of a dialect can be supported by a reference to some scholarly work where the dialect is
established or identified.
EXAMPLE 4 Cheshire English [cf. Leigh E. Introduction in: A Glossary of Words Used in the Dialect of Cheshire
(Hamilton, Adams, and Co./Minshull and Hughes; 1877)].
Dialect areas can overlap and often can best be defined by similarity with some prototypical core
variety. Hence, belonging to a certain dialect can be a question of degree for an idiolect and does not
necessarily exclude it from also being acceptable as a member of another dialect to a certain degree.
In individual languages used in a larger geographical area, there is often one variety, usually based
on one specific traditional dialect or a group of dialects, that is recognized as “standard” by most or
all speakers across the whole or a larger part of the geographical area of the individual language.
The standard variety is often characterized by a high degree of normalization and is used in official
communication. In such cases, many speakers can use both a local dialect and the standard variety.
Again, in the case of the standard variety, often the influence of a local dialect is still evident (for instance
as an “accent”), even with speakers who do not have strong mastery in the original local dialect.
EXAMPLE 5 High German with a Bavarian accent.
In the case of individual languages with a very broad geographical coverage, especially those used in
different countries (e.g. German) and continents (in particular Arabic, Chinese, Dutch, English, French,
Portuguese and Spanish) there can be more than one standard variety.
A special case is posed by diaspora varieties spoken in a geographical region where other individual
languages are more strongly present. Usually the influence of the dominant language is evident in
the individual language in such a situation. This again can influence the native individual language of
speakers who live for a while in such an area, even when they speak their original dialect.
EXAMPLE 6 Urdu spoken in London (with influence from English).
When speakers speak several dialects and/or the standard variety more or less fluently, it can be
necessary to determine which of the dialects or standard variety has been used in each language use
event. For some purposes, it can additionally be useful to indicate which other individual languages and
which other dialects of the individual language in question the speakers are able to speak, because this
can influence their linguistic behaviour, even when speaking another dialect or a standard variety.
4.4 The time dimension and its varieties
With respect to the time dimension, an individual language can be differentiated into language epochs
and historical language periods. They can be named after the eras of political organization, of rulers, or
of cultural, social or economic development. Language epochs can comprise distinct language periods.
For several well-studied individual languages, in particular individual languages with a long tradition
of writing, scholars have established the language epochs “old X”, “middle X” and “modern X”, where X
stands for the name of the individual language. These language epochs are then often sub-divided into
“early” and “late” language periods.
EXAMPLE 1 (epoch): Early Middle English.
EXAMPLE 2 (period): Victorian English.
Language epochs can vary in their temporal extension between individual languages. Sometimes they
vary even between different dialects within one individual language. This holds even more so for the
historical language periods.
The establishment of historical language periods can vary between different experts and depends on
their interest or purpose. The beginning and end of a language period are usually not exact points, so
that a period can be characterized by vague delineations or prototypes, for example: “the period around
the 1880s” or “the 16th and early 17th century”. The closer to the present moment, the shorter are the
periods of an individual language or language variety that can be distinguished due to more detailed
knowledge of the structural features of the individual language. Still, language periods typically span
some decades up to a few centuries.
4.5 The social group dimension and its varieties
With respect to the social group dimension, an individual language can be differentiated into sociolects.
Sociolects refer to the socialization of speakers as belonging to a certain social group, such as class,
milieu, professional group, age group, religious group, ethnic group (if not accounted for by dialects) or
gender.
The number and specificity of sociolects that need to be distinguished varies very much from individual
language to individual language and reflects the social structure and in particular the social segregation
of the society in which the individual language is used. In some small egalitarian societies there can
be little or no social group variation at all (that means there is just one general neutral sociolect, or
maximally some age group differences which almost inevitably occur when different generations speak
the individual language). Other small societies can only strongly differentiate between two genderlects
(that is, a male and a female variety).
NOTE 1 The existence or not of genderlect
...
The standard SIST-TP ISO/TR 21636-2:2024 provides a comprehensive framework for understanding and describing language varieties. This document is instrumental in delineating the principles that govern the identification and documentation of individual human language varieties, making it a vital resource for linguists, language researchers, and sociolinguistics. One of the notable strengths of this standard is its clarity in defining the scope of applicability. It explicitly states that it focuses on human communication, distinguishing itself from artificial languages or non-complex human communication means. This focus reinforces the integrity of its framework by ensuring that it emphasizes the complexity inherent in human languages, which is essential for accurate linguistic analysis. The standard's framework is structured around the dimensions of linguistic variation, providing a systematic approach to identifying and categorizing language varieties. This systematic approach not only enhances the functionality of linguistic research but also encourages consistency across various studies. By establishing core values necessary to identify individual varieties in different dimensions or sub-dimensions, the standard facilitates a standardized method for comparing and contrasting language varieties, which is crucial for advancing linguistic scholarship. Moreover, this document serves as a foundational text for the continued development of the entire ISO 21636 series, underscoring its relevance within the broader context of language coding. By effectively outlining the general framework for language varieties, it lays the groundwork for subsequent parts of the series to build upon, thus ensuring coherence and comprehensive coverage in language variety documentation. In conclusion, SIST-TP ISO/TR 21636-2:2024 embodies a significant contribution to the field of linguistics by setting forth a robust framework for language variety description, which is at once systematic, clear, and relevant to current linguistic studies.
Die Standardisierung DIN ISO/TR 21636-2:2024 bietet einen grundlegenden Rahmen zur Beschreibung und Identifizierung von Sprachvariationen. Der Geltungsbereich dieses Dokuments umfasst die allgemeinen Prinzipien, die zur Klärung und Charakterisierung menschlicher Sprachvarianten notwendig sind. Dabei ist hervorzuheben, dass die Norm nur für natürliche menschliche Sprachen anwendbar ist und nicht für künstliche Kommunikationsmittel oder isolierte Symbole. Ein wesentliches Stärke des Standards ist seine umfassende Herangehensweise an die Dimensionen linguistischer Variation. Durch die klare Definition und Gliederung der Kernwerte, die für die Identifizierung individueller Sprachvariationen erforderlich sind, schafft das Dokument eine wertvolle Basis für die weiteren Teile der ISO 21636-Serie. Dies ermöglicht Forschern und Fachleuten, die Komplexität und Vielfalt der menschlichen Sprache systematisch zu erfassen. Die Relevanz des стандарта ISO/TR 21636-2:2024 liegt in seinem Beitrag zur linguistischen Wissenschaft und zur Entwicklung sprachlicher Technologien. Der Rahmen fördert ein einheitliches Verständnis von Sprachvariationen und unterstützt die präzise Beschreibung und Analyse von Sprachgebrauch in verschiedenen Kontexten. Durch die Festlegung von Grundlagen und Prinzipien hilft der Standard, die Kommunikation zwischen verschiedenen Sprachvarianten zu verbessern und Missverständnisse zu vermeiden. Insgesamt stellt die Norm eine bedeutende Ressource für Linguisten, Sprachwissenschaftler und Entwickler dar, die an der Erforschung und Anwendung der Vielfalt menschlicher Sprache interessiert sind.
SIST-TP ISO/TR 21636-2:2024は、言語の多様性を理解し定義するための重要な指針を提供しています。この標準の範囲は、個々の人間の言語の多様性の識別と記述に関する一般的な原則を確立しており、言語体系における複雑さをより深く理解するための基盤を形成しています。 この標準の強みは、その適用範囲の明確さにあります。具体的には、プログラミング言語などの人工的なコミュニケーション手段や、孤立した意味を持つ記号やジェスチャーなどの完全または大部分が人間言語に相当しない手段には適用されないことが明記されています。これにより、本標準は人間の言語特有の多様性とその識別方法に特化したものであることが強調され、他のコミュニケーション手段との差別化が図られています。 また、ISO 21636シリーズ全体を通じて、言語的変異の次元を確立し、これらの次元やサブ次元における個々の多様性を特定するために必要な核心的価値を提示しています。この文書は、他の部分を支える重要な役割を果たしており、言語多様性のフレームワークを概説しているため、関連する研究や実務において不可欠なリソースです。 従って、SIST-TP ISO/TR 21636-2:2024は、言語の多様性を探求するための基本的な枠組みを提供する標準として、高い関連性を持ち、言語学、コミュニケーション学、文化研究など様々な分野において重要な役割を果たすことが期待されます。
SIST-TP ISO/TR 21636-2:2024 문서는 언어 변종의 식별 및 설명을 위한 일반 원칙을 제공하며, ISO 21636 시리즈의 중요한 일환으로 기능합니다. 이 표준은 인간 언어의 다양성을 이해하는 데 핵심적인 역할을 하며, 언어 변종의 차원을 설정하고 이를 식별하는 데 필요한 핵심 가치를 제시합니다. 본 표준은 언어의 다양한 변종을 설명하는 데 필요한 프레임워크를 제공하며, 언어 연구 및 다국어 자료의 편집 및 비교에 중요한 기초를 형성합니다. 주요 강점으로는 언어 변종에 대한 포괄적이고 체계적인 접근 방식을 제공합니다. 이는 전 세계의 다양한 언어적 특성을 반영할 수 있는 기회를 제공하여, 다양한 언어 연구 커뮤니티에서 해당 표준의 활용을 촉진합니다. 특히, 인간 언어의 변종을 명확히 식별하고 설명할 수 있는 기준을 마련함으로써, 언어학자 및 교육자들에게 유용한 도구가 됩니다. 또한, 이 문서는 프로그래밍 언어와 같은 기계 간의 의사소통 수단이나 특정 기호나 제스처와 같은 인간 언어와 충분히 동등하지 않은 의사소통 수단에는 적용되지 않는다는 점에서 그 범위가 명확합니다. 이러한 제한 사항은 표준의 적절성과 유효성을 더욱 강화합니다. ISO 21636-2는 언어 학문 및 실행에 있어 제정된 규범을 제공하며, 향후 언어 변종 연구의 기초가 될 것입니다.
Le document SIST-TP ISO/TR 21636-2:2024, intitulé "Language coding - A framework for language varieties - Part 2: Description of the framework", présente un cadre exceptionnel pour l'identification et la description des variétés linguistiques humaines. Le périmètre de ce document est clairement défini, excluant des moyens de communication artificiels ou des symboles non linguistiques, ce qui renforce sa pertinence dans l'étude des langues humaines. L'une des principales forces de ce standard est sa capacité à établir des principes généraux qui servent de fondement pour la série ISO 21636. En détaillant les dimensions de la variation linguistique, ce document facilite non seulement l'identification d'individus variétés au sein de ces dimensions, mais fournit également une méthode structurée pour aborder des sous-dimensions. Cela permet aux chercheurs et aux linguistes de mieux comprendre la complexité et la richesse des langues humaines. De plus, le cadre qu'offre le document est d'une grande importance pour l'harmonisation des pratiques autour du codage des langues. En fournissant des valeurs centrales essentielles pour identifier et décrire les variétés linguistiques, le SIST-TP ISO/TR 21636-2:2024 devient un outil indispensable pour les professionnels du langage, qu'ils soient dans les domaines de la linguistique, de la traduction, ou du développement de technologies linguistiques. En conclusion, la pertinence de ce document dans le cadre de la normalisation des variétés linguistiques en fait une contribution significative à la compréhension des dynamiques langagières, tout en soutenant des initiatives futures de recherche et d'application dans ce domaine.














Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...