ASTM E1808-96(2021)
(Guide)Standard Guide for Designing and Conducting Visual Experiments
Standard Guide for Designing and Conducting Visual Experiments
ABSTRACT
This guide is intended to help the user decide on the type of viewing conditions, visual scaling methods, and analysis that should be used to obtain reliable visual experimental data. It is also intended to illustrate the techniques that lead to visual observations that can be correlated with objective instrumental measurements of appearance attributes of objects. This guide includes a review of issues regarding the choice and design of viewing environments, an overview of various classes of visual experiments, a review of experimental techniques for threshold, matching, and scaling experiments, a review for data reduction and analysis procedures. The three different threshold and matching techniques namely, the methods of adjustment, limits, and constant stimuli, are explained. Perceptual scaling techniques reviewed include ranking, graphical rating, category scaling, paired comparisons, triadic combinations, partitioning, and magnitude estimation or production. Brief descriptions and examples, along with references to more detailed literature, are given on the appropriate types of data analysis for each experimental technique.
SCOPE
1.1 This guide is intended to help the user decide on the type of viewing conditions, visual scaling methods, and analysis that should be used to obtain reliable visual data.
1.2 This guide is intended to illustrate the techniques that lead to visual observations that can be correlated with objective instrumental measurements of appearance attributes of objects. The establishment of both parts of such correlations is an objective of Committee E12.
1.3 Among ASTM standards making use of visual observations are Practices D1535, D1729, D3134, D4086, and E1478; Test Methods D2616, D3928, and D4449; and Guide E1499.
1.4 This standard does not purport to address all of the safety concerns, if any, associated with its use. It is the responsibility of the user of this standard to establish appropriate safety, health, and environmental practices and determine the applicability of regulatory limitations prior to use.
1.5 This international standard was developed in accordance with internationally recognized principles on standardization established in the Decision on Principles for the Development of International Standards, Guides and Recommendations issued by the World Trade Organization Technical Barriers to Trade (TBT) Committee.
General Information
Relations
Standards Content (Sample)
This international standard was developed in accordance with internationally recognized principles on standardization established in the Decision on Principles for the
Development of International Standards, Guides and Recommendations issued by the World Trade Organization Technical Barriers to Trade (TBT) Committee.
Designation: E1808 − 96 (Reapproved 2021)
Standard Guide for
Designing and Conducting Visual Experiments
This standard is issued under the fixed designation E1808; the number immediately following the designation indicates the year of
original adoption or, in the case of revision, the year of last revision. A number in parentheses indicates the year of last reapproval. A
superscript epsilon (´) indicates an editorial change since the last revision or reapproval.
1. Scope D4086 Practice for Visual Evaluation of Metamerism
D4449 Test Method for Visual Evaluation of Gloss Differ-
1.1 Thisguideisintendedtohelptheuserdecideonthetype
ences Between Surfaces of Similar Appearance
of viewing conditions, visual scaling methods, and analysis
E284 Terminology of Appearance
that should be used to obtain reliable visual data.
E1478 Practice for Visual Color Evaluation of Transparent
1.2 This guide is intended to illustrate the techniques that
Sheet Materials
leadtovisualobservationsthatcanbecorrelatedwithobjective
E1499 Guide for Selection, Evaluation, and Training of
instrumental measurements of appearance attributes of objects.
Observers
The establishment of both parts of such correlations is an
3. Terminology
objective of Committee E12.
3.1 The terms and definitions in Terminology E284 are
1.3 AmongASTM standards making use of visual observa-
applicable to this guide.
tions are Practices D1535, D1729, D3134, D4086, and E1478;
Test Methods D2616, D3928, and D4449; and Guide E1499.
3.2 Definitions:
3.2.1 appearance, n—in psychophysical studies, perception
1.4 This standard does not purport to address all of the
inwhichthespectralandgeometricaspectsofavisualstimulus
safety concerns, if any, associated with its use. It is the
are integrated with its illuminating and viewing environment.
responsibility of the user of this standard to establish appro-
priate safety, health, and environmental practices and deter-
3.2.2 observer, n—one who judges visually, qualitatively or
mine the applicability of regulatory limitations prior to use.
quantitatively, the content of one or more appearance attributes
1.5 This international standard was developed in accor-
in each member of a set of stimuli.
dance with internationally recognized principles on standard-
3.2.3 sample, n—a small part or portion of a material or
ization established in the Decision on Principles for the
product intended to be representative of the whole.
Development of International Standards, Guides and Recom-
3.2.4 scale, v—to assess the content of one or more appear-
mendations issued by the World Trade Organization Technical
ance attributes in the members of a set of stimuli.
Barriers to Trade (TBT) Committee.
3.2.4.1 Discussion—Alternatively, scales may be deter-
2. Referenced Documents
mined by assessing the difference in content of an attribute
with respect to the differences in that attribute among the
2.1 ASTM Standards:
members of the set.
D1535 Practice for Specifying Color by the Munsell System
D1729 Practice for Visual Appraisal of Colors and Color 3.2.5 specimen, n—a piece or portion of a sample used to
Differences of Diffusely-Illuminated Opaque Materials make a test.
D2616 Test Method for Evaluation of Visual Color Differ-
3.2.6 stimulus, n—any action or condition that has the
ence With a Gray Scale
potential for evoking a response.
D3134 Practice for Establishing Color and Gloss Tolerances
3.3 Definitions of Terms Specific to This Standard:
D3928 Test Method for Evaluation of Gloss or Sheen
3.3.1 anchor, n—the stimulus from which a just-perceptible
Uniformity
difference is measured.
1 3.3.2 anchor pair, n—a pair of stimuli differing by a defined
This guide is under the jurisdiction of ASTM Committee E12 on Color and
Appearance and is the direct responsibility of Subcommittee E12.11 on Visual
amount, to which the difference between two test stimuli is
Methods.
compared.
Current edition approved June 1, 2021. Published June 2021. Originally
3.3.3 interval scale, n—a scale having equal intervals be-
approved in 1996. Last previous edition approved in 2015 as E1808 – 96 (2015).
DOI: 10.1520/E1808-96R21.
tween elements.
For referenced ASTM standards, visit the ASTM website, www.astm.org, or
3.3.3.1 Discussion—Logical operations such as greater-
contact ASTM Customer Service at service@astm.org. For Annual Book of ASTM
than, less-than, equal-to, and addition and subtraction can be
Standards volume information, refer to the standard’s Document Summary page on
the ASTM website. performed with interval-scale data.
Copyright © ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA 19428-2959. United States
E1808 − 96 (2021)
3.3.4 law of comparative judgments—an equation relating 5. Viewing Conditions
the proportion of times any stimulus is judged greater, accord-
5.1 Light Source—The illumination of the specimens in
ing to some attribute, than any other stimulus in terms of
scaling experiments must be reproducible over the course of
just-perceptible differences.
the experiments. To achieve this, it is essential to control both
3.3.5 nominal scale, n—scale in which items are scaled
the spectral character and the amount of illumination closely in
simply by name.
both space and time. Failure to accomplish this can seriously
3.3.5.1 Discussion—Only naming can be performed with
undermine the integrity of the experiments.The spectral power
nominal-scale data.
distribution of the illumination should be known or, if this is
notpossible,thelightsourceshouldbeidentifiedastotypeand
3.3.6 ordinal scale, n—a scale in which elements are sorted
manufacturer. Information such as daylight-corrected fluores-
in order based on more or less of a particular attribute.
cent light, warm-white fluorescent light, daylight-filtered in-
3.3.6.1 Discussion—Logical operations such as greater-
candescent light, incandescent light, etc., together with param-
than,less-than,orequal-tocanbeperformedwithordinal-scale
eters such as correlated color temperature and color rendering
data.
index, if available, should be noted in the report of the
3.3.7 psychometric function, n—the function, typically
experiment.
sigmoidal,relatingtheprobabilityofdetectingastimulustothe
stimulus intensity. 5.2 Viewing Geometry—Almost all specimens exhibit some
degreeofgonioapparentorgoniochromaticvariation;therefore
3.3.8 psychophysics, n—the study of the functions relating
the illuminating and viewing angles must be controlled and
the physical measurements of stimuli and the sensations and
specified. This is particularly important in the study of speci-
perceptions the stimuli evoke.
mens exhibiting gloss variations, textiles showing
3.3.9 ratio scale, n—a scale which, in addition to the
directionality, or gonioapparent (containing metallic or pearl-
properties of other scales, has a meaningfully defined zero
escent pigments) or retroreflective specimens, among others.
point.
This control and specification can range from correct position-
3.3.9.1 Discussion—In addition to the logical operations
ing of the source and observer and the elimination of any
performable with other types of data, multiplication and
secondary light sources visible in the specimens, for the
division can be performed with ratio-scale data.
judgment of gloss specimens at and near the specular angle, to
3.3.10 scale, n—a defined arrangement of the elements of a
more elaborate procedures specifying a range of angles and
set of stimuli or responses.
aperture angles of illumination and viewing for gonioapparent
and retroreflective specimens. When fluorescent specimens are
4. Summary of Guide
studied, the spectral power distribution of the source must
4.1 This guide provides an overview of experimental design closely match that of a designated standard source.
and data analysis techniques for visual experiments. Carefully
5.3 Surround and Ambient Field—For critical visual scaling
conducted visual experiments allow accurate quantitative
work, the surround, the portion of the visual field immediately
evaluation of perceptual phenomena that are often thought of
surrounding the specimens, should have a color similar to that
asbeingcompletelysubjective.Suchresultscanbeofimmense
of the specimens.The ambient field, the field of view when the
value in a wide variety of fields, including the formulation of
observer glances away from the specimens, should have a
coloredmaterialsandtheevaluationoftheperceivedqualityof
neutral color (Munsell Chroma less than 0.2) and a Munsell
products.
ValueofN6toN7(luminousreflectance29to42);seePractice
4.2 This guide includes a review of issues regarding the
D1729).
choice and design of viewing environments, an overview of
5.4 Observers—Guide E1499 describes the selection,
various classes of visual experiments, and a review of experi-
evaluation, and training of observers for visual scaling work.
mental techniques for threshold, matching, and scaling experi-
Of particular importance is the testing of the observers’ color
ments. It also reviews data reduction and analysis procedures.
vision and their color discrimination for normality. Color
Three different threshold and matching techniques are
vision tests for this purpose are described in Guide E1499.
explained, the methods of adjustment, limits, and constant
stimuli. Perceptual scaling techniques reviewed include
6. Categories of Visual Experiments
ranking, graphical rating, category scaling, paired
6.1 Visual experiments tend to fall into two broad classes:
comparisons,triadiccombinations,partitioning,andmagnitude
(1) threshold and matching experiments designed to measure
estimation or production. Brief descriptions and examples,
visual sensitivity to small changes in stimuli (or perceptual
along with references to more detailed literature, are given on
equality), and (2) scaling experiments intended to generate a
the appropriate types of data analysis for each experimental
psychophysical relationship between the perceptual and physi-
technique.
cal magnitudes of a stimulus. It is critical to determine first
4.3 Forreviewsoftopicsinotherthanvisualsensorytesting
which class of experiment is appropriate for a given applica-
within ASTM, see Refs (1, 2).
tion.
6.1.1 Threshold and Matching Experiments—Threshold ex-
periments are designed to determine the just-perceptible dif-
The boldface numbers in parentheses refer to a list of references at the end of
this guide. ference in a stimulus, or JPD.Threshold techniques are used to
E1808 − 96 (2021)
measuretheobservers’sensitivitytoagivenstimulus.Absolute 6.1.2.3 Interval Scales—Intervalscaleshaveequalintervals.
thresholds are defined as the JPD for a change from no On an interval scale, if a pair of specimens were separated by
two units, and a second pair at some other point on the scale
stimulus, while difference thresholds represent the JPD from a
were also separated by two units, the differences between the
particular stimulus level greater than zero. The stimulus from
pair members would appear equal. However, there is no
whichadifferencethresholdismeasuredisknownasananchor
meaningfulzeropointonanintervalscale.Acommonexample
stimulus. Often, thresholds are measured with respect to the
ofanintervalscaleistheCelsiustemperaturescale.Inaddition
difference between two stimuli. In such cases, the difference of
to the mathematical operations listed for nominal and ordinal
apairofstimuliiscomparedtothedifferenceinananchorpair.
scales, addition and subtraction can be performed with
Absolute thresholds are reported in terms of the physical units
interval-scale data.
used to measure the stimulus, for example, a brightness
6.1.2.4 Ratio Scales—Ratio scales have all the properties of
threshold might be measured in luminance units of candelas
the above scales plus a meaningfully defined zero point. Thus
per square metre. Sensitivity is measured as the inverse of the
it is possible to equate ratios of numbers meaningfully with a
threshold, since a low threshold implies high sensitivity.
ratio scale. Ratio scales are often impossible to obtain in visual
Threshold techniques are useful for defining visual tolerances,
work. An example of a ratio scale is the absolute, or Kelvin,
such as color-difference tolerances. Matching techniques are
temperature scale. All of the mathematical operations that can
similar, except that the goal is to determine when two stimuli
be performed on interval-scale data can also be performed on
are not perceptibly different. Measures of the variability in
ratio-scale data, and in addition, multiplication and division
matching can be used to estimate thresholds. Matching experi-
can be performed.
ments provided the basis for CIE colorimetry through the
metamericmatchesusedtoderivethecolor-matchingfunctions
7. Threshold and Matching Methods
of the CIE standard observers.
7.1 Several basic types of threshold experiments are pre-
6.1.2 Scaling Experiments—Scaling experiments are in-
sented in this section in order of increasing complexity of
tended to derive relationships between perceptual magnitudes
design and utility of the data generated. Many modifications of
and physical magnitudes of stimuli. Several decisions must be
thesetechniqueshavebeendevelopedforspecificapplications.
made, depending on the type and dimensionality of the scale
Experimenters should strive to design an experiment that
required. It is important to identify the type of scale required
removes as much control of the results from the observers as
and decide on the scaling method to be used before any scaling
possible, thus minimizing the influence of variable observer
dataarecollected.Thisseemstobeanobviouspoint,butinthe
judgment criteria. Generally, this comes at the cost of imple-
rush to acquire data it is often overlooked, and later it may be
menting a more complicated experimental procedure.
foundthatthedataobtaineddonotyieldtheanswerrequiredor
7.1.1 Method of Adjustment—The method of adjustment is
cannot be used to perform desired mathematical operations.
the simplest and most straightforward technique for deriving
See Refs (3, 4) for further details. Scales are classified into the
threshold data. In it, the observer controls the stimulus mag-
following four classes:
nitude and adjusts it to a point that is just perceptible (absolute
6.1.2.1 Nominal Scales—Nominal scales are relatively threshold) or just perceptibly different (difference threshold).
The threshold is taken to be the mean setting across a number
trivial in that they scale items simply by name. For color, a
of trials by one or more observers. The
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.