analysis. This shift in the unit of analysis means that
the model is used to generate many items compared
with the traditional item development approach,
where each item is written individually by one SME.
This important shift also means that the number of
items is not dependent on the number of SMEs.
Instead, item development is linked to the number of
available models, where one SME can create a small
number of models that produce large numbers of
items thereby scaling the item development process.
The purpose of our paper is to describe and
illustrate strategies that rely on appending items with
descriptive information using content codes so the
items can be identified and differentiated in a bank.
Strategies that rely on content coding allow the user
to monitor, organize, and manage the generated items
in his, her, or their bank.
2 AIG AND CONTENT CODING
AIG is a scalable item development approach capable
of producing large numbers of test items when
correctly implemented. However, this influx of new
items must be organized and managed if this resource
is to be used effectively (Cole et al., 2020; Lane et al.,
2016). One strategy for organizing items is to append
each item with psychometric data (e.g., item
difficulty) so the item can be identified and
differentiated from other items in the bank. Typically,
generated items do not contain psychometric data
because the item sets are large. As an alternative,
content coding is a method that can be used to
describe generated items. Two different methods can
be used to describe items and models using content
codes. The first method relies on posthoc coding.
With this method, content codes are created by
reviewing the items and the models in order to
identify relevant content descriptors. The advantage
of posthoc content coding is its flexibility. This
method can be used to create new and novel content
coding systems for any item and model combination
because it does not require an existing content coding
system or taxonomy. However, this method often
suffers from a lack of specificity within a code, it is
often inconsistently applied across codes because the
content descriptions tend to be overly general, and
because of this generality, the codes are often difficult
to interpret (Gierl et al, 2022). In addition, posthoc
coding is time-consuming to implement, particularly
if large numbers of items and models are created and,
hence, need appended content codes after generation.
The second method relies on predefined (also
called a priori) coding. With this method, existing
content codes are drawn from a taxonomy or
established coding system and then applied to the
items and models as the content descriptors.
Predefined coding uses existing content codes
derived from established taxonomies or coding
systems thereby providing specific, consistent, and
interpretable descriptors. In addition, a content
coding taxonomy often contains data descriptions that
are related to one another because of their position
with other data descriptors in a hierarchy or system
(Gartner, 2016). As a result, the content codes in one
system or taxonomy can be linked to other content
codes and data descriptors both in the existing system
or taxonomy and to other related systems and
taxonomies. The disadvantage of predefined content
coding is that this method is prescriptive meaning that
coding is limited to the content in the taxonomy and,
hence, inflexible. Predefined content coding also
relies on the availability of an appropriate taxonomy
to describe the items and models. This type of
taxonomy is not available in some content areas.
One important benefit of using template-based
AIG is that the item model created in step 2
encompasses all of the content that is required for
generating items. Hence, the item model includes the
content that will be used to generate all of the items
specified in the cognitive model. Because content
coding is included in the item modelling step, the
content codes can also be integrated into the items
using the same assembly logic. In other words,
content codes can be used to create data to describe
the generated items. Content codes can be added at
three different levels in the item model. The first level
is option-level coding. Option-level coding describes
data in the multiple-choice response options or
alternatives. Content coding, therefore, can be used to
describe the item options when generating the
selected-response item type. The second level is item-
level coding. A cognitive model contains variables
and values. Variables are elements in an item model
that describe a particular outcome. Variables contain
values that will be manipulated during item
generation to create new items. Content coding can be
used to describe the generated items at both the
variable and value level. These variables and values
can be denoted as string or integer values. The third
level is model-level coding. Content codes can be
applied at the model level. With model-level coding,
specific codes that describe all of the generated items
from a particular item model are used.
Content codes are implemented in the template-
based AIG workflow during step 2. Because the unit
of analysis has shifted from the item to the model,
content coding is a straightforward process when