and their mutual relationships), Notational (graphical
representations of the score), Performance (computer-
based performances), and Audio (digital or digitised
recordings of the piece). IEEE 1599 adopts XML as
the language to encode symbolic contents and multi-
media synchronization information.
The main focus of an IEEE 1599 document is
the description of a single music piece. The Logic
layer contains a list and a definition of the music sym-
bols that compose the score. Music events, intended
as logic entities, can correspond to one or many in-
stances in other layers. In detail, each music event
(note, rest, etc.) can be described in up to n layers, in
up to n instances within the same layer, and in up to n
occurrences within the same instance.
IEEE 1599 is not a simple container for hetero-
geneous descriptions related to a music piece. Dif-
ferent logic or multimedia descriptions of the same
music events present references to a common logical
structure, known as the spine. For example, given
a note, its logical description (e.g. G]5), the corre-
sponding area in a graphical file (e.g. a rectangle with
vertex coordinates in pixels) and its timing in a num-
ber of different audio/video files (e.g. 1200 ms, 2.35
s, 712 frames) are in relationship with the same mu-
sic event. This aspect creates synchronization among
the instances within a layer (intra-layer synchroniza-
tion), and also synchronization among the contents
disposed in many layers (inter-layer synchronization).
The mentioned common data structure, namely
the spine, is a list of music events that are sorted and
labelled in order to allow references from other layers.
Against common sense, score symbols do not corre-
spond necessarily to the list of the events contained
in the spine. Because if it were, music works with
no notation, such as pieces where the performance is
improvised, or music whose score is unknown, could
not be supported by IEEE 1599, unless the user re-
constructs the score. On the contrary, a traditional
notated score or a complete encoding of the piece is
not required to produce a valid IEEE 1599 document.
Moreover, the author of the encoding can choose the
definition and granularity of events.
The spine has a fundamental theoretical impor-
tance within the format, thanks to its “glue” function
in such a multilayer environment. However, it sim-
ply lists (and does not define) events in order to pro-
vide a unique label for them. As a consequence, the
mere presence of an event in the spine has no seman-
tic meaning: what is listed in the spine structure must
have a counterpart in some layer, otherwise the music
event would not be defined.
For further details about the structure and the syn-
tax of the format, please refer to the official IEEE doc-
umentation or to scientific papers such as (Ludovico,
2008) and (Ludovico, 2009).
3 CHARACTERISTICS OF THE
FORMAT
A number of characteristics applicable to music rep-
resentation and education can be derived from what
stated in Section 2. First, the standard and its appli-
cations present no constraint about music genres, cul-
tural areas, historical periods, or different approaches
to music composition and analysis. IEEE 1599 is fit
to baroque counterpoint as well as jazz improvisation,
to operatic arias as well as pop songs. The features
of the encoding can change noticeably, but the over-
all approach preserves its capability to convey infor-
mation in an integrated, intuitive and immediate way.
The case studies presented in the following sections
will provide examples of the heterogeneity of sources
and materials which IEEE 1599 supports.
As regards multimedia contents, in IEEE 1599
they are represented by adopting in-use formats for
digital objects. Thus, multimedia contents are not
translated into XML format, as existing formats are
more suitable. Rather, multimedia documents are
synchronized with the other contents within the ap-
propriate layer and linked to the common data struc-
ture, i.e. the spine. For instance, IEEE 1599 supports
common graphical formats for score scans (e.g. BMP,
GIF, JPEG, TIFF, etc.), and well-known audio/video
file types for recordings (AIFF, MP3, WAV, etc.). This
approach presents some advantages:
• Existing collections and archives of digital objects
in standard formats can be reused;
• The design and implementation efforts of the
IEEE 1599 format are limited to the logical de-
scription of music events, and not to their multi-
media counterparts;
• The verbosity typical of an XML-based language
afflicts only the strictly necessary part of the en-
coding. When binary formats are more efficient
and effective, they are used to convey the required
information. For instance, this is the typical case
of compressed multimedia formats.
Another important matter is the possibility of mul-
timodal interaction with music contents. This feature
is not properly a characteristic, rather a consequence
of the structure and potentialities of the IEEE 1599
standard. Specific implementations apply the con-
cept of “multimodal interaction” with music contents
to the fields of music training and education, as ex-
plained in Section 6.
NEWFRONTIERSINMUSICEDUCATIONTHROUGHTHEIEEE1599STANDARD
147