2.1 DICOM for Visible Light
Whole-Slide Microscopic Images
Because images for digital pathology are very large
(possibly dozens of gigabytes), they most often can-
not entirely fit inside the memory of a standard com-
puter. For this reason, whole-slide images are in prac-
tice divided as a regularly-spaced set of tiles, each tile
being a small patch taken from the full image. A tile is
most often a square whose sides contain between 256
and 1024 pixels. Each individual tile can be accessed
separately, allowing a digital pathology application to
bring a tile into memory only when needed.
Compressing some whole-slide image amounts to
compressing each individual tile either using a loss-
less algorithm (JPEG 2000), or a destructive algo-
rithm (JPEG). Furthermore, to enable the navigation
over the entire whole-slide image, the full-sized im-
age is downscaled several times, leading to a pyramid
of images with decreasing spatial resolutions. Each
level of the pyramid is encoded as a separate tiled im-
age. This process is illustrated in Figure 1. Note that
similar encoding schemes are also commonly used to
serve cartography maps on the Web.
The DICOM standard models the real world as
follows: A given patient benefits during her life from
a set of medical imaging studies. Each study is made
of a set of series. Each series is in turn a set of in-
stances, the latter being a synonym for a DICOM file.
A single DICOM instance can be multi-frame, mean-
ing it can store several independent images (provided
all of its individual frames share the same size). As
a consequence, the whole-slide pyramid represented
in Figure 1 corresponds to one DICOM series, whose
parent study might contain other series acquired dur-
ing the same clinical episode, possibly coming from
other medical imaging modalities. This series is made
of several instances (the DICOM files), each instance
storing the individual tiles of one given pyramid level
as separate frames. A single instance is not allowed to
store tiles from multiple pyramid levels, yet the same
level can be spread over multiple instances, so as to
prevent the appearance of huge DICOM files.
Besides its individual frames, each DICOM in-
stance is associated with clinical data under the form
of a recursive key-value associative array. Such keys
are called the DICOM tags and are indexed with two
16-bit hexadecimal numbers. The DICOM standard
lists which DICOM tags are mandatory, conditional
or allowed for whole-slide images in the so-called
“VL Whole Slide Microscopy Image IOD” (NEMA —
National Electrical Manufacturers Association, 2016,
PS3.3, Section A.32.8).
A full enumeration of this set of DICOM tags is
obviously out of the scope of this paper, and the inter-
ested reader is kindly invited to refer to the DICOM
standard. In the context of this paper, it is sufficient
to know that each single instance must specify the
size of the pyramid level it is related to: The DICOM
tag “Total Pixel Matrix Columns” (0x0048,0x0006)
stores the width of the pyramid level, whereas “To-
tal Pixel Matrix Rows” (0x0048,0x0007) stores its
height. These two tags allow to know to which level
of the pyramid a given DICOM instance belongs.
Similarly, because the tiles of one pyramid level
can be shuffled over several multi-frame DICOM in-
stances, each frame is associated with the (x, y) po-
sition of the corresponding tile in the corresponding
pyramid level: The tag “Column Position In Total Im-
age Pixel Matrix” (0x0048,0x021e) contains the x-
position of one frame, and “Row Position In Total Im-
age Pixel Matrix” (0x0048,0x021f) its y-position.
This information is collected for each frame of the DI-
COM instance inside the tag “Per Frame Functional
Groups Sequence” (0x5200,0x9230). Our frame-
work almost exclusively relies on this set of tags.
2.2 DICOM-izer
The first component of our software framework for
telepathology is the tool that converts a whole-slide
image from a non-DICOM format to DICOM. This
tool will be referred to as the DICOM-izer. It takes
the form of a standalone, cross-platform command-
line tool, so that it can easily be integrated into any
pathology department.
The DICOM-izer features built-in support to read
the most widespread open file format for whole-slide
imaging (i.e. hierarchical TIFF), as well as common-
place image formats (PNG, JPEG and JPEG 2000). It
is also able to decode proprietary file formats for slide
scanners (SVS, BIF, VMS. . . ) through the OpenSlide
toolkit (Goode et al., 2013)
1
. It extracts as much clin-
ical information as possible from the meta-data of the
input image (such as the scanner manufacturer), but
additional information that is not verbatim available
inside the source file must be provided alongside (e.g.
the identifier of the patient, or the optical parameters
of the acquisition). The output of the DICOM-izer is
a set of compliant DICOM files that can be sent to any
DICOM modality, including the PACS of the hospital,
thanks to the standard DICOM C-Store command.
If the input image does not contain the full pyra-
mid but only its finest level, the DICOM-izer can au-
tomatically generate all the upper levels of the pyra-
mid. Similarly, the DICOM-izer can change the com-
1
Note that OpenSlide does not support the generation of
DICOM files by itself.
Open Implementation of DICOM for Whole-Slide Microscopic Imaging
83