considered via roof shapes. Having more than one
local maximum in the roof height is an indication
that the single building should be segmented into
multiple building objects.
6.2 Façades in the 3
rd
Dimension
Along the footprints of the façade one finds
elevation values in the DSM. These do attach to the
façade a 3rd dimension. Depending on the shape of
the roof, a façade could have a complex shape as
well. However, for use as a descriptor one might be
satisfied with a single elevation value for each
façade. We have now defined a vertical rectangle
for each façade footprint.
A refinement would consist of a consideration of
the change of elevations along the façade footprint.
This could be indicative of a sloping ground, or of a
varying roof line, or a combination of both. The
slope of the ground is known from the DTM. The
variations of the roof line are read off the difference
between the DSM and the DTM.
The issue of connected buildings along a
property line exists. One needs to identify such
façade footprints since they are virtual only. Such
facades can be identified via a look at the dense
point cloud. The elevation values above the Bald
Earth along a façade footprint will be zero at one
side of the footprint. If they are not, then buildings
are connected and this façade is only virtual.
6.3 Building and Roof Heights
A building has multiple façades (see Figure 9), and
each façade represents a value for the height of the
building. However, we have not yet considered the
shape of the roof and therefore may get multiple
building heights, depending on the façade one is
considering. Two elevation numbers are desired to
describe the building at a coarse level: we want to
assign a single building height as well as a single
roof height. The building height is the average of
the façade heights. The roof height is the difference
between the highest point in the building’s point
cloud that the previously computed building height.
6.4 Counting Floors and Windows
Conceptually we are dealing with a three-step
process of analyzing each façade. First, we must
project image content onto each façade rectangle or
other façade shape. Therefore the corner points of a
given façade get projected into each aerial
photograph using the poses of the camera. That will
define in each image a certain number of pixels. The
image area with the highest number of pixels is
likely to produce the best façade image. However, in
the interest of using redundancy, we produce
multiple façade images, one each per aerial
photograph that exceeds a minimum image area to
make sense in the further analysis.
Second, the image segments defined in this
manner will have to be subjected to a floor count.
An edge detector is applied to a given façade image
and the edges are used for the floor count. Therefore
the detected horizontal edge values will be
transformed into a binary format and for each row a
summation of the edge values will be performed. In
a next step all the local maxima are detected and out
of them the floors will be determined (see Figure 9).
Third is the definition of all the windows. This
task has recently received some attention, for
example by Čech and Šára (2007). The window
detection uses the normalized horizontal as well as
vertical gradients. Our approach is taken from Lee
and Nevatia (2004). It extracts windows
automatically via a profile projection method from
each of the single façade images. The Prewitt edges
get projected along the rows and columns of the
façade image and the accumulations of the edges
signify the presence or absence of a window row or
window column. We define straight lines along the
boundary of each accumulation, thereby obtaining
likely candidates for window areas in the 2D plane
of the façade. This method is not very accurate when
there are different shapes of windows in the same
column or line. To refine the window locations a one
dimensional search for the four sides of a window is
performed. Hypothesized lines are generated by
moving the line to its perpendicular direction and
test them. The refined position of the window is
where the hypothesized line has the best score for
the window boundary. Details are available from
Lee and Nevatia (2004).
Figure 10 illustrates the result for the example of
one façade, yet multiple images, and indicates that
the window areas do get defined to within ± 3 pixel
in both the horizontal as well as vertical dimensions,
converting to a value of ± 0.3 m vertically and ± 0.3
m horizontally. In the example shown in Figure 11,
all 33 windows of the façade were found in all 4
aerial images. As one can see in Figure 10 also the 6
basement window openings in every façade could be
detected by evaluating their positions and size in the
image. A door is also detected using its size and
location.
FROM AERIAL IMAGES TO A DESCRIPTION OF REAL PROPERTIES - A Framework
289