ogy is used for automatic cataloging of IT products
(Song et al., 2009).
Another method for ontology population is in the
risk management domain. The method tries to popu-
late the ontology semi-automatically from fact sheet
documents using combined Natural Language Pro-
cessing (NLP) techniques. It extracts the verbs from
natural language text and matches them to the cor-
responding relations in the T-Box ontology. How-
ever, the human intervention is still needed for control
and validation (Makki et al., 2009). There are some
other works that propose ontology population meth-
ods from unstructured texts based on NLP approaches
(Vargas-Vera et al., 2007), (Maynard et al., 2009). Up
to now, there exists no method to populate ontologies
in the building management domain.
2.2 Building Layouts
CAD design tools, such as AutoCAD, ArchiCAD or
Revit are commonly used for the creation of building
layouts representing two or three dimensional draw-
ing (plans, sections, elevations). Further CAD-based
software is used to plan and model many domains
of a building, such as ventilation, heating, access
controls and photovoltaic (Krahtov et al., 2009). The
number of elements in a sketch and its complexity
may vary (Donath, 2009) (see Figure 1).
Building information modeling (BIM) is the pro-
cess of development and use of computer generated
models to simulate the lifecycle of a facility including
planning, design, construction and operation (Azhar
et al., 2008). The resulting model of BIM is a data-
rich, object-oriented, intelligent and parametric digi-
tal representation of the facility and serves as shared
knowledge resource which helps the decision mak-
ing at each stage of the facility lifecycle (Azhar et al.,
2008). With the BIM model the former 2D construc-
tion drawings are augmented with intelligent contex-
tual semantic, where objects are defined in terms of
building elements and systems such as spaces, walls,
beams and columns (CRC, 2007). To achieve the BIM
concepts, an open standardized data model called IFC
(Industry Foundation Classes) for enabling interop-
erability between BIM software and containing the
semantic information of the facility has been devel-
oped. The reason one can not use the benefits of IFC
for the population of building ontologies is that to-
day after almost 20 years of IFC development (since
1994) we witness the low usage in actual construction
drawings. Laakso assumes that this is caused by the
slow adoption of collaborative model-based construc-
tion processes and industry reluctance to switch over
to new IT tools (Laakso and Kiviniemi, 2012). Even
if IFC becomes more used, there will still be cases
with drawing-like data without semantic information.
2.3 Geometric Pattern Matching
To populate the building ontology in an efficient way,
pattern matching algorithms are used to find all enti-
ties like doors, windows and furniture. There are a lot
of applications where pattern matching plays an im-
portant role. These include pose determination, com-
puter aided design, robot vision and many more. This
work considers a very small subset of pattern match-
ing methods. Spatial pattern matching is the process
of finding a geometric transformation to match two
given images. Only the special case where the im-
age is a 2D vector graphic is called geometric pat-
tern matching. Moreover only exact and total pattern
matching is considered. This is the case for whole
matches with an optimal transformation. This means
that matching patterns are identical. As described by
(Hagedoorn, 2000) there are different methods for ge-
ometric pattern matching, for instance graph match-
ing and geometric hashing. Graph matching means
that the structure of a pattern is described as a graph
and matching is performed between the graphs. Ge-
ometric hashing means that the pattern as a whole is
described by a normalised description. The drawback
of geometric hashing is that it works for known pat-
terns, as a data structure (the hash table) has to be
constructed for the whole data.
This work uses a correspondence method where
patterns are matched by fitting pairs of geometric
primitives. This method combines the geometric
primitives that make up the input pattern. An example
is pairing line segments, where each combination of
two line segments in the patterns must fit to make a
match.
3 SOLUTION
Our methodology uses a semi-automated approach
for the extraction of semantic information from build-
ing CAD drawings using user input at different stages.
The drawings are exported from CAD design software
using the exchange format DXF. OntoCAD imports
and draws the primitives, layers and view-ports. A T-
Box ontology (see Section 3.2) is used as input, which
allows the user to choose building elements from a
building taxonomy. The user can see and use the vec-
tor based primitive representations to add semantic
information. The population process is accelerated
through the OntoCAD user interface and the pattern
Semi-automatedOntologyPopulationfromBuildingConstructionDrawings
381