
5 CONCLUSIONS
The use of analysis patterns can contribute to the
improvement of GDB conceptual models because
they are tested and aproved solutions. This can
reduce the time needed to the conceptual project and
also reduce the possibility of making mistakes. The
obtainment of these patterns can be done by the
KDD process application. One of the important
phases of this process is the data preprocessing.
Specifically in GDB conceptual schemas the data
preprocessing consists in the integration of the
conceptual schemas designed based on different data
models and with naming variations to the same real
world concepts. Thus the integration must be
performed in two levels, syntacticaly and
semanticaly which was the focus of this paper. The
semantic integration among distinct conceptual
schemas must be aided by an ontology, which allows
searching by names and also searching by structure
as attributes and associations.
Another benefit of using ontologies, is the fact
the knowledge is stored and can be updated and
interchanged. Not only analysis patterns can be
deduced but also the ontologies existing concepts
can help the designer in modeling a new conceptual
schema. However to explore all the ontologies
potentialities and in an efficient way it is necessary
to combine it with another technique very used in
heterogeneous databases, known as similarity
matching.
The next steps of this research are the study of
the similarity matching techniques and more
important the definition of a set of criteria to be
considered in the similarity coefficient calculus and
also the weight of each one. The implementation of
the algorithm proposed in section 4 is also a future
work to test the efficiency of this solution to the
semantic unification.
REFERENCES
Bergamaschi, S. et al., 1998. An Intelligent Approach to
Information Integration. In Internation Conference on
Formal Ontology in Information Systems (FOIS’98).
Italy.
Bassalo, G.H.M.; Iochpe, C.; Bigolin, N., 2002.
Representando esquemas no Formato Atributo-Valor
para a Inferência de Padrões de Análise. In: IV
Brazilian Symposium on GeoInformatics - GeoInfo
2002. Caxambu, Brazil.
Cohen, W.W., 1998. Integration of Heterogeneous
Databases Without Common Domains Using Queries
Based on Text Similarity, In
Proceedings of the 1998
ACM SIGMOD international conference on
Management of data. USA.
Fayyad, U. M.; Piatetsky-Shapiro, G.; Smyth, P, 1996.
From Data Mining to Knowledge Discovery in
Databases. AI Magazine, v.17, n.3, p.37-54.
Gamma, H.E.; Johnson, R.; Vlissides J., 1995. Design
Patterns: Elements of Reusable Object-Oriented
Software. Addison-Wesley.
Guarino, N., 1998. Formal Ontology and Information
Systems. In Proc. of In Internation Conference on
Formal Ontology in Information Systems (FOIS’98).
Italy.
Hess, G.N.; Iochpe, C.; Silva, C.M.S., 2003. RoseGIS:
Uma ferramenta CASE para projeto de banco de dados
geográficos. In GISBrasil 2003. Brazil.
Hodge, G., 2000. Knowledge Organization Systems: An
Overview. In System of knowledge Organization for
Digital Libraries: Beyond Traditional authority files.
OpenGIS Consortium, 2001. Geography markup
Language (GML) 2.0. Open GIS Implementation
Specification. Available in http://www.opengis.net.
Parent, C. et al., 1999. Spatio-temporal conceptual models:
data structures + space + time. In Proc.7th ACM GIS,
Kansas City, USA.
Qin, J.; Paling, S., 2001. Converting a controlled
vocabulary into an ontology: the case of GEM,
Information Research 6, 2001.
Rocha, L. V.; Edelweiss, N.; Iochpe, C., 2001 GeoFrame-
T: A Temporal Conceptual Framework for Data
Modeling. In: ACM Symposium on Advances in GIS.
Atlanta, USA.
Sheth, A.P., 2000. Changing focus on interoperability in
information systems: From systems, syntax, structure
to semantics. In Interoperating Geographic
Information Systems”.
Silva, C.M.S.; Iochpe, C.; Engel, P.M., 2003. Using
Knowledge Discovery in Database to Identify
Analysis Patterns, 5
th
International Conference on
Enterprise Information System, Angers, France.
Sugumaran, V.; Storey, V., 2002. Ontologies for
Conceptual Modeling: their creation, use and
management. In Data & Knowledge Engineering.
Elsevier.
ICEIS 2004 - INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION
512