and the most widespread model is the relational one,
although the object-oriented model and the data
warehouse are being used more and more.
The database world, in order to manage data from
all these heterogeneous sources and formats, has
also had to adapt to the new market, for example by
allowing these new types of web data to be stored, or
by changing the way data is worked with so that it is
used as knowledge, as in the data warehouse.
Therefore, because almost all organizations
eventually move onto the web, where XML is emerging
as a more effective means of describing the semantic
content of WWW documents, and because most
business information continues to be stored in
relational databases, it would be useful to manage a
relational database from a web perspective using
XML. That is, XML would be used not only as an
interface to pass information between systems that
interact with each other on the web, but also as the
basis of a language to access the databases of
these systems.
In this paper we focus only on these
two main data sources: the conventional relational
database system, because, as we have just said, most
business data continues to be stored in it, and XML
data, because it is the most widespread data format for
exchanging and representing information on the web.
The idea is to integrate these two
types of data so that they can be used later, for
example to store them in a data warehouse or to
exchange their information on the web.
Moreover, one of the main requirements of the
query system, as we have just said, is that it be
XML-based in appearance.
Another important requirement is that the
data keeps its original format: the information
remains stored in the database and does not have
to go through any migration process.
Our goal is to make it possible to query a relational
database, regardless of its complexity, size or the
subject of its information, as if it were an XML
document, and to obtain XML data after query
execution too. The end user will not need to know
how the data is really stored; to the user it will appear
as if an XML document were being queried.
The main contribution of this paper is to explain the
foundations of this new query system, called XBD
(XML for Databases), which implements this
functionality. XBD is a query system based on XML
for querying relational databases.
2 OBTAINING XML FROM
RELATIONAL SOURCES
The purpose of this research is to help integrate
relational database and XML data by obtaining XML
data from relational database data. Once all the
information, from databases or XML documents,
is in the same format, XML, it will be easier to
manage it together.
Recent versions of traditional Database
Management Systems (DBMS) such as Oracle (Oracle,
2002a) (Oracle, 2002b), Microsoft SQL Server
(Microsoft, 2002) or IBM DB2 (IBM, 2002) have
had to be updated to work with XML data.
As a result, they can already store XML data
together with relational information, introducing new
data types like XMLType in Oracle and SQL
Server, or XML column and XML collection in
DB2. But in the present research we are not
interested in storing XML information in
database systems; we are only interested in how to
obtain XML data from relational sources. In this
respect, these new versions of the most widely used
DBMS allow relational data to be queried in the
database query language SQL while returning the
query results in XML format.
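As an illustration of the idea of issuing a SQL query and receiving the results as XML, the following is a minimal sketch in Python. It does not use the XML facilities of any of the DBMS cited above; it emulates the effect on the client side with the standard sqlite3 and xml.etree.ElementTree modules. The function name query_to_xml, the element names "result" and "row", and the sample customer table are assumptions made only for this example.

```python
import sqlite3
import xml.etree.ElementTree as ET


def query_to_xml(conn, sql, root_tag="result", row_tag="row"):
    """Run a SQL query and wrap every result row in XML elements.

    Assumption: column names are valid XML element names.
    """
    cursor = conn.execute(sql)
    # Column names are read from the cursor metadata, so the function
    # needs no prior knowledge of the table schema.
    columns = [desc[0] for desc in cursor.description]
    root = ET.Element(root_tag)
    for record in cursor:
        row = ET.SubElement(root, row_tag)
        for name, value in zip(columns, record):
            child = ET.SubElement(row, name)
            child.text = "" if value is None else str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customer VALUES (?, ?, ?)",
    [(1, "Ana", "Madrid"), (2, "Luis", "Sevilla")],
)

xml_text = query_to_xml(conn, "SELECT id, name FROM customer ORDER BY id")
print(xml_text)
```

Note that the query itself is still written in SQL; only the shape of the result is XML.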
Using active server pages written with Java
Server Pages, Microsoft Active Server Pages or PHP,
it is also possible to achieve the same effect.
The SQL query to the database is included inside
the web page code, and the page returns the database
query results in XML. These results are added when
the web page is displayed in a browser.
In both kinds of tools, DBMS tools or active
server pages, the query language is SQL. In our
query system we want not only the query results
to have an XML appearance, but also the query
language to be derived from XML, so we will not use
SQL as the query language.
Other systems related to our
research are those that transform the data
model of a relational database into a special
XML view called a virtual XML view. This view
shows the database content to the user so that the
user can later query the database using XML
languages and obtain the results in XML format,
too. The virtual XML view is similar to an XML
document in which the content of the database
records would be stored, but it is “virtual” because the
database data is not physically translated to
XML.
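The notion of a virtual view can be sketched as follows: the user writes a path over an imagined XML document, the path is translated into SQL only when the query is issued, and nothing is ever materialized as XML except the answer. This is a simplified illustration, not the mechanism of XBD or of any system cited in this paper. The path syntax (/table/column, optionally with a [column='value'] filter), the function names translate and query_view, and the sample table are all hypothetical.

```python
import re
import sqlite3
import xml.etree.ElementTree as ET

# Accepts only /table/column or /table[column='value']/column.
# Names are restricted to word characters, so they can be placed
# in the SQL text without quoting.
PATH = re.compile(r"^/(\w+)(?:\[(\w+)='([^']*)'\])?/(\w+)$")


def translate(path):
    """Translate a path over the virtual view into parameterized SQL."""
    match = PATH.match(path)
    if match is None:
        raise ValueError(f"unsupported path: {path}")
    table, filter_col, filter_val, column = match.groups()
    sql = f"SELECT {column} FROM {table}"
    params = []
    if filter_col is not None:
        sql += f" WHERE {filter_col} = ?"
        params.append(filter_val)
    return sql, params, table, column


def query_view(conn, path):
    """Answer a path query in XML; the table itself is never converted."""
    sql, params, table, column = translate(path)
    root = ET.Element("result")
    for (value,) in conn.execute(sql, params):
        row = ET.SubElement(root, table)
        ET.SubElement(row, column).text = "" if value is None else str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customer VALUES (?, ?, ?)",
    [(1, "Ana", "Madrid"), (2, "Luis", "Sevilla")],
)

print(query_view(conn, "/customer[city='Madrid']/name"))
```

The user sees only the path and the XML answer; the SQL produced by translate stays hidden, and the data remains in its relational tables.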
The two main systems of this type are
XTABLES (Funderburk, 2002) and SilkRoute
(Fernández, 2002).
In XTABLES the virtual XML view is called the
“default view”. This system uses XQuery as query
ICEIS 2005 - DATABASES AND INFORMATION SYSTEMS INTEGRATION