work such as ISO 15000 (ebXML) to provide a com-
plete, standards-based infrastructure that can extend
the benefits of existing EDI systems to businesses of
all sizes. UBL Library is based on a conceptual model
of information components known as Business Infor-
mation Entities (BIEs). These components are assem-
bled into specific document models. One document is
a set of information components that are interchanged
as part of a business transaction; for example, in plac-
ing an order. This approach facilitates the creation of
UBL-based document types beyond those specified in
this release(Bosak et al., 2011).
Our proposal uses ontologies implemented in
OWL (Web Ontology Language). Advantages using
ontologies compared to previous models, more ro-
bust and worked our proposal, we could say: OWL
is a standard semantic markup language for publish-
ing and sharing ontologies on the World Wide Web
and the Semantic Web. We have freedom to reuse the
content ontology in other services together in a ser-
vice document generation, i.e., because it is indepen-
dent of format ontology, and finally we can harness
the power of ontologies to infer new knowledge do-
main.
6 USING ONTOLOGIES FOR
MODELING DIGITAL
DOCUMENTS
Ontologies are designed for enabling knowledge shar-
ing and reuse on some domain that can be commu-
nicated between people and computers. Therefore,
to enable the sharing and reuse of knowledge, it is
necessary a formal specification of concepts. Ontolo-
gies define rules of relationships between concepts to
query, infer knowledge(Gruber, 1995).
The documents present information depending on
its purpose, and this information can be presented in
different ways. Our proposal is to build a model that
considers essential qualities of digital document. Fol-
lowing the approach of engineering documents, a doc-
ument is considered as combination of information
components and presentation components. Whereas
the information is independent of how it is presented.
This model is based on a integration of two ontolo-
gies. An ontology that represents the presentation
structure and an other that represents the information
according business context, i.e. , a format ontology
and a context ontology. These ontologies are indepen-
dent between each other. The objetive of format on-
tology is to achieve simple document portability. The
purpose of context ontology is to achieve document
interoperability. The mapping between the format on-
tology and content ontology for its physical interfaces
occurs through translators.
6.1 Format Ontology
Format ontology characterizes visual structure of doc-
ument, i.e., formatting settings and graphic infor-
mation. Format ontology specifies formally docu-
ment components, i.e., presentation structure includ-
ing metadata, paragraphs, texts, tables, lists, enumer-
ations, images, styles, etc. The Metadata is the in-
formation associated with the document, for exam-
ple: creation date, last modified date, text language,
document author, pages number, etc. The document’s
layout is based on tables. The tables are composed of
cells, cells can contain paragraphs with images, text
ou maybe another table. Each paragraph of text has
a presentation style, i.e., color, color-font, font, size,
horizontal alignment, vertical alignment, etc. The for-
mat ontology is shown in Figure 3. From the format
ontology, a document can be created in an appropriate
format for its purpose.
6.2 Context Ontology
A business context is a scope in which a special-
ized vocabulary is employed, so the business con-
text defines the type of document information. The
context is used to organize and analyze requirements
and rules information presentation. For example, a
student sheet, a medical record, rental contract, etc.
show different scenarios. For proof of concept this
work takes the context of a student sheet. The main
objetive of a student sheet is to provide academic in-
formation, i.e., grades, attendance, subjects, advisor,
date of birth, date of admission, etc. The context on-
tology was created based on that context. Context on-
tology is shown in Figure 4.
6.3 Generating Documents
Documents are generated from the combination of
format ontology and context. Figure 1 summarizes
the process of document generation. Usually number
of instances to represent a document is big. For exam-
ple, Figure 5 shows a tree of instances of format on-
tology that characterizes the text shown in Figure 6.
Figure 7 shows interaction between one instance of
Text concept of format ontology and another instance
of Institute concept of content ontology.
Document is composed of paragraphs, in-
stances of Text concept are always inside a para-
graph. The string of the Figure 7, #insti-
AnOntologyforPortabilityandInteroperabilityDigitalDocuments-AnApproachinDocumentEngineeringusing
Ontologies
377