• ITP Number: Identifier number of a given ITP;
• Quantity: Number of items in a given purchase
order;
• Register: Describes which document is the
record that a certain inspection phase was per-
formed;
• Test Result: Result of a given test or test for a
physical or chemical property;
• Type of Inspection: Indicates the type of man-
ufacturing inspection, which varies according to
the criticality of the material, operational com-
plexity of the material, complexity or novelty of
the manufacturing process and quality control,
and complexity or uniqueness of the project;
Object Properties
Object properties are basically divided into two
groups, must have and may have. During the elab-
oration of the ontology, it was argued that, in the case
of the quality certificate, some attributes must always
appear, i.e., they are mandatory for the verification
of completeness. However, other attributes have a
certain frequency in the documents, but they are not
mandatory for verifying completeness, i.e., the qual-
ity certificate can have these attributes. In the table
1 the properties, their domains, and their ranges are
displayed.
Axioms
The axioms developed have the function of impos-
ing restrictions on the classes used for completeness
checking. That is, these axioms represent the rules
that a certain type of document must respect in order
to be considered complete or incomplete. To imple-
ment these axioms, the descriptive logic provided by
Prot
´
eg
´
e was used. For each of the axioms a subclass
of Completeness was created. These classes are de-
scribed below.
• Class Quality Certificate Complete: A quality
certificate is said to be complete if all relevant in-
formation is correctly extracted. This information
is defined by the object property Quality certifi-
cate must have and are: Signature, Stamp, Cus-
tomer, Date, Certificate Number and Test Result;
• Class Quality Certificate Not Complete: A qual-
ity certificate is not complete if at least one of the
classes defined in the property Quality certificate
must have is not extracted;
• Class Purchase Orders Complete: A purchase or-
der is considered complete if it has all relevant
information present. That is, a purchase order
is considered complete if the Signature, Stamp,
NCM Code, Date, Description, Supplier, Item,
Reference Norm, Purchase Order Number and
Quantity information is extracted successfully;
• Class Purchase Orders Not Complete: A pur-
chase order is considered incomplete if at least
one of the relevant information is not extracted;
• Class Complete ITPs: A ITP is said to be com-
plete if it has all relevant information present.
That is, it must contain Signature, Stamp, Cus-
tomer, Date, Equipment, Inspection Phases, Sup-
plier, ITP Number, Registration;
• Class Non-complete ITPs: A ITP is said to be
non-complete if at least one of the relevant infor-
mation is not extracted.
• ASTM A193 B7 Compliant Chemical Analysis
Certificates: For a chemical analysis certificate
to comply with the ASTM A193 B7 standard, it
is necessary that the reference standard is the one
referred to and that the values of the chemical el-
ements correspond to those shown in the Table 2;
• Non-ASTM A193 B7 Chemical Analysis Certifi-
cates: For a chemical analysis certificate not to
comply with the ASTM A193 B7 standard, it is
necessary that the reference standard is the one
referred to and that at least one of the values pre-
sented in the Table 2 is different from that found
in the certificate.
2.2 Pipeline for Data Conformity
As described in this section, the data extracted
through OCR is stored in a database. Subsequently,
a Python script is utilized to query the database and
convert the data into instances within the ontology,
establishing the necessary relationships for accurate
representation.
In the Prot
´
eg
´
e environment, the generated file,
containing only the classes and instances, is opened
along with the OntoToT ontology file, which encom-
passes all the developed classes and rules. Following
this, the inference mechanism is executed, utilizing
the axioms to perform analysis on the completeness
and conformity of the documents. The results of this
analysis are displayed on the screen. The entire pro-
cess is visually depicted in Figure 4.
3 RESULTS
To evaluate the proposed system, a set of databooks
that align with the scope of the study was chosen.
Data Digitalization and Conformity Verification in Oil and Gas Industry Databooks Using Semantic Model Based on Ontology
185