based matching between different symbols (Belongie
et al., 2002). We see that most of the prior work fo-
cuses on extracting symbols from such engineering
diagrams or flow charts. To the best of our knowl-
edge, there exists no work which has proposed an end-
to-end pipeline for automating the information extrac-
tion from plant engineering diagrams such as P&ID.
In literature, Connected Component (CC) analy-
sis (Koo and Kim, 2013) has been used extensively
for extracting characters (Gupta et al., 2017) from im-
ages. However, connected components are extremely
sensitive to noise and thresholding may not be suit-
able for P&ID text extraction. Hence, we utilize
the recently invented Connectionist Temporal Pro-
posal Network (CTPN) (Tian et al., 2016) to detect
text in the image with impressive accuracy. For line
detection, we utilize Probabilistic hough transform
(PHT) (Kiryati et al., 1991) which is computationally
efficient and fast version of the standard hough trans-
form as it uses random sampling of edge points to
find lines present in the image. We make use of PHT
for determining all the lines present in P&ID sheets
which are possible candidates for pipelines. In our
paper, we propose the use of Fully convolutional neu-
ral network (FCN) based segmentation (Shelhamer
et al., 2016) for detecting symbols because trandi-
tional classification networks were unable to differ-
entiate among different types of symbols due to very
minute inter-class differences in visual appearances
and presence of noisy and textual information present
inside symbols. FCN incorporates contextual as well
as spatial relationship of symbols in the image, which
is often necessary for accurate detection and classifi-
cation of P&ID symbols.
3 OVERVIEW
The main objective of the paper is to extract
the information from the P&ID sheets representing
schematic process flow through various components
like pipelines, valves, actuators etc. The information
is extracted from P&ID and stored in a data struc-
ture that can be used for querying. The P&ID dia-
gram shown in Figure 1 depicts the flow of oil through
pipelines from inlet to outlet, where outlets and in-
lets denote the point of entry and exit of the oil, re-
spectively. Each outlet is unique and may connect to
multiple inlets, forming a one-to-many relationship.
The symbols indicate the machine parts present on
the pipeline to control the flow and to filter the oil in a
specific way. The pipelines are identified by a unique
P&ID code which is written on top of every pipeline.
To capture all the information from the P&ID
sheets, we propose a two-step process as follows :
• In the first step, we identify all the individual com-
ponents like pipelines, pipeline codes, symbols,
inlets and outlets. We use conventional image
processing and vision techniques like connected
component analysis (Koo and Kim, 2013), proba-
bilistic hough transform (Kiryati et al., 1991), ge-
ometrical properties of components etc. to local-
ize and isolate pipelines, pipeline codes, inlets and
outlets. Symbol detection is carried out by using
fully convolutional neural network based segmen-
tation (Shelhamer et al., 2016) as symbols have
very minute inter class variations in visual appear-
ances. Text detection is performed via a Connec-
tionist Text Proposal Network (CTPN), and the
recognition is performed via the tesseract OCR li-
brary.
• In the second step, we associate these components
with each other and finally capture the flow of
oil through pipelines by forming a tree-like data
structure. The tree is able to represent one-to-
many relationship where each outlet acts as root
node and each inlet is treated as a leaf node. The
pipelines represent intermediate nodes present in
the tree.
4 PROPOSED METHODOLOGY
In this section, we discuss the proposed methodolody
for extracting information from P&ID sheets in de-
tail. It is a two-step process as shown in Figure 2
in which the first step involves detection and recog-
nition of individual components like pipeline-codes,
symbols, pipelines, inletss and outlet. The second
step involves association of detected components with
the appropriate pipelines followed by formulation of
tree-like data structure for finding the process flow of
pipeline schematics. These steps are detailed as fol-
lows :
4.1 Detection and Recognition
We use vision techniques for extracting different com-
ponents like pipeline-codes, symbols, pipelines, inlets
and outlets present in P&IDs. We divide these com-
ponents into two-types : 1. text containing pipeline-
codes and 2. graphic objects like pipelines, symbols.
As observed from Figure 1, P&ID sheets have text
present which represents pipeline code, side notes,
sometimes as part of a symbol or container / symbol /
tag numbers, we call these text segments as pipeline-
code. The non-text components like pipelines, sym-
bols, inlets and outlets are termed as graphic objects.
ICPRAM 2019 - 8th International Conference on Pattern Recognition Applications and Methods
166