New York, New York, USA. ACM Press. Software
available at http://pdfx.cs.man.ac.uk/.
Déjean, H. and Meunier, J. (2006). A system for converting
pdf documents into structured xml format. In Docu-
ment analysis systems VII, Lecture notes in computer
science, pages 129–140. Springer, Berlin. Software
available at https://sourceforge.net/projects/pdf2xml/.
Garain, U. and Chaudhuri, B. B. (2005). A corpus for ocr
research on mathematical expressions. International
Journal of Document Analysis and Recognition (IJ-
DAR), 7(4):241–259.
Glawe, M., et al. (2015). Knowledge-based engineering of
automation systems using ontologies and engineering
data. In Proceedings of the 7th International Joint
Conference on Knowledge Discovery, Knowledge En-
gineering and Knowledge Management (IC3K 2015),
pages 291–300.
Hildebrandt, C., et al. (2017). Reasoning on engineering
knowledge: Applications and desired features. In
European Semantic Web Conference, volume 10250
of Lecture notes in computer science, pages 65–78,
Cham. Springer International Publishing.
Hildebrandt, C., et al. (2020). Ontology building for
cyber–physical systems: Application in the manufacturing
domain. IEEE Transactions on Automation Science
and Engineering, 17(3):1266–1282.
iText Group nv (2021). itext. Retrieved March 6, 2021 from
https://itextpdf.com/.
Jager, T., et al. (2011). Mining technical dependencies
throughout engineering process knowledge. In
ETFA2011, pages 1–7. IEEE.
Khusro, S., Latif, A., and Ullah, I. (2015). On methods
and tools of table detection, extraction and annotation
in pdf documents. Journal of Information Science,
41(1):41–57.
Lee, C., Bzdak, J., and Lannon, B. (2014). pdf-
table-extract. Retrieved March 6, 2021 from
https://github.com/ashima/pdf-table-extract.
Liu, Y. (2009). Tableseer: Automatic Table Extraction,
Search and Understanding. Dissertation, The
Pennsylvania State University. Software available at
https://sourceforge.net/projects/tableseer/.
Loibl, A., Manoharan, T., and Nagarajah, A. (2020).
Procedure for the transfer of standards into
machine-actionability. Journal of Advanced Me-
chanical Design, Systems, and Manufacturing,
14(2):JAMDSM0022–JAMDSM0022.
Luong, M., Nguyen, T. D., and Kan, M. (2012).
Logical structure recovery in scholarly articles
with rich document features. In Multimedia
Storage and Retrieval Innovations for Digi-
tal Library Systems, pages 270–292. Software
available at https://github.com/knmnyn/ParsCit/
tree/master/bin/sectLabel.
Manoharan, T., et al. (2019). Approach for a
machine-interpretable provision of standard contents using
welded constructions as an example. Proceedings of
the Design Society: International Conference on En-
gineering Design, 1(1):2477–2486.
Nitro Software, Inc. (2021). Nitro pro. Retrieved March 6,
2021 from https://www.gonitro.com/.
Oro, E. and Ruffolo, M. (2008). Xonto: An ontology-based
system for semantic information extraction from pdf
documents. In 2008 20th IEEE International Confer-
ence on Tools with Artificial Intelligence, pages 118–
125. IEEE.
Oro, E. and Ruffolo, M. (2009). Pdf-trex: An approach for
recognizing and extracting tables from pdf documents.
In 2009 10th International Conference on Document
Analysis and Recognition, pages 906–910. IEEE.
Perez-Arriaga, M. O., Estrada, T., and Abad-Mota, S.
(2016). Tao: System for table detection and extraction
from pdf documents. Proceedings of the Twenty-Ninth
International Florida Artificial Intelligence Research
Society Conference, pages 591–596.
Pitale, S. and Sharma, T. (2011). Information extrac-
tion tools for portable document format. Interna-
tional Journal of Computer Technology 2011, Vol
2(6):2047–2051.
Schmidberger, T. and Fay, A. (2007). A rule format for in-
dustrial plant information reasoning. In 2007 IEEE
Conference on Emerging Technologies & Factory
Automation (ETFA 2007), pages 360–367. IEEE.
Sciweavers LLC (2021). i2ocr. Retrieved March 6, 2021
from https://www.i2ocr.com/.
Sumatra (2021). Sumatra pdf reader. Retrieved March
6, 2021 from https://www.sumatrapdfreader.org/free-
pdf-reader.
Suzuki, M., et al. (2004). An integrated ocr
software for mathematical documents and its output
with accessibility. In Computers Helping Peo-
ple with Special Needs, volume 3118 of Lec-
ture notes in computer science, pages 648–655.
Springer, Berlin and Heidelberg. Software available at
http://www.inftyreader.org/.
Wei, X., Croft, B., and McCallum, A. (2006). Table ex-
traction for answer retrieval. Information Retrieval,
9(5):589–611.
Yildiz, B., Kaiser, K., and Miksch, S. (2005). pdf2table:
A method to extract table information from pdf
files. IICAI, pages 1773–1785. Software available at
http://ieg.ifs.tuwien.ac.at/projects/pdf2table.
Zhang, J. and El-Gohary, N. M. (2015). Automated infor-
mation transformation for automated regulatory com-
pliance checking in construction. Journal of Comput-
ing in Civil Engineering, 29(4).
Zhang, J. and El-Gohary, N. M. (2017). Semantic-based
logic representation and reasoning for automated reg-
ulatory compliance checking. Journal of Computing
in Civil Engineering, 31(1):04016037.
Towards Automation of Regulatory Compliance Checking in the Product Design Phase