need of looking at the whole content. Furthermore,
reducing the produced metadata to only the most rele-
vant and uniquely representing parts will increase the
interpretability and comparability. Lastly, collecting
such metadata on research data for a research project
could be used to act as a knowledge graph of the
whole research to identify what the research is actu-
ally about and discovering the novelty of it.
REFERENCES
Burgess, A. B. and Mattmann, C. A. (2014). Automati-
cally classifying and interpreting polar datasets with
apache tika. In Joshi, J., editor, 2014 IEEE 15th Inter-
national Conference on Information Reuse and Inte-
gration (IRI), pages 863–867, Piscataway, NJ. IEEE.
Corcho, O., Eriksson, M., Kurowski, K., Ojster
ˇ
sek,
M., Choirat, C., van de Sanden, M., and Cop-
pens, F. (2020). Eosc interoperability frame-
work (v1.0): Draft for community consultation.
https://www.eoscsecretariat.eu/sites/default/files/eosc-
interoperability-framework-v1.0.pdf.
Corcoglioniti, F., Rospocher, M., and Aprosio, A. P. (2016).
Frame-based ontology population with pikes. IEEE
Transactions on Knowledge and Data Engineering,
28(12):3261–3275.
Cyganiak, R., Lanthaler, M., and Wood, D. (2014). RDF
1.1 concepts and abstract syntax. W3C recommenda-
tion, W3C. http://www.w3.org/TR/2014/REC-rdf11-
concepts-20140225/.
DCMI Usage Board (2002). Dcmi metadata terms.
https://www.dublincore.org/specifications/dublin-
core/dcmi-terms/.
Grunzke, R., Hartmann, V., Jejkal, T., Kollai, H., Prabhune,
A., Herold, H., Deicke, A., Dressler, C., Dolhoff,
J., Stanek, J., Hoffmann, A., M
¨
uller-Pfefferkorn, R.,
Schrade, T., Meinel, G., Herres-Pawlis, S., and Nagel,
W. E. (2018). The masi repository service — com-
prehensive, metadata-driven and multi-community re-
search data management. Future Generation Com-
puter Systems.
Heilmann, J., Tucci, A., Plante, E., and Miller, J. F. (2020).
Assessing functional language in school-aged chil-
dren using language sample analysis. Perspectives of
the ASHA Special Interest Groups, 5(3):622–636.
Iglezakis, D. and Schembera, B. (2019). EngMeta - a Meta-
data Scheme for the Engineering Sciences.
Jane Greenberg (2004). Metadata extraction and harvesting.
Journal of Internet Cataloging, 6(4):59–82.
Knublauch, H. and Kontokostas, D. (2017). Shapes
constraint language (SHACL). W3C recommenda-
tion, W3C. https://www.w3.org/TR/2017/REC-shacl-
20170720/.
Kumar, Y. and Singh, N. (2019). A comprehensive view
of automatic speech recognition system - a systematic
literature review. In 2019 International Conference on
Automation, Computational and Technology Manage-
ment (ICACTM), pages 168–173.
Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kon-
tokostas, D., Mendes, P. N., Hellmann, S., Morsey,
M., van Kleef, P., Auer, S., and Bizer, C. (2015). Db-
pedia – a large-scale, multilingual knowledge base ex-
tracted from wikipedia. Semantic Web, 6(2):167–195.
Lubas, R. L., Jackson, A. S., and Schneider, I. (2013). In-
troduction to metadata. In Lubas, R. L., Jackson,
A. S., and Schneider, I., editors, The Metadata Man-
ual, Chandos Information Professional Series, pages
1–15. Chandos Publishing.
Mattmann, C. and Zitting, J. (2011). Tika in action.
Politze, M., Bensberg, S., and M
¨
uller, M. (op. 2019a).
Managing discipline-specific metadata within an inte-
grated research data management system. In Filipe, J.,
editor, Proceedings of the 21st International Confer-
ence on Enterprise Information Systems ICEIS 2019,
Heraklion, Crete - Greece, May 3 - 5, 2019, ICEIS
(Set
´
ubal), pages 253–260, [S. l.]. SciTePress.
Politze, M., Schwarz, A., Kirchmeyer, S., Claus, F., and
M
¨
uller, M. S. (2019b). Kollaborative Forschungsun-
terst
¨
utzung : Ein Integriertes Probenmanagement. In
E-Science-Tage 2019 : data to knowledge / heraus-
gegeben von Vincent Heuveline, Fabian Gebhart und
Nina Mohammadianbisheh, pages 58–67, Heidelberg.
E-Science-Tage, Heidelberg (Germany), 27 Mar 2019
- 29 Mar 2019, Universit
¨
atsbibliothek Heidelberg.
R. Smith (2007). An overview of the tesseract ocr engine.
In Ninth International Conference on Document Anal-
ysis and Recognition (ICDAR 2007), pages 629–633.
Rodrigo, G. P., Henderson, M., Weber, G. H., Ophus, C.,
Antypas, K., and Ramakrishnan, L. (2018). Science-
search: Enabling search through automatic metadata
generation. In 2018 IEEE 14th International Confer-
ence on e-Science (e-Science), pages 93–104.
Schmitz, D. and Politze, M. (2018). Forschungsdaten man-
agen – bausteine f
¨
ur eine dezentrale, forschungsnahe
unterst
¨
utzung. o-bib. Das offene Bibliotheksjournal /
Herausgeber VDB, 5(3):76–91.
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J. J.,
Appleton, G., Axton, M., Baak, A., Blomberg, N.,
Boiten, J.-W., da Silva Santos, L. B., Bourne, P. E.,
Bouwman, J., Brookes, A. J., Clark, T., Crosas, M.,
Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T.,
Finkers, R., Gonzalez-Beltran, A., Gray, A. J. G.,
Groth, P., Goble, C., Grethe, J. S., Heringa, J., ’t Hoen,
P. A. C., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher,
S. J., Martone, M. E., Mons, A., Packer, A. L., Pers-
son, B., Rocca-Serra, P., Roos, M., van Schaik, R.,
Sansone, S.-A., Schultes, E., Sengstag, T., Slater, T.,
Strawn, G., Swertz, M. A., Thompson, M., van der
Lei, J., van Mulligen, E., Velterop, J., Waagmeester,
A., Wittenburg, P., Wolstencroft, K., Zhao, J., and
Mons, B. (2016). The fair guiding principles for sci-
entific data management and stewardship. Scientific
data, 3:160018.
KDIR 2020 - 12th International Conference on Knowledge Discovery and Information Retrieval
234