
developers need to have a cross-domain knowledge,
namely of core information models and APIs, so that
their service can be used across application domains
such as Smart Cities, Smart Industry, Smart Agricul-
ture, and additionally they also need a deeper under-
standing of the terms in the specific domain and use
cases in order to efficiently design or select the appro-
priate conceptual scheme for the required data model.
As this requires a high level of expertise in domain
concepts as well as cross-domain and core standards,
and as such expertise appears to be rare, a standard-
ised information model for the domain could make
the process more efficient on the service development
side. Without supporting standards, building the data
model from scratch appears to be time-consuming,
as the different concepts that may exist for a given
term, depending on the semantic artefact in which it
is found, need to be carefully investigated. For ex-
ample, (Baker et al., 2019) shows a side-by-side visu-
alisation of the data (properties and values) provided
by the semantic artefacts GACS (GACS, 2024) and
FoodOn (FoodOn, 2024), proposing comparable but
not identical concepts for maize. The choice of an
appropriate concept based on terms is crucial to en-
sure that the code is not only syntactically correct, but
also logically meaningful and functional. Develop-
ers typically capture this understanding in “Informa-
tion Models”, from which a program can perform au-
tomated reasoning. Such models can also be called
“knowledge representations” of the domain (Hoek-
stra, 2009) and are derived from semantic artefacts.
Semantic artefacts in general use specific “semantic
technologies” to describe the knowledge of a domain.
They cover the perspectives, concepts and terminol-
ogy of their provider’s domain of interest, which may
vary widely even within the same domain. They re-
flect “concepts” as building blocks of a specific do-
main theory. Semantic artefacts include “thesauri”,
“taxonomies”, “controlled vocabularies” and “ontolo-
gies”. Many have in common that they are based on
research data. (Subirats-Coll et al., 2022) highlights
two basic principles for using semantic resources to
achieve semantic interoperability in an IT service:
This is the use of a globally unique web identifica-
tion - a “URI” (Uniform Resource Identifier) - to iden-
tify a logical/physical resource used by web technolo-
gies, and secondly, a data model that can accommo-
date all existing data sets and formats, essentially a
graph-oriented data model. The “Resource Descrip-
tion Framework” (RDF) follows these principles of
linked data and is seen as a catalyst for semantic inter-
operability (Subirats-Coll et al., 2022). The “Linked
Data Concept” (Berners-Lee, 2006) is based on this,
dealing with redundancy and handling it with links.
Semantic resources can appear in different knowl-
edge representation languages such as OWL, RDFS,
SKOS, OBO and UMLS-RRf etc.
(Baker et al., 2019) arranges semantic artefacts
or resources on a “semantic spectrum” ranging from
“lightweight” to heavyweight”. Leightweight re-
sources include e.g. “controlled lists” as standard-
ised and organised arrangements of words or phrases
or as thesauri and taxonomies with a hierarchical
structure. Languages such as RDF/RDFS, SKOS,
UML are suitable for representing these, as no rea-
soning is required (Hoekstra, 2009). In the “semantic
spectrum” given by (Baker et al., 2019), the seman-
tic resources GACS and ARGROVOC (FAO, 1981)
are labeled as “concept schemes” and classified as
“middleweight”, whereas the well-known FoodOn,
Crop (Ontology, 2024a) and Gene Ontology (Ontol-
ogy, 2024b) are labelled as “Ontologies” and “heavy-
weight”. They represent knowledge and are reusable
terminological knowledge representations (Hoekstra,
2009). Ontologies are semantically described with
rich metadata (Drury et al., 2019). Languages suitable
for representing ontologies are developed to support
ontology-based reasoning services within knowledge-
based applications. The main standards used are the
OWL-DL family (Web Ontology Language Descrip-
tion Logic) (Hoekstra, 2009; Baker et al., 2019). Over
the last few decades, international organisations have
built up large semantic artefacts for agronomy and
have carried out a multifaceted collaboration on ed-
itorial service tools, concept mapping, etc. Examples
include the prominent AGROVOC of the Food and
Agriculture Organisation (FAO), the NAL Thesaurus
(NALT) (NALT, 2024) of the National Agricultural
Library (NAL) and the CAB Thesaurus of the Cen-
tre for Agricultural Bioscience International (CABI,
2024). Developers can find them in repositories such
as AgroPortal (AgroPortal, 2024) or Fairsharing.org
etc. In addition to these general agronomic semantic
resources, there is a wide range of specific semantic
resources, all of which provide specific exchange for-
mats and associated services. A good overview can
be found in (Arnaud et al., 2020). Nevertheless, the
use of ontologies in IT service development remains
underexploited (Drury et al., 2019). As argued above,
one reason may be that the proliferation of semantic
resources that redefine the same concepts, even when
based on linked data, is not straightforward for the av-
erage developer. Key questions are on how to improve
the linking of various resources (Subirats-Coll et al.,
2022) and how to encourage developers to use this
complex and rich knowledge of semantic artefacts for
service development. The next two sections describe
some domain-specific semantic artefacts and reposi-
DATA 2024 - 13th International Conference on Data Science, Technology and Applications
350