the traffic sign detection model might not have been
exposed to engineers and analysts. On the other hand,
if the machine learning model is not trained to detect
pedestrians, then pedestrians are unknowns to the traf-
fic sign detection model. However, this does not im-
ply pedestrians are unknowns to the autonomous ve-
hicle as it can have another machine learning model
which can detect pedestrians. In this example, we
can observe that identifying unknowns of a machine
learning model helps us to understand what additional
machine learning models we will need to ensure that
we can identify objects that are part of ODD and
can compromise safety. Exposing unknowns to au-
tonomous vehicles will aid in making better design
decisions. Similarly, exposing unknowns to engineers
and analysts helps to make better safety solutions,
gather better data, and revise ODD. These observa-
tions lead to our central research question: “Should
we consider unknowns from different perspectives
to ensure safety of an autonomous vehicle?”. We
address this central research question by answering
the following research questions:
RQ1. What are unknowns from the perspective of
machine learning models, autonomous vehicles, and
engineers and analysts, respectively?
RQ2. What are similarities and dissimilarities
among the unknowns from the three perspectives?
In this paper, we address these research questions
by comparing the unknowns from each perspective
with others using a running example. We discuss
unknowns and their sub-categories (i.e., unknown
knowns and unknown unknowns) from the perspec-
tives of engineers and analysts, autonomous vehi-
cles, and machine learning models in an autonomous
vehicles. We discuss similarities and dissimilari-
ties among the unknowns from the three perspectives
and provide the details on how sub-categories of un-
knowns from one perspective can relate and differ to
sub-categories of unknowns from other perspectives.
The rest of the paper is organized as follows. Sec-
tion 2 discusses about the classification of knowns and
unknowns. Section 3 details about the importance of
unknowns with respect to ISO 21448 and UL 4600.
Section 4 discusses the types of unknowns with re-
spect to the three perspectives mentioned earlier with
an example. Section 5 provides insights and observa-
tions, and we finally conclude in Section 6.
2 KNOWNS AND UNKNOWNS
We can classify knowns and unknowns based on the
knowledge possessed by an intelligent agent/person
(or a group of intelligent agents/persons) such as an
engineer, analyst, autonomous system or organiza-
tion. We can consider a set U which represents an
entire universal knowledge. Considering each intelli-
gent agent, we can divide U into two subsets: a set K
denoting the knowledge possessed by the agent and
a set N denoting the rest of universal knowledge not
possessed by the agent. We can represent this mathe-
matically in Equation 1, where K ⊂ U and N ⊂ U.
U = K ∪ N (1)
We can further divide a set K into subsets. There
are many classifications of K proposed by the existing
literature (Smith, 2001; McCormick, 1997; De Jong
and Ferguson-Hessler, 1996) (e.g., explicit and tacit
knowledge (Smith, 2001); factual, procedural, con-
ceptual and meta-cognitive knowledge (McCormick,
1997)). In this paper, to focus on knowns and un-
knowns, we classify K into following subsets: 1) a set
D representing the direct knowledge, i.e., knowledge
which an intelligent agent can comprehend and/or an-
alyze easily after seeing an object, action or event,
and 2) a set I representing the indirect knowledge, i.e.,
knowledge inferred using the knowledge from D. We
can represent the relation between K, D, and I using
Equation 2.
K = D ∪ I (2)
To understand the relation between sets D and I,
let us consider ‘P (D)’ which is the power set of D
and a set V = {valid, invalid}. We can define a set
I as shown in Equation 3, where f(x) is a function
which provides the inference that can be generated
based on input x, g(f(x)) is a function which tells if
the inference output given from f(x) is a valid or in-
valid inference. Each element in I must have a map-
ping to only one of the elements in V, i.e., the intelli-
gent agent shall be able to derive valid or invalid in-
ferences from D. We mentioned element ‘x’ belongs
to P (D) because I cannot exist without D, i.e., with-
out the knowledge from a set D, we cannot infer the
corresponding knowledge in I.
I = { f (x) | x ∈ P (D) and g( f (x)) ∈ V } (3)
Based on the knowledge possessed and informa-
tion recognized by an intelligent agent, we can clas-
sify the knowns and unknowns similar to ones clas-
sified by the existing literature (Pickard et al., 2010;
Collins and Cruickshank, 2014; Jensen et al., 2017)
as shown in Figure 1. We also illustrate the relation
between these classifications and sets U, K, N, D and I
in Figure 2. The figure shows four widely recognized
classifications as follows.
Resolving Confusion of Unknowns in Autonomous Vehicles: Types and Perspectives
647