known.
The very meaning of Unexpectedness, as stated
before, implies that it is relative to previously
existing knowledge. We use the term Relevance,
rather than relativity, to stress that the particular
frame of reference is a chosen field of interest, e.g.
material properties of metals, migratory birds, or
software design patterns.
These two concepts, Unexpectedness and
Relevance, are not just two faces of the same coin,
one relative to the other, but two really independent
functions. Indeed, they are separately quantified.
The previous knowledge exists whether or not
new knowledge is acquired. Thus, the chosen field
of interest can be characterized – say, by some kind
of metadata – before and independently of any
knowledge discovery event. In particular, the
Relevance of any piece of knowledge, say a search
result – be it novel or not – can be quantified relative
to the reference metadata.
New knowledge is not determined by the
previously known. In fact, one can acquire two
pieces of new knowledge, having quite distinct
contents, thus differently quantified, The
Unexpectedness of one of them could be larger than
the other one’s, even with the same previous
knowledge.
The first sense of interestingness is Relevance to
the field of interest. In this sense, an item is
interesting because it fits the field for which one has
a rather stable interest, either professional or
amateur. For instance, metals conduct heat, conduct
electricity and have a shiny appearance. Copper
concerns people interested in metals because it fits
the context metadata.
The second sense of interestingness is
Unexpectedness. In this second sense, an item is
interesting because it calls one’s attention by marked
deviation from the typical item in the context. For
instance, mercury is the only metal which is liquid at
room temperature.
But, mercury is really interesting when one
comes across it, because it is both a typical metal –
conducting and shiny – and has unexpected
properties – a liquid forming spherical drops.
Thus, we really have two functions, viz.
Unexpectedness and Relevance.
2.2 No More than Two Functions
We now argue that there are no more than two
functions related to interestingness.
Suppose that we acquire a new piece of
knowledge that by genetic modification, an
agricultural station has developed yellow tomatoes,
instead of the usual red ones. Yellowness certainly is
a function that can be quantified by colorimetric
methods and by accepted standards.
Should we include yellowness – or for that
matter any other intrinsic property of the new
knowledge – in the calculation of its interestingness?
The answer is negative.
Let us look at the time dependence of
interestingness, after such a knowledge discovery.
At the discovery time, that piece of knowledge has a
certain quantifiable amount of Unexpectedness.
The typical action after knowledge discovery is
to incorporate the new piece into the body of
knowledge available in the respective field of
interest. The Unexpectedness of that piece of
knowledge decreases dramatically. Interestingness
decreases accordingly.
The situation is totally different with yellowness.
A short time afterwards, the yellow tomatoes are still
yellow. The time dependence of yellowness has
nothing to do with one’s knowledge of it.
Obtaining again the same piece of knowledge – a
short time interval afterwards – will not be
considered a discovery anymore. This is analogous
to the registration of a new patent. Once a patent is
registered, its contents are not novel anymore and it
cannot be registered again.
Interestingness – through its Unexpectedness – is
not an intrinsic property of any knowledge piece. It
is a function of the discovery process.
2.3 The Formal Framework
The unifying paradigm for interestingness is
formally expressed by the following equation:
Interest = R
U
(2-1)
where Interest is an expression of interestingness, U
is an Unexpectedness function, R is a Relevance
function,
is the symbol for function composition,
meanimg either mathematical or computational
function composition.
In equation (2-1) one should first apply the
Unexpectedness function on the search results. On
the respective output, one then applies the Relevance
function, in this order. In its most general form,
function composition is not commutative, in analogy
to matrix multiplication.
One could conceivably think of other pairs of
functions for which rather U R is the appropriate
order. But as long as the operator between functions
is
the generic function composition, whatever is
the order it cannot be reversed.
In certain particular cases, composition may be
just multiplicative, and therefore commutative:
INTERESTINGNESS - A UNIFYING PARADIGM - Bipolar Function Composition
197