Penn Tree Bank, British National Corpus or the web.
The later works on noun compounds have fol-
lowed on from either (Levi, 1978) or (Warren, 1978)
with some of them coming up with a slightly differ-
ent variation while others have defined a finer grained
set of relations dictated by the data sets used for the
study. For example, (Tratz and Hovy, 2010) reports
a set of 43 relations grouped into 10 upper level cat-
egories. Most of the relations from different studies
can be mapped to an equivalent relation in other stud-
ies.
For this study we chose the set of relations pro-
posed by in (Levi, 1978) for two reasons. Firstly, our
analysis of corpus for anaphor-antecedent relations
seemed to map better to Levi’s set of nine relations
for noun compounds and secondly more of these rela-
tions can be computationally determined from exist-
ing lexicons such as WordNet and the Web. In terms
of natural language processing, a linguistic theory is
only useful if it can be reasonable implemented in a
computational system.
(Levi, 1978) proposes that compound NPs are de-
rived from underlying clause or complement struc-
tures by the two processes of predicate deletion and
normalization. Her work is based on similar frame-
work as (Lees, 1966) except Less’ transformational
process is based on verb classifications. Levi pro-
poses that, in the case of normalization, ‘the underly-
ing predicate survives overtly in the head noun, with
the modifier deriving from either the subject or the
object of the underlying S’, giving rise to subjective
(eg. industrial production) or objective (eg. heart
massage) normalization. For the case of predicate
deletion, Levi maintains that the number or deletable
predicates is limited to only nine primitive relations.
They are CAUSE, HAVE, MAKE, USE, BE, IN,
FROM, ABOUT and FOR. In the next section we ar-
gue that these relations also describe anaphoric use of
NPs.
3 ANAPHORA RESOLUTION
FRAMEWORK
In the Introduction we stated that anaphora interpreta-
tion and noun compound generation are two indicants
of the same underlying relational framework between
entities. Hence, a framework describing compound
noun generation has to apply to anaphora usage as
well. In the proposed framework we extend the re-
lations proposed for compound noun generation by
predicate deletion from (Levi, 1978) for interpretation
of NP as well as pronominal anaphora.
An indirect reference such as window referring to
house and diesel referring to truck is based on the
predicates “house has windows” and “a truck uses
diesel”. In the case of noun compound generation,
the predicate is deleted and the two entities are jux-
taposed to form the noun compounds house window
and diesel truck. For interpretation of the compound
noun the consumer is expected to reconstruct the rela-
tion between the modifier and the head noun ((Down-
ing, 1977; Levi, 1978)). We propose that the com-
pound noun generation process is very similar to NP
anaphora, except in the latter case the modifier is not
necessarily bound to the head noun as part of a noun
compound. That is, it may exist in another clause,
however the same relation is still expected to be re-
constructed for a full interpretation of the anaphor.
Hence, for the example for the predicate “house has
window”, we could have the full NP, house window
produced by predicate deletion. However in addition,
the same predicate coulc also be expressed anaphori-
cally as in the following example:
John bought a house in Glen Eden.
The windows are wooden.
In the example above the related entities from the
predicate “house has windows” are separated into two
different sentences, however the consumer is still ex-
pected to reconstruct the semantic relation as in the
case of noun compound generation. Hence, with the
proposed anaphora interpretation framework, the only
difference between noun compound generation and
anaphoric use of NPs is that in the former the two
nouns are used together as a compound noun while
in the latter the nouns are used separately as anaphor-
antecedent pairs, however they are both governed by
the same semantic framework.
Semantic relations between certain entities exit
by default and can be assumed as part of the lexical
knowledge by a producer. For example, the HAVE
relation between car and tyre is part of lexicon so
the noun compound car tyre and the noun tyre used
anaphorically to refer to car is readily understood.
However, in addition, a semantic relation can also be
formed between two between two entities which are
not usually related by the relation. In this case the re-
lation is explicitly expressed by a predicate as context
before the predicate is deleted to form a noun com-
pound or used anaphorically. For example, after spec-
ifying the relation “the box has tyres”, the noun tyres
can be used to indirectly refer to box in the same way
as the reference of tyres to car. However, the former
can only be used in the context of the discourse in
which the relation was expressed. This corresponds
to Downing’s (Downing, 1977) fortuitous relations.
We distinguish between these two type of relations
as persistent or contextual. Persistent relations are
ICAART 2012 - International Conference on Agents and Artificial Intelligence
94