
4.1 Question Analysis and Processing – Generating the
Answer Template
The goal of the question analysis phase is to determine the focus of the question and
the expected type of answer for it. We then generate an answer template that
represents the form of the answer corresponding to a question, with a label “X”
introduced at the “answer slot” (the position where the answer key-word or key-
phrase needs to fit on). “X” represents the entity to be searched for as an answer. In
this work we use a set of transformational rules to arrive at the answer template.
These rules have been categorized on the basis of different question types, like the
“wh” questions, the “yes/no” questions, the “how” questions etc. The rules in each
category check different features of the question. In particular the rule may detect the
presence of a particular word or phrase occurrence, like words of a given part of
speech or those of a particular semantic category. The rules introduce the label “X” at
the location of the keyword or the key phrase in the answer pattern. For example, the
rule, HOW:AUX:1:V(PPL) > 1:AUX:V(PPL):BY:X, works upon a question such as
How is water contaminated? which is transformed to its corresponding answer
template Water is contaminated by X.
Most of these rules are language dependent. We have used a rule base of around
50 transformation rules for English and around 20 rules for Hindi, which offers a
relatively easier answer type identification and template generation.
An example could be – 1:kha^M:V(pre) > 1:X
:maoM:V(pre) which matches
questions like, “jala kha^M bahta hO?”(Where does water flow?) and have a easy
transformation to the answer “jala naidyaaoM maoM bahta hO”, while its English
counterpart will have to follow from a rule like where:does:1:V(base):2 >
1:V(pres):Prep(pos):X:2 to produce “water flows in rivers.” which requires a verb-
form transformation also.
There will be cases where more than one transformation rule applies for a given
question. In such cases, we let all the possible answer templates pass on to the next
phase. This is very common in case of complex query structures, for example the
“how” queries in English and the “kOsao” queries in Hindi. No answer selection is
done at this stage.
Thus the queries in one language are converted into the corresponding answer
templates in the same language and passed on to the next phase – the semantic parser.
4.2 From the Natural Language (NL) Answer Template to
the NL semantic predicates
The answer template is converted into a pseudo UNL representation by a multilingual
HPSG Parser (Sharma et al, 2002) which operates on lexicons specifying the
semantic selection (as against the categorical selection) properties of phrasal heads.
We have build separate semantic lexicons for English and Hindi with shallow
syntactic rules. Semantic relation attributes are mostly used in the lexicon instead of
syntactic subcategory features since the parsed answer form needs to be unified with a
database that is in the UNL semantic predicate format, i.e. the UNL Document. The
UNL structure uses relations that are defined in terms of semantic features such as
agency, place, etc. Therefore, these relations need to be identified in the parsed
answer form for structure matching to be possible. To take examples of a lexical
120