retrieve related sentences and images from the web
and generate colorful animations.
The preliminary results show that our approach
can be quite effective on the related task. This study
can be considered as a proof of concept for the idea
and we are encouraged to further explore several is-
sues for future work.
As technical improvements, we would like to
exploit resources using the same standard for the
phoneme representation, so that we do not have to
apply a mapping mechanism for the calculation of
pronunciation similarity. As an other improvement,
we want to take into consideration phonemes instead
of letters, and the information regarding syllables and
stresses, and investigate whether the performance can
be improved in this way. In addition, we can inves-
tigate the effect of using interval values for relaxed
matches during the calculation of pronunciation simi-
larity. We are aware that sentences retrieved from the
web with our current technique are not reliable all the
time. For instance, they might contain inappropriate
content for students. However, we are assured that
reliability is a crucial issue in an e-learning environ-
ment especially for children. As a possible solution,
we would like to explore the impact of conducting a
domain control with LSA.
Next, for handling the cases in which no sentences
containing the keywords and the translation are re-
trieved, we plan to explore the effect of applying lexi-
cal substitution on sentences containing one or two of
the query words. However, this is a challenging prob-
lem since we have to make sure that the new sentence
conforms to the grammar rules.
Regarding images, we would like to improve our
method to discriminate images with different senses
for the same query word. Since LSA uses a low-
dimensional representation for terms, terms with sim-
ilar meanings are close in the low-dimensional space,
and the representation of meaning is with better qual-
ity in comparison to a traditional vector space method.
Accordingly, we can handle polysemy by using the
synset information in the query to disambiguate the
text information of images in the low-dimensional
space. Second, we plan to conduct experiments on
the effect of using different texts related to the image
such as the title, content or other pieces of text occur-
ring in the page containing the image.
To evaluate the performance of the overall system,
we are going to convert our prototype to an online
service and collect user feedback for further improve-
ments. More specifically, we will ask users whether
the memory tips have been useful for the memoriza-
tion so that we can find out which modules have a
bigger impact on the learning process. Additionally,
users will be able to rate the tips. For instance, they
will be able to state whether the selected keywords
are appropriate, or the sentence is meaningful and/or
humorous, or the displayed image conveys the mean-
ing of the target word. In addition to the online feed-
back, we are also considering to conduct more spe-
cific experiments in which we will host subjects in a
closed environment. We will provide a subset of these
subjects with memorization tips for a set of words in
a language which they were not exposed to before,
while traditional methods will be used to teach the
same vocabulary to the rest. At the end of this pro-
cess, we will make a vocabulary test to investigate the
impact of our method and make a comparison with
traditional methods.
REFERENCES
Berry, M. (1992). Large-scale sparse singular value compu-
tations. International Journal of Supercomputer Ap-
plications, 6(1):13–49.
BNC-Consortium (2000). British national corpus.
http://www.hcu.ox.ac.uk/BNC/.
Borman, A., Mihalcea, R., and Tarau, P. (2005). Pic-net:
Pictorial representations for illustrated semantic net-
works. In Proceedings of the AAAI Spring Sympo-
sium on Knowledge Collection from Volunteer Con-
tributors.
Budanitsky, A. and Hirst, G. (2006). Evaluating WordNet-
based measures of lexical semantic relatedness. Com-
putational Linguistics, 32(1):13–47.
CALC-09 (2009). Workshop on computational approaches
to linguistic creativity (calc-09). http:// aclweb.org/
aclwiki/index.php?title=CALC-09.
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer,
T., and Harshman, R. (1990). Indexing by latent se-
mantic analysis. Journal of the American Society for
Information Science, 41(6):391–407.
Duyar, M. S. (2001). Accelerated Word Memory Power.
Kardelen Ofset Ltd. Sti.
Fujii, A. and Ishikawa, T. (2005). Toward the automatic
compilation of multimedia encyclopedias: Associat-
ing images with term descriptions on the web. In Pro-
ceedings of the 2005 IEEE/WIC/ACM International
Conference on Web Intelligence, WI ’05, pages 536–
542, Washington, DC, USA. IEEE Computer Society.
Fujita, S. and Nagata, M. (2010). Enriching dictionaries
with images from the internet - targeting wikipedia
and a japanese semantic lexicon: Lexeed -. In Pro-
ceedings of the 23rd International Conference on
Computational Linguistics (Coling 2010), pages 331–
339, Beijing, China. Coling 2010 Organizing Com-
mittee.
Hayashi, Y., Bora, S. A., and Nagata, M. (2009). Utiliz-
ing images for assisting cross-language information
retrieval on the web. In Proceedings of the 2009
CSEDU 2011 - 3rd International Conference on Computer Supported Education
86