6 Conclusions and Future Work
This paper describes another approach to linguistic data collecting. It is designed mainly
for collecting common sense propositions within Czech language. Czech is a minor lan-
guage thus we cannot expect millions of propositions within a few months like GWAP
[9]. We are strongly interested to players’ motivation.
Game history is available for each game, so we can identify the words that are hard
to explain (many passes, few propositions) or conversely the words that are easy to
explain (best scored guesses). Further analysis should answer the question why some
words are “easy” and others are not. We have to carefully choose the words for each
level so that players stay motivated.
The major contribution of this work is the method how to collect common sense
propositions in Czech. We have to evaluate the reliability of the collection over time.
We expect that a plausible number of common sense propositions will be collected over
time.
Acknowledgements
This work has been partly supported by the Ministry of Education of CR within the
Center of basic research LC536.
References
1. Corpus querying and grammar writing for the sketch engine. Retrieved March 2, 2010 from
http://trac.sketchengine.co.uk/wiki/SkE/CorpusQuerying.
2. Kilgarriff, A. and Rundell, M. (2002). Lexical profiling software and its lexicographic appli-
cations - a case study. In Proceedings of the Tenth EURALEX International Congress, pages
807–818.
3. Kilgarriff, A., Rychl
´
y, P., Smr
ˇ
z, P., and Tugwell, D. (2004). The sketch engine. In Proceed-
ings of the Eleventh EURALEX International Congress, pages 105–116.
4. Lenat, D. B. (1995). CYC: A large-scale investment in knowledge infrastructure. Commu-
nications of the ACM, 38(11):33–38.
5. Mueller, E. T. (2003). ThoughtTreasure: A natural language/commonsense platform. Re-
trieved November 9, 2009 from http://alumni.media.mit.edu/ mueller/papers/tt.html.
6. Smith, B. (1995). Formal ontology, common sense and cognitive science. International
Journal of Human-Computer Studies, pages 641–667.
7. Stork, D. G. (2001). Toward a computational theory of data acquisition and truthing. In
COLT ’01/EuroCOLT ’01: Proceedings of the 14th Annual Conference on Computational
Learning Theory and and 5th European Conference on Computational Learning Theory,
pages 194–207, London, UK. Springer-Verlag.
8. Stork, D. G. (2007). Open mind initiative — about. Retrieved October 28, 2007 from
http://openmind.org.
9. von Ahn, L. (2006). Games with a purpose. Computer, 39(6):92–94.
10. von Ahn, L., Kedia, M., and Blum, M. (2006). Verbosity: a game for collecting common-
sense facts. In CHI ’06: Proceedings of the SIGCHI conference on Human Factors in com-
puting systems, pages 75–78, New York, NY, USA. ACM.
52