Table 1: Examples of common natural queries.
1. Find all documents of Peter employee
2. Find all documents are pdf format
3. Show me some reports of sales department contain
the keywords "financial statement"
4. Documents of accounting department were uploaded
in 2010
5. Search any documents of sales department are "excel
2003" and pdf format
6. I want to find any documents that I can download
7. Look up the documents are "word 2007" format in
my Announcements folder
8. Look up all of documents that people has staff role
can download
9. I want to read the software instruction manuals of
technical department
10. Find the folders were created on 06-18-2010
11. Search all documents are less than "1 Mb" and are
word or pdf format
12. Which the documents were created in "February
2010" that the employees of financial department can
be deleted and viewed?
13. Which the documents are pdf format that Peter
shares with me in Public folder?
14. The employees have the manageable role can
download or delete the documents
15. List all of the documents were uploaded from 01-
20-2010 to 09-20-2010 and are excel format
16. Search any documents in Report folders have the
size from "500 Kb" to "1.5 Mb"
17. James was sent the announcements to sales
department in October?
18. I want to download the documents of sales,
accounting and financial department
19. List the folders I have created after "Jan 2010"
20. Show me the documents that contain the keywords
"computer science" belong to "Plans" folder
21. Find all of the folders have the size are less than "5
Mb" and are created in September 2010
22. Find the documents of technological department
were uploaded at 11-16-2010 contain the keyword
"ontology"
23. The head of sales department shares the documents
between everyone works for accounting department
24. Which the documents technological department can
be download from "May 2009" to "May 2010"
25. Which the documents employees of financial
department can be view or share in "Financial
Department's Public Documents" folder
26. Find any documents are .doc or .docx format and its
sizes are more than "1.2 Mb"
27. Which the folders sales department was created
from "January 2010" to "December 2010" and more
than "8 Mb"
28. David employee uploaded the documents contains
the keywords "conceptual graph"
29. Which the documents of sales department that the
head of financial department can be view?
30. The documents are docx format, upload from 01-
10-2010 to 10-10-2010 and Peter share it with me
Table 2: Performance comparison on retrieval.
Techniques Recall Precision F-measure
Vector-space-
model (VSM)
78% 87% 82%
VSM + Multi-
clustering
92% 94% 93%
CG-based
Queries
98% 93% 95%
the retrieval of information in document
management system. The initial experimental results
have shown that our proposed approach is capable of
handling effectively most of the typical search
requests in natural language. By avoiding using a
fixed grammar and making use of a domain
ontology, our approach can handle the problem of
imprecise and incomplete information. In addition,
minor grammatical errors, which may probably
occur in queries submitted casually by users in many
practical situations, can also be tolerated reasonably.
ACKNOWLEDGEMENTS
This research project is funded by University of
Nguyen Tat Thanh, Ho Chi Minh City, Vietnam We
are also grateful for the technical helps of the EViet
software company in terms of hosting services and
experimental data provided.
REFERENCES
Androutsopoulos, I., Ritchie, G. D. and Thanisch, P.
(1995). Natural Language Interfaces to Databases: An
Introduction. Journal of Natural Language
Engineering, 1(1), 29-81.
Berners-Lee, T., Hendler, J. and Lassila, O. (2001).
The Semantic Web. Scientific American. Retrieved
May 17, 2001 from
http://www.scientificamerican.com/article.cfm?id=the-
semantic-web
Cimiano, P., Haase, P., Sure, Y., Völker, J. and Wang, Y.
(2006). Question answering on top of the BT digital
library. ACM Publisher, In Proceedings of the 15th
International Conference on World Wide Web,
861-862. doi:10.1145/1135777.1135915
Frost, A. R. and Fortier R. J. (2007). An Efficient
Denotational Semantics for Natural Language
Database Queries. Springer-Verlag, In Proceedings of
the 12th International Conference on Applications of
Natural Language to Information, 4592, 12-24.
doi:10.1007/978-3-540-73351-5_2
Guarino, N. and Giaretta, P. (1995). Ontologies and
Knowledge Bases - Towards a Terminological
Clarification. IOS Press, Toward Very Large
SEMANTIC-LITE RETRIEVAL ON IMPRECISE AND INCOMPLETE NATURAL QUERIES USING CONCEPTUAL
GRAPHS
267