documents have common features.
− Criterion is the parameter name that measures
the quality of documents or data sources.
− Document is the file that contains information
of interest for the user. It is characterized by an
identifier, a title, a size, a size unit, a city,
keywords, a URL, a country, a language, a date,
a last updating date, a format, authors and an
abstract.
− Data Source is a document container described
by identifier, name, country, city and URL.
− Historical is the user-search historical whose
attributes are date, keyword phrase, search type
and query.
− Data Source instance is the data source
category.
− User described by login, password, user type
and e-mail.
5 CONCLUSIONS
We have addressed here a solution for the problem
of selecting web data sources by means of SQLfi
use. This system has fuzzy querying capabilities that
are useful for expressing quality criteria and
selecting data sources with discrimination between
them. Expression of such criteria would be very
difficult with traditional querying languages.
We have presented a web data source selection
tool based on SQLf. It is a complete and friendly
system that facilitates the execution of fuzzy queries
on a web data source and documents catalog. This
tool allows storing the preferences of each user
enlarging the degree of satisfaction with the results
obtained. This system could be improved or
expanded; nevertheless we are sure that its current
state already represents a great aid for the end user.
We have not deal here with the performance
issue of fuzzy querying and information retrieval
systems. Nevertheless the querying system uses an
evaluation strategy based on relationship between
fuzzy sets and crisp sets. This mechanism is known
as the derivation principle and has shown to posses
the best performance between proposed evaluation
mechanisms. It is matter of further work the whole
implementation of the data source selection system
and its performance study.
At present time, we only give support to finding
data sources that are registered in existing relational
catalogs. Therefore, it is also necessary to extend the
system in order to support discovering data sources
that publish Web documents. A step more would be
to integrate this system with a Web tool for
discovering information. Thereafter it would be
possible to define and implement an intelligent Web
querying tool that automatically optimize user
request with available data sources on the Web.
ACKNOWLEDGEMENTS
We would like to acknowledge the contribution of
our development team conformed by the Computer
Engineering students Fabio Canache, Irwing
Herrera, Felix González, Giuseppe Pellegrino, Jesús
Graterol and Denise Videtta. This work has been
made with the subsidy of FONACIT via the project
G-2005000278. Finally we acknowledge Jesus
Christ, our all time helper, who gives us the
inspiration for working, creating and living.
REFERENCES
ANSI X3, 1992. Database Language SQL, 135-1992,
American National Standards Institute, New York.
Bosc, P. and Pivert, O., 1995. SQLf: A Relational
Database Language for Fuzzy Querying. IEEE
Transactions on Fuzzy Systems, 3.
Eduardo, J., Goncalves, M and Tineo, L., 2004. A Fuzzy
Querying System based on SQLf2 and SQLf3.
Proceedings of CLEI.
Galindo J., Urrutia A., Piattini M., (2006). Fuzzy
Databases: Modeling, Design and Implementation.
Idea Group Publishing Hershey.
Goncalves, M and Tineo, L., 2001. SQLf Flexible
Querying Language Extension by means of the norm
SQL2. In Proceedings of FUZZ-IEEE, 1.
Goncalves, M and Tineo, L., 2001. SQLf3: An extension
of SQLf with SQL3 features. In Proceedings of FUZZ-
IEEE, 3.
Goncalves, M and Tineo, L., 2005. WWW Data Source
Selection with SQLfi. In Proceedings of FUZZ-IEEE.
Konopnicki, D. and Shmueli, O., 1995. W3QS: A query
system for the World Wide Web”. In Proceedings of
VLDB, 54-65.
Loo, G. and Lee, K., 2000. An Interface to Databases for
Flexible Query Answering: A Fuzzy-Set Approach”.
In Proceedings of DEXA, 654-663.
Melton, J., 1992. ISO/ANSI Working Draft: Database
Language SQL (SQL3), X3H2-93-091/ISO DBL
YOK-003.
Mendelzon, A., Mihaila, G. and Milo, T., 1997. Querying
the World Wide Web. Journal of Digital Libraries, 68-
88.
Mihaila, G., Raschid, L. and Tomasic, A., 1998. Equal
time for data on the Internet with WebSemantics. In
Proceedings of EDBT, 87-101.
ICEIS 2007 - International Conference on Enterprise Information Systems
452