capable of recovering the context of tags and drive
emergent semantics, using them to organically build
ontologies. The generated semantic hierarchy is used
to enforce structure and semantics in collaborative
tagging. That approach was adopted in practice in the
Online Open Publishing System.
Our work also goes toward filling gaps in Web Sci-
ence research, in the area of designing and develop-
ing infrastructures for collaboration on the Web. The
term Web Science was first introduced by Berners-
Lee. It has since given origin to large international
research efforts, including The Web Science Trust
(http://webscience.org/) or the Brazilian Institute for
Web Science Research (http://webscience.org.br).
Formally, research in Web Science is concerned
with the Web as the primary object of interest. In
our work, this means among others concentrating on
organographs as a means of sharing and exchanging
knowledge. Furthermore, once document organiza-
tions are shared, the researchers can reuse each other’s
data – which is the essence of scientific collaboration
– without having to concern themselves with estab-
lishing standards for document organization.
5 CONCLUSIONS
This paper presented a conceptual framework to au-
tomate information organization and support collab-
orative work on the Web. Our core proposal is the
organograph – a persistent and shareable organiza-
tion that emerges from automatic feature-extraction,
classification and clustering. Though our discussion
was centered in document sharing and reuse, our end-
users are scientists that work cooperatively in some
eScience domain. In such a context, documents refer
not only to scientific papers and reports, but also data
files containing experimental data, or images. Under
this perspective, our proposal can be extended to other
domains in which cooperation on the Web is required.
At the same time, we need to concern ourselves
with the Web Science issue of visibility. It is not
enough to share organographs, if we also want the
documents to be visible beyond a research group. In-
deed, the validation of scientific experiments requires
reproducibility – and this means that documents asso-
ciated with an eScience project must all, at the end,
become available. This means that we must also con-
sider some sort of Publication Directory, in which a
group’s (or a project’s) organographs can be accessed
by all interested in accessing the main results of that
group. This kind of solution is part of our ongoing
research. We will validate this concept using real ap-
plications that run in the Web and that have been im-
plemented by our research group, in distinct scientific
domains.
ACKNOWLEDGEMENTS
This work was supported by Fapesp, CNPq, CAPES
and INCT in Web Science (CNPq 557.128/2009-9).
REFERENCES
Bloehdorn, S., Cimiano, P., and Hotho, A. (2005). Learn-
ing ontologies to improve text clustering and classi-
fication. In From Data and Information Analysis to
Knowledge Engineering: Proceedings of the 29th An-
nual Conference of the German Classification Society.
Bonifacio, M., Bouquet, P., and Manzardo, A. (2000). A
distributed intelligence paradigm for knowledge man-
agement. In AAAI Spring Symposium Series 2000 on
Bringing Knowledge to Business Processes.
Chen, L. and Roberts, C. (2007). Semantic tagging for
large-scale content management. In WI ’07: Proceed-
ings of the IEEE/WIC/ACM International Conference
on Web Intelligence.
Dakka, W. and Ipeirotis, P. G. (2008). Automatic extrac-
tion of useful facet hierarchies from text databases. In
ICDE, pages 466–475.
Dakka, W., Ipeirotis, P. G., and Wood, K. R. (2007). Faceted
browsing over large databases of text-annotated ob-
jects. In ICDE, pages 1489–1490.
Du, Y. and Chen, L. (2007). Using personalized knowledge
portal for information and knowledge integration and
sharing. In SKG ’07: Proceedings of the Third In-
ternational Conference on Semantics, Knowledge and
Grid.
Giannakidou, E., Kompatsiaris, I., and Vakali, A. (2008).
Semsoc: Semantic, social and content-based cluster-
ing in multimedia collaborative tagging systems. In
ICSC ’08: Proceedings of the 2008 IEEE Interna-
tional Conference on Semantic Computing.
Gordon, A. (1996). Hierarchical classification. Clustering
and classification.
Jackson, P. and Moulinier, I. (2002). Natural language pro-
cessing for online applications: text retrieval, extrac-
tion, and categorization. John Benjamins Publishing
Company.
Lacher, M. and Groh, G. (2001). Facilitating the exchange
of explicit knowledge through ontology mappings. In
Proceedings of the Fourteenth International Florida
Artificial Intelligence Research Society Conference.
Senra, R. D. A. and Medeiros, C. B. (2009). SciFrame: a
conceptual framework to describe data sharing in e-
Science. SBBD. III e-Science Workshop.
Uschold, M. and Gruninger, M. (1996). Ontologies: Princi-
ples, methods and applications. The Knowledge Engi-
neering Review, 11(02).
Weigend, A. S., Wiener, E. D., and Pedersen, J. O. (1999).
Exploiting hierarchy in text categorization. Inf. Retr.,
1(3).
WEBIST 2011 - 7th International Conference on Web Information Systems and Technologies
588