tures of the document.
We have run some experiments with a wide range
type of web pages and we observe the following facts:
• The method achieve a good performance with
documents that have several levels of granularity,
in other words, when the tree associated to the
document has many levels and the nodes have a
lot of branches, the calculation of the fractal di-
mension helps to get a summary with information
more diversified according to the document struc-
ture.
• With wrong structured web pages the method ob-
tains bad results since the calculation of fractal di-
mension doesn’t give information in those cases.
In conclusion, we have seen in our experiments
that traditional summarization extracts most of the
sentences from few chapters, fractal summarization
with D = 1 extracts the sentences distributively from
each section, and with our new approximation using
the fractal dimension of the document, the method
share out the sentences according to their content and
their position.
9 CONCLUSIONS
In this paper, we present an improvement to the frac-
tal summarization method. The propagation formula
have been modified according to the fractal view
method, and it uses the novel concept of fractal di-
mension of text documents presented in (Ruiz M. D.,
2006).
We have used this automatic summarization
method in web pages with a large content of text and
with a good structure as in figure 2, giving very good
results and showing the good performance of the pro-
posed method.
In the future, we are going to use a similarity mea-
sure taking into account the semantic of words giving
a more complete solution to the problem of summa-
rizing documents. Moreover, we are working about
the problem of summarizing the document according
the preferences of the user, giving more importance
to those sections that the user wants to spread out
using the fractal dimension. We also want to adapt our
method in the case of summarizing a group of docu-
ments with similar contents.
REFERENCES
Buyukkokten O., Garcia-Molina H., P. A. (2001). See-
ing the whole in parts: Text summarization for web
browsing on handheld devices. In 10
th
International
WWW Conference, Hong Kong.
Camastra F., V. A. (2002). Estimating the intrinsic dimen-
sion of data with a fractal-based method. IEEE Trans-
actions on Pattern Analysis and Machine Intelligence.
Daume III H., M. D. (2005). Induction of word and phrase
alignments for automatic document summarization.
Computational Linguistics, 31 (4):505–530.
Edmundson, H. P. (1969). New methods in automatic ex-
tracting. Journal of the Association for Computing
Machinery, 16 (2):264–285.
Goldstein J., Kantrowitx M., M. V. C. J. (1999). Summariz-
ing text documents: sentence selection and evaluation
metrics. pages 121–128.
Grasberger P., P. I. (1983). Measuring the strangeness of
strange attractors. pages 189–208.
Guerrini G., Mesiti M., S. I. (2006). An overview of similar-
ity measures for clustering XML documents. Chapter
in Athena Vakali and George Pallis (eds.).
Koike, H. (1995). Fractal views: a fractal-based method for
controlling information display. ACM Transactions on
Information Systems, 13 (3):305–323.
Kraft, R. (1995). Fractals and dimensions. HTTP-Protocol
at www.weihenstephan.de.
Liebovitch, L. S., T. T. (1989). A fast algorithm to deter-
mine fractal dimensions by box counting. Physics Let-
ters A, 141 (8,9):386–390.
Luhn, H. P. (195 8). The automatic creation of literature
abstracts. IBM Journal, pages 159–165.
Mandelbrot, B. B. (1986). Self-affine fractal sets. Pietronero
L. & Tosatti E. (eds.): Fractals in Physics, Amster-
dam.
Morris G., Kasper G. M., A. D. A. (1992). The effect and
limitation of automated text condensing on reading
comprehension performance. Information System Re-
search, pages 17–35.
Ruiz M. D., B. A. B. (2006). Fractal dimension of text
documents: Application in fractal summarization. In
IADIS International Conference WWW/Internet, vol-
ume 2, pages 349–353.
Yang C. C., Chen H., H. K. (2003a). Visualization of large
category map for internet browsing. Decision Support
Systems, 35:89–102.
Yang C. C., W. F. L. (2003b). Fractal summarization for mo-
bile devices to access large documents on the web. In
12
th
International WWW Conference, Budapest, Hun-
gary.
Yang C. C., W. F. L. (2003c). Fractal summarization: Sum-
marization based on fractal theory. In SIGIR 2003,
Toronto, Canada.
Yang C. C., W. F. L. (2004). A relevance feedback model
for fractal summarization. Lecture Notes in Computer
Science, 3334:368–377.
SUMMARIZING DOCUMENTS USING FRACTAL TECHNIQUES
33