H-INDEX CALCULATION IN ENRON CORPUS

Anton Timofieiev, Václav Snásěl, Jiří Dvorský

Abstract

Development of modern technologies is expanded with communications possibilities. Electronic systems of communications make possible overcoming traditional barriers of communication, for example, such as distance. On their basis there are new types of communities which any more have no geographical restrictions. Increasing popularity of electronic communities among which projects LiveJournal, LiveInternet, and also projects popular in Russian-speaking part Internet Mamba, MirTesen, VKontakte, Odnoklassniki, etc., makes as never earlier actual questions on working out of techniques of research of similar social networks. However communications of members of such communities only by means of electronic communications create difficulties at definition of such communities. In this paper we describe method for measurement of the importance of particular people within the community. The method is based on h-index calculation. Approach is demonstrated on Enron corpus.

References

  1. Berry, M. W., Browne M. (2005). Email Surveillance Using Nonnegative Matrix Factorization. Proceedings of Workshop on Link Analysis, Counterterrorism and Security, SIAM International Conference on Data Mining 2005. Newport Beach, CA, 45-54.
  2. Chapanond, A., Krishnamoorthy, M. S., Yener, B. (2005). Graph Theoretic and Spectral Analysis of Enron Email Data, Proceedings of Workshop on Link Analysis, Counterterrorism and Security, SIAM International Conference on Data Mining 2005. Newport Beach, CA, 15-22.
  3. Grieve, T. (2003): The Decline and Fall of the Enron Empire. Slate. http://www.salon.com/news/ feature/2003/10/14/enron/index np.html. (2003, October 14)
  4. Hirsch J. E. (2005). An index to quantify an individual's scientific research output. Proc.Nat.Acad.Sci.
  5. Keila, P.S., D.B. Skillicorn (2005). Structure in the Enron Email Dataset. Proceedings of Workshop on Link Analysis, Counterterrorism and Security, SIAM International Conference on Data Mining 2005. Newport Beach, CA, April 2005, 55-64.
  6. Klimt B., Yang Y. (2004) Introducing the Enron Corpus, Proceedings of First Conference on Email and AntiSpam (CEAS).
  7. McCallum, A., Corrada-Emmanuel, A., Wang, X. (2005). The Author-Recipient-Topic Model for Topic and Role Discovery in Social Networks, with Application to Enron and Academic Email. Proceedings of Workshop on Link Analysis, Counterterrorism and Security, SIAM International Conference on Data Mining.Newport Beach, CA, April 2005, 33-44.
  8. McLean B., Elkind P. (20030. The Smartest Guys in the Room: The Amazing Rise and Scandalous Fall of Enron. Portfolio.
  9. Newman M. E. J. (2000): The Structure and Function of Complex Networks. SIAM Review. vol. 45 (2003), 167-256.
  10. Ravasz E., Barabási A.-L. (2003): Hierarchical organization in complex networks. Phys. Rev. E, 67 (2003), art. no. 026112.
  11. Tutte W. T. (1984). Graph Theory, Encyclopedia of mathematics and its applications, Addison Wesley, volume 21.
Download


Paper Citation


in Harvard Style

Timofieiev A., Snásěl V. and Dvorský J. (2008). H-INDEX CALCULATION IN ENRON CORPUS . In Proceedings of the Third International Conference on Software and Data Technologies - Volume 3: ICSOFT, ISBN 978-989-8111-53-1, pages 206-211. DOI: 10.5220/0001891802060211


in Bibtex Style

@conference{icsoft08,
author={Anton Timofieiev and Václav Snásěl and Jiří Dvorský},
title={H-INDEX CALCULATION IN ENRON CORPUS},
booktitle={Proceedings of the Third International Conference on Software and Data Technologies - Volume 3: ICSOFT,},
year={2008},
pages={206-211},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001891802060211},
isbn={978-989-8111-53-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Software and Data Technologies - Volume 3: ICSOFT,
TI - H-INDEX CALCULATION IN ENRON CORPUS
SN - 978-989-8111-53-1
AU - Timofieiev A.
AU - Snásěl V.
AU - Dvorský J.
PY - 2008
SP - 206
EP - 211
DO - 10.5220/0001891802060211