Term-frequency Inverse Document Frequency for the Assessment of Similarity in Central and State Climate Change Programs: An Example for Mexico

Iván Paz-Ortiz, Diego García-Olano, Carlos Gay-García

2015

Abstract

In the present work we present a preliminary approach intended for the assessment of the development of the climate change programs. Particularly we are interested in policies that are develop top-to-bottom by following specific central guidelines. To this end, the numerical statistic “term frequency-inverse document frequency” is used to compare the similarity between the action plans on climate change at national and state level in the case of Mexico. The results allow us to construct a similarity matrix to extract information about how these plans capture local level characteristics and their degree of attachment to the central policy.

References

  1. Edenhofer, Ottmar; Kowarsch, Martin (2015) Cartography of pathways: A new model for environmental policy assessment. Environmental Science & Policy Volume 51, August 2015, Pages 56-64.
  2. Akhtar, M. and Sidek, O. (2013). An intelligent adaptive arbiter for maximum CPU utilization, fair bandwidth allocation and low latency. IETE Journal of Research, 59(1):48-54.
  3. Dedecius, K. and Ettler, P. (2014). Hierarchical modelling of industrial system reliability with probabilistic logic. In Proceedings of the 11th international conference on informatics in control, automation and robotics (ICINCO), Vienna, Austria.
  4. Marsi E, Ozturk P, Aamot E, Sizov G, Ardelan M. (2011) Towards Text Mining in Climate Science: Extraction and their Relations. LREC-Language Resources and Evaluation Conference, BioTXM Workshop, At Reykjavik, Iceland.
  5. Luhn, Hans Peter (1957). "A Statistical Approach to Mechanized Encoding and Searching of Literary Information" (PDF). IBM Journal of research and development (IBM) 1 (4): 315. doi:10.1147/rd.14.0309. Retrieved 2 March 2015.
  6. Farahnakian, F., Liljeberg, P., and Plosila, J. (2013). Lircup: Linear regression based cpu usage prediction algorithm for live migration of virtual machines in data centers. In Software Engineering and Advanced Applications (SEAA), 2013 39th EUROMICRO Conference on, pages 357-364.
  7. Isermann, R. (2011). Fault Diagnosis Applications: Model Based Condition Monitoring, Actuators, Drives, Machinery, Plants, Sensors, and Fault-tolerant Systems. Springer Verlag.
  8. Manning, C. D.; Raghavan, P.; Schutze, H. (2008). "Scoring, term weighting, and the vector space model". Introduction to Information Retrieval (PDF). p. 100. doi:10.1017/CBO9780511809071.007. ISBN 9780511809071. edit.
  9. Pino-Díaz J, Chiadmi-García L, Cebrián-Menchero D, and Bailón-Moreno R. (2014) Text Mining of Scientific Big Data for Decision Making in Conservation of Mediterranean Marine Biodiversity. Journées d'Intelligence Économique - BIG DATA MININ.
  10. Josang, A. (2008). Conditional reasoning with subjective logic. Journal of Multiple-Valued Logic and Soft Computing, 15(1):5-38.
  11. KárnÉ, M., Böhm, J., Guy, T. V., Jirsa, L., Nagy, I., Nedoma, P., and Tesa?r, L. (2006). Optimized Bayesian Dynamic Advising: Theory and Algorithms. Springer.
  12. Rajaraman, A.; Ullman, J. D. (2011). "Data Mining". Mining of Massive Datasets (PDF). pp. 1-17. ISBN 9781139058452.
  13. Robertson, S. (2004). "Understanding inverse document frequency: On theoretical arguments for IDF". Journal of Documentation 60 (5): 503-520. doi:10.1108/00220410410560582.
  14. Wang, W., Huang, X., Qin, X., Zhang, W., Wei, J., and Zhong, H. (2012). Application-level CPU consumption estimation: Towards performance isolation of multi-tenancy web applications. In 2012 IEEE 5th International Conference on Cloud Computing, pages 439-446.
  15. Shanmuganathan, S. (2013). “Data/Text mining techniques in modeling climate change effects on crops.” The International Conference on Computational and Network Technologies 2013 (ICCNT), University of South Australia, Adelaide.
  16. Spärck Jones, K. (1972). "A Statistical Interpretation of Term Specificity and Its Application in Retrieval" (PDF). Journal of Documentation 28: 11-21. doi:10.1108/eb026526.
  17. "Unstructured Data and the 80 Percent Rule". Breakthrough Analysis. (http://breakthroughanalysis. com/2008/08/ 01/unstructured-data-and-the-80-percent-rule/) Retrieved 2015-02-23.
  18. Xiaoran A, Auroop R. Ganguly, Yi Fang, (2014) Tracking Climate Change Opinions from Twitter Data. Available at (http://www.cse.scu.edu/yfang/climate-fang.pdf).
Download


Paper Citation


in Harvard Style

Paz-Ortiz I., García-Olano D. and Gay-García C. (2015). Term-frequency Inverse Document Frequency for the Assessment of Similarity in Central and State Climate Change Programs: An Example for Mexico . In Proceedings of the 5th International Conference on Simulation and Modeling Methodologies, Technologies and Applications - Volume 1: MSCCES, (SIMULTECH 2015) ISBN 978-989-758-120-5, pages 542-546


in Bibtex Style

@conference{mscces15,
author={Iván Paz-Ortiz and Diego García-Olano and Carlos Gay-García},
title={Term-frequency Inverse Document Frequency for the Assessment of Similarity in Central and State Climate Change Programs: An Example for Mexico},
booktitle={Proceedings of the 5th International Conference on Simulation and Modeling Methodologies, Technologies and Applications - Volume 1: MSCCES, (SIMULTECH 2015)},
year={2015},
pages={542-546},
publisher={SciTePress},
organization={INSTICC},
doi={},
isbn={978-989-758-120-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 5th International Conference on Simulation and Modeling Methodologies, Technologies and Applications - Volume 1: MSCCES, (SIMULTECH 2015)
TI - Term-frequency Inverse Document Frequency for the Assessment of Similarity in Central and State Climate Change Programs: An Example for Mexico
SN - 978-989-758-120-5
AU - Paz-Ortiz I.
AU - García-Olano D.
AU - Gay-García C.
PY - 2015
SP - 542
EP - 546
DO -