Grammatical Evolution Association Rule Mining to Detect Gene-Gene Interaction

Aicha Boutorh, Ahmed Guessoum

Abstract

An important goal of human genetics is to identify DNA sequence variations that increase or decrease specific disease susceptibility. Complex interactions among genes and environmental factors are known to play a role in common human disease etiology. Methods for association rule mining (ARM) are highly successful; especially that they produce rules which are easily interpretable. This has made them widely used in various domains. During the different stages of the knowledge discovery process, several problems are faced. It turns out that, the search characteristics of Evolutionary Algorithms make them suited to solve this kind of problems. In this study, we introduce GEARM, a novel approach for discovering association rules using Grammatical Evolution. We present the approach and evaluate it on simulated data that represents epistasis models. We show that this method improves the performance of gene-gene interaction detection.

References

  1. Agrawal, R. and Srikant, R. (1994). Fast algorithms for mining association rules in large databases. 20th International Conference on Very Large Data Bases, Santiago, Chile.Morgan Kaufmann ISBN 1-55860-153-8.
  2. Creighton, C. and Hanash, S. (2003). Mining gene expression databases for association rules. Bioinformatics 19(1): 79-86.
  3. Espejo, P., Ventura, S., and Herrera, F. (2010). A survey on the application of genetic programming to classification. IEEE Transactions on Systems, Man, and Cybernetics, vol. 40, no. 2, pp. 121-144.
  4. He, H., Oetting, W., Brott, M., and Basu, S. (2009). Power of multifactor dimensionality reduction and penalized logistic regression for detecting gene-gene interaction in a case-control study. BMC Med Genet, 10:127.
  5. Holzinger, E., Buchanan, C., Dudek, S., Torstenson, E., Turner, S., and Ritchie, M. (2010). Initialization parameter sweep in athena: Optimizing neural networks for detecting gene interactions in the presence of small main effects. Genetic and Evolutionary Computation Conference, 12:203-210.
  6. Koo, C., Liew, M., Mohamad, M., and Salleh, A. (2013). A review for detecting gene-gene interactions using machine learning methods in genetic epidemiology. BioMed Research International, Article ID 432375, 13 pages, 2013. doi:10.1155/2013/432375.
  7. Lehr, T., Yuan, J., Zeumer, D., Jayadev, S., and Ritchie, M. (2011). Rule-based classifier for the analysis of genegene and gene-environment interactions in genetic association studies. Bio Data Mining, 4:4 .
  8. Luna, J., Romero, J., and S., S. V. (2010). A grammar guided genetic programming algorithm for mining association rules. IEEE Congresso in Evolutionary Computation (CEC). pp. 1-8.
  9. Mata, J., Alvarez, J., and Riquelme, J. (2001). Mining numeric association rules via evolutionary algorithms. the 5th International Conference on Artificial Neural Networks and Genetic Algorithms, Prague, Czech Republic, pp. 264-267.
  10. McKinney, B., Reif, D., Ritchie, M., and Moore, J. (2006). Machine learning for detecting gene-gene interactions: a review. Appl. Bioinformatics, 5, 7788.
  11. Moore, J. H. (2005). A global view of epistasis. Nat Genet. 37(1):13-4.
  12. Motsinger, A., Ritchie, M., and Reif, D. (2007). Novel methods for detecting epistasis in pharmacogenomics studies. Pharmacogenomics, 8:1229-1241.
  13. Motsinger-Reif, A., Deohdar, S., Winham, S., and Hardison, N. (2010). Grammatical evolution decision trees for detecting gene-gene interactions. BMC Bio Data Mining.
  14. Motsinger-Reif, A., Dudek, S., Hahn, L., and Ritchie, M. (2008). Comparison of approaches for machinelearning optimization of neural networks for detecting gene-gene interaction in genetic epidemiology. Genetic Epidemiol, 32:325-340.
  15. O'Neill, M. and Ryan, C. (2003). Grammatical evolution: Evolutionary automatic programming in an arbitrary language. Boston: Kluwer Academic Publishers.
  16. Salleb-Aouissi, A., Vrain, C., and Nortet, C. (2007). Quantminer: A genetic algorithm for mining quantitative association rules. the 20th International Joint Conference on Artificial Intelligence, Hyberadad, India.
  17. Steen, K. V. (2011). Travelling the world of gene-gene interactions. Brief Bioinform 1-19.
  18. Winham, S., Colby, C., Freimuth, R., Wang, X., de Andrade, M., and Biernacka, J. (2012). Snp interaction detection with random forests in high-dimensional genetic data. BMC Bioinformatics, 13:164. doi: 10.1186/1471-2105-13-164.
Download


Paper Citation


in Harvard Style

Boutorh A. and Guessoum A. (2014). Grammatical Evolution Association Rule Mining to Detect Gene-Gene Interaction . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014) ISBN 978-989-758-012-3, pages 253-258. DOI: 10.5220/0004913702530258


in Bibtex Style

@conference{bioinformatics14,
author={Aicha Boutorh and Ahmed Guessoum},
title={Grammatical Evolution Association Rule Mining to Detect Gene-Gene Interaction},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014)},
year={2014},
pages={253-258},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004913702530258},
isbn={978-989-758-012-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014)
TI - Grammatical Evolution Association Rule Mining to Detect Gene-Gene Interaction
SN - 978-989-758-012-3
AU - Boutorh A.
AU - Guessoum A.
PY - 2014
SP - 253
EP - 258
DO - 10.5220/0004913702530258