Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations

Ronny Hänsch, Olaf Hellwich

Abstract

Ensemble learning techniques and in particular Random Forests have been one of the most successful machine learning approaches of the last decade. Despite their success, there exist barely suitable visualizations of Random Forests, which allow a fast and accurate understanding of how well they perform a certain task and what leads to this performance. This paper proposes an exemplar-driven visualization illustrating the most important key concepts of a Random Forest classifier, namely strength and correlation of the individual trees as well as strength of the whole forest. A visual inspection of the results enables not only an easy performance evaluation but also provides further insights why this performance was achieved and how parameters of the underlying Random Forest should be changed in order to further improve the performance. Although the paper focuses on Random Forests for classification tasks, the developed framework is by no means limited to that and can be easily applied to other tree-based ensemble learning methods.

References

  1. Breiman, L. (1996). Bagging predictors. In Machine Learning, pages 123-140.
  2. Breiman, L. (2001). Random forests. Machine Learning, 45(1):5-32.
  3. Geurts, P. and Wehenkel, D. E. L. (2006). Extremely randomized trees. Machine Learning, 63(1):3-42.
  4. Hänsch, R. (2014). Generic object categorization in PolSAR images - and beyond. PhD thesis.
  5. Rodriguez, J. J., Kuncheva, L. I., and Alonso, C. J. (2006). Rotation forest: A new classifier ensemble method. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10):1619-1630.
  6. Shotton, J., Johnson, M., and Cipolla, R. (2008). Semantic texton forests for image categorization and segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, pages 1-8.
  7. Zeiler, M. D. and Fergus, R. (2014). Visualizing and understanding convolutional networks. In Computer Vision ECCV 2014, volume 8689 of Lecture Notes in Computer Science, pages 818-833.
Download


Paper Citation


in Harvard Style

Hänsch R. and Hellwich O. (2015). Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations . In Proceedings of the 6th International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2015) ISBN 978-989-758-088-8, pages 149-156. DOI: 10.5220/0005310901490156


in Bibtex Style

@conference{ivapp15,
author={Ronny Hänsch and Olaf Hellwich},
title={Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations},
booktitle={Proceedings of the 6th International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2015)},
year={2015},
pages={149-156},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005310901490156},
isbn={978-989-758-088-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2015)
TI - Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations
SN - 978-989-758-088-8
AU - Hänsch R.
AU - Hellwich O.
PY - 2015
SP - 149
EP - 156
DO - 10.5220/0005310901490156