Text Categorization Methods Application for Natural Language Call Routing

Roman Sergienko, Tatiana Gasanova, Eugene Semenkin, Wolfgang Minker

Abstract

Natural language call routing can be treated as an instance of topic categorization of documents after speech recognition of calls. This categorization consists of two important parts. The first one is text preprocessing for numerical data extraction and the second one is classification with machine learning methods. This paper focuses on different text preprocessing methods applied for call routing. Different machine learning algorithms with several text representations have been applied for this problem. A novel text preprocessing technique has been applied and investigated. Numerical experiments have shown computational and classification effectiveness of the proposed method in comparison with standard techniques. Also a novel features selection method was proposed. The novel features selection method has demonstrated some advantages in comparison with standard techniques.

References

  1. Chu-Carroll, J. and Carpenter, B. (1999). Vector-based natural language call routing. Computational linguistics, 25(3):361-388.
  2. Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., and Lin, C.-J. (2008). Liblinear: A library for large linear classification. The Journal of Machine Learning Research, 9:1871-1874.
  3. Gasanova, T., Sergienko, R., Semenkin, E., Minker, W., and Zhukov, E. (2013). A semi-supervised approach for natural language call routing. Proceedings of the SIGDIAL 2013 Conference, pages 344-348.
  4. Ishibuchi, H., Nakashima, T., and Murata, T. (1999). Performance evaluation of fuzzy classifier systems for multidimensional pattern classification problems. Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on, 29(5):601-618.
  5. Jan, E.-E. and Kingsbury, B. (2010). Rapid and inexpensive development of speech action classifiers for natural language call routing systems. In Spoken Language Technology Workshop (SLT), 2010 IEEE, pages 348-353. IEEE.
  6. Kuo, H.-K. J. and Lee, C.-H. (2003). Discriminative training of natural language call routers. Speech and Audio Processing, IEEE Transactions on, 11(1):24-35.
  7. Salton, G. and Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5):513-523.
  8. Sarikaya, R., Hinton, G. E., and Ramabhadran, B. (2011). Deep belief nets for natural language call-routing. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5680- 5683. IEEE.
  9. Shafait, F., Reif, M., Kofler, C., and Breuel, T. M. (2010). Pattern recognition engineering. In RapidMiner Community Meeting and Conference, volume 9.
  10. Soucy, P. and Mineau, G. W. (2005). Beyond tfidf weighting for text categorization in the vector space model. In IJCAI, volume 5, pages 1130-1135.
  11. Witt, S. M. (2011). Semi-automated classifier adaptation for natural language call routing. In INTERSPEECH, pages 1341-1344.
Download


Paper Citation


in Harvard Style

Sergienko R., Gasanova T., Semenkin E. and Minker W. (2014). Text Categorization Methods Application for Natural Language Call Routing . In Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ASAAHMI, (ICINCO 2014) ISBN 978-989-758-040-6, pages 827-831. DOI: 10.5220/0005139708270831


in Bibtex Style

@conference{asaahmi14,
author={Roman Sergienko and Tatiana Gasanova and Eugene Semenkin and Wolfgang Minker},
title={Text Categorization Methods Application for Natural Language Call Routing},
booktitle={Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ASAAHMI, (ICINCO 2014)},
year={2014},
pages={827-831},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005139708270831},
isbn={978-989-758-040-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics - Volume 2: ASAAHMI, (ICINCO 2014)
TI - Text Categorization Methods Application for Natural Language Call Routing
SN - 978-989-758-040-6
AU - Sergienko R.
AU - Gasanova T.
AU - Semenkin E.
AU - Minker W.
PY - 2014
SP - 827
EP - 831
DO - 10.5220/0005139708270831