SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry

Qing Ke, Bin Wu, Yuxiao Dong, Lei Qin

2011

Abstract

Telecommunication data mining has been often used as a background application to motivate many technical problems in data mining research. However, traditional mining algorithms face new challenges which are tremendous amount of data and high time and space complexity of algorithms. Recently, Map-Reduce parallel computing model has been emerging. In this paper, we combine data mining with Map-Reduce based cloud computing to meet the challenges and showcase our applied system named Saurida. As a full functionality system, we provide data flow oriented preprocessing utilities which achieve almost linear speedup and extensively support for user defined functions, and we also provide many data mining algorithms. More importantly, we elaborate several application scenarios as real-word requirements of telecom industry by employing a large volume of data obtained from telecom operator. And we validate our system has a good scalability, effectiveness and efficiency.

References

  1. Wold, S., Esbensen, K., Geladi, P., 1987. Principal Component Analysis. Chemometrics and Intelligent Laboratory Systems 2, pp. 37-52.
  2. Wold, S., Esbensen, K., Geladi, P., 1987. Principal Component Analysis. Chemometrics and Intelligent Laboratory Systems 2, pp. 37-52.
  3. T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei, 2009. MobileMiner: A Real World Case Study of Data Mining in Mobile Communication. In SIGMOD'09, Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data.
  4. T. Wang, B. Yang, J. Gao, D. Yang, S. Tang, H. Wu, K. Liu, and J. Pei, 2009. MobileMiner: A Real World Case Study of Data Mining in Mobile Communication. In SIGMOD'09, Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data.
  5. Dean, J., Ghemawat, S., 2004. MapReduce: Simplified data processing on large clusters. In OSDI 7804, Sixth Symposium on Operating System Design and Implementation.
  6. Dean, J., Ghemawat, S., 2004. MapReduce: Simplified data processing on large clusters. In OSDI 7804, Sixth Symposium on Operating System Design and Implementation.
  7. Williams R. J., Rumelhart D. E., Hinton G. E., 1986. Learning representation by back-propagating errors. Nature, vol. 323, pp. 533-536.
  8. Williams R. J., Rumelhart D. E., Hinton G. E., 1986. Learning representation by back-propagating errors. Nature, vol. 323, pp. 533-536.
Download


Paper Citation


in Harvard Style

Ke Q., Wu B., Dong Y. and Qin L. (2011). SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry . In Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-8425-52-2, pages 516-519. DOI: 10.5220/0003387905160519


in Harvard Style

Ke Q., Wu B., Dong Y. and Qin L. (2011). SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry . In Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-8425-52-2, pages 516-519. DOI: 10.5220/0003387905160519


in Bibtex Style

@conference{closer11,
author={Qing Ke and Bin Wu and Yuxiao Dong and Lei Qin},
title={SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry},
booktitle={Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,},
year={2011},
pages={516-519},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003387905160519},
isbn={978-989-8425-52-2},
}


in Bibtex Style

@conference{closer11,
author={Qing Ke and Bin Wu and Yuxiao Dong and Lei Qin},
title={SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry},
booktitle={Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,},
year={2011},
pages={516-519},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003387905160519},
isbn={978-989-8425-52-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,
TI - SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry
SN - 978-989-8425-52-2
AU - Ke Q.
AU - Wu B.
AU - Dong Y.
AU - Qin L.
PY - 2011
SP - 516
EP - 519
DO - 10.5220/0003387905160519


in EndNote Style

TY - CONF
JO - Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,
TI - SAURIDA: CLOUD COMPUTING BASED - Data Mining System in Telecommunication Industry
SN - 978-989-8425-52-2
AU - Ke Q.
AU - Wu B.
AU - Dong Y.
AU - Qin L.
PY - 2011
SP - 516
EP - 519
DO - 10.5220/0003387905160519