Word Frequency Statistics Based on Serverless Computing
Zhaoxin Jia
2024
Abstract
Thanks to the continuous updating of server architectures, serverless computing has gradually become a research hotspot in the current cloud computing field in recent years because of its agile, scalable, and cost-effective features. This paper proposes a word frequency statistics method based on the serverless computing technology and MapReduce framework. Specifically, author add a step of preprocessing based on the basic method. Then, author adaptively assign tasks to each thread according to the total number of words in each file, which helps to reduce time waste. The filegroup will be split in the map stage. During the reduce stage, author further count the word frequency in each thread and store it in a temporary file. The project mainly achieved multi-threaded parallel task completion. Extensive experimental results successfully demonstrated the superiority of multi-threaded parallel processing efficiency in dealing with large amounts of data, which author suppose can bring more new insight for developing serverless computing.
DownloadPaper Citation
in Harvard Style
Jia Z. (2024). Word Frequency Statistics Based on Serverless Computing. In Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence - Volume 1: EMITI; ISBN 978-989-758-713-9, SciTePress, pages 100-104. DOI: 10.5220/0012910600004508
in Bibtex Style
@conference{emiti24,
author={Zhaoxin Jia},
title={Word Frequency Statistics Based on Serverless Computing},
booktitle={Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence - Volume 1: EMITI},
year={2024},
pages={100-104},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012910600004508},
isbn={978-989-758-713-9},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence - Volume 1: EMITI
TI - Word Frequency Statistics Based on Serverless Computing
SN - 978-989-758-713-9
AU - Jia Z.
PY - 2024
SP - 100
EP - 104
DO - 10.5220/0012910600004508
PB - SciTePress