Design and Implementation of New Media Manuscript Retrieval System

Peng Chen

2023

Abstract

With the rise of the Internet and mobile Internet in recent years, new media has also achieved vigorous development, and new media articles and manuscripts have also shown an explosive growth trend. In the face of massive and multi-format new media data information, how to quickly and accurately find the required manuscript information in such large-scale data information has become a problem faced by users of we-media. According to the above problems and requirements, this paper designs and develops the architecture of Spring+SpringMVC+Hibemate, combines Solr search engine service and Baidu speech recognition tool, and proposes a new media manuscript retrieval system with B/S architecture. The system uses Java as the development language to implement. This paper focuses on the analysis of the key technologies and strategies used in the system architecture design, and develops and designs a new media manuscript retrieval system based on Solr, which mainly includes pre-processing, building solr system, user query and database. Based on the open source search engine Solr as the core of the system, this paper studies the implementation principle of the core technology index of search engine. In order to ensure the efficiency and quality of word segmentation, the algorithm of word segmentation and the performance comparison of various Chinese word segmentation are studied. In order to facilitate Solr to use text to build index, the text conversion method of non-text files is studied.

Download


Paper Citation


in Harvard Style

Chen P. (2023). Design and Implementation of New Media Manuscript Retrieval System. In Proceedings of the 2nd International Seminar on Artificial Intelligence, Networking and Information Technology - Volume 1: ANIT; ISBN 978-989-758-677-4, SciTePress, pages 333-337. DOI: 10.5220/0012282900003807


in Bibtex Style

@conference{anit23,
author={Peng Chen},
title={Design and Implementation of New Media Manuscript Retrieval System},
booktitle={Proceedings of the 2nd International Seminar on Artificial Intelligence, Networking and Information Technology - Volume 1: ANIT},
year={2023},
pages={333-337},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012282900003807},
isbn={978-989-758-677-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 2nd International Seminar on Artificial Intelligence, Networking and Information Technology - Volume 1: ANIT
TI - Design and Implementation of New Media Manuscript Retrieval System
SN - 978-989-758-677-4
AU - Chen P.
PY - 2023
SP - 333
EP - 337
DO - 10.5220/0012282900003807
PB - SciTePress