manuscripts accurately and quickly in the massive
manuscript library becomes particularly important
(Zhang, K. X., 2017). When facing the search request,
the general processing method is to query the
database by keywords. With the increasing amount of
data in the database, the query speed through
keywords will become slow, and it is more and more
difficult to meet the demand for quick query.
Therefore, it is urgent to design a separate search
system. Therefore, this paper designs Solr retrieval
system for enterprise manuscript retrieval, which can
provide users with quick response query function. In
this paper, an enterprise-oriented new media
manuscript retrieval system is designed, which uses
Solr search engine framework to provide fast retrieval
service (Fairbairn, D., 2019).
From the perspective of the development history
of search engines, it can be roughly divided into three
stages of development. The first stage is the first-
generation search engine represented by Yahoo and
InfoSeek, which is based on the World Wide Web and
supports natural language search and advanced
grammar search for the first time, requiring manual
directory sorting. At this stage, the input information
resources are limited, the number of indexes is not
large, and the index query speed is slow; The second
stage is mainly the second generation of search
engine technology represented by Google Browser,
which is based on data(Pereira-Sanchez, V., 2022).
Mining technology and website rating technology and
machine retrieval through keywords, using
distributed service technology, so the retrieval speed
and accuracy have been greatly improved; The third
stage is the third generation of search engine
technology represented by the "technology-driven"
search engine concept proposed by Microsoft
Corporation (Nogueira, M. S., 2021). The second
generation of search engine technology has been
greatly upgraded and improved, which will provide
users with more quality search services and search
experience. At the moment.
The first-class search engine companies in the
world include Google, Microsoft, Baidu and Yahoo,
etc. The mainstream search companies in China
include Baidu, 360 and Sogou. These companies are
leading the trend to provide Internet users with high-
quality search services. For most small and medium-
sized enterprises in China, it is necessary to carry out
information management, quickly and accurately
retrieve enterprise product information and improve
work efficiency. Enterprises can customize the
enterprise search engine through the services
provided by mature search engine companies, but the
enterprise personalized search is limited and the
function scalability is not strong, which cannot
conduct in-depth analysis according to the fields of
different enterprises, resulting in low retrieval
accuracy and slow query speed. Therefore, it is
necessary to build a set of enterprise personalized
search engine (Zhao, X., 2020).
2 METHODS
2.1 Search Engine
Search engine is a technology that searches out the
information and data with high matching degree from
the huge information data of the Internet by adopting
certain computer algorithms and technical means.
Search engines need a lot of computer technology as
technical support, in order to meet the Internet users
fast search, high matching search needs and user
experience. At present, the computer technology
related to search engines includes big data, web
crawler, index sorting, natural language processing
and other technologies. With the advent of the 5G era,
search engines will combine advanced technologies
such as big data, artificial intelligence, and pattern
recognition to provide Internet users with more high-
quality and humanized services (Ufer, N., 2021).
The basic working principle of search engines is to
use web crawlers to continuously obtain a large
number of web resources from various websites on
the Internet, collect these web resources into local
databases, and then process them through web
technology to remove useless interfering information
and further extract key information from useful web
information to build indexes. After the index is
successfully constructed, it is stored in the index
database. When the user uses the search engine of the
browser to query information, it will quickly search
through the index database of the search engine to
find the index and web page information with high
similarity and matching degree with the keywords
entered by the user, and sort the search results by
relevant sorting algorithms. The value is returned to
the user in the order of matching degree from high to
low (Qin, P., 2017).
Search engine contains a variety of types of search
methods, search methods according to the different
characteristics of collecting and querying information
can be divided into the following four ways, including
full-text search, meta search, vertical search and
directory search. Among them, full-text search engine
is a search method that obtains a large number of web
page resources on the Internet through a crawler
ANIT 2023 - The International Seminar on Artificial Intelligence, Networking and Information Technology
334