spatial heat influencing factor is added to optimize the
common algorithm. The formula is as follows:
Formula (3) is the optimized scoring algorithm,
and Formula (4) is the common scoring algorithm.
In Formula(4), coor is the coordination
factor, which is based on the number of query items
contained in the document. This factor weights
documents that contain more query items using a
method similar to AND.
is the normalized value for each
search, which is the square sum of the weights of each
query item. The shorter the field, the higher the
weight of the field. If a word appears in a field such
as the title, it is more relevant than when it appears in
a field such as the body of content.
is the inverse document frequency (idf),
which is the frequency of a term appearing in all
documents in the collection. The higher the
frequency, the lower the weight. It is used to measure
the "uniqueness" of an item. Terms with a higher
frequency has a lower idf, and uncommon terms have
a higher idf. The inverse document frequency is the
logarithm of the number of documents in the index,
divided by the number of documents that contain the
term, the calculation is as Formula(5).
is the frequency of a term
appearing in the document. The higher the frequency,
the higher the weight. A field that mentions the same
term five times is more relevant than a field that
mentions it only once. Term frequency is the square
root of the number of times the term appears in the
document, the calculation is as Formula(6).
is the weight of query item.
is a weighted factor related to length.
In Formula(3), num
is the number of calls.
is the level of matching between the keywords
and the spatial heat, the calculation is as Formula(7).
In Formula(7), overlap is the number of matched
terms in the query, maxoverlap is the total number of
terms in the query.
is the weight of the spatial heat, the calculation
is as Formula(8). In Formula(8),
is the number of
calls of the map tile k,
is the total number of calls
for all tiles.
4 EXPERIMENT AND ANALYSIS
4.1 Environment and Data
In this article, the experimental data was the point of
interest data from Tengzhou, Zaozhuang, Shandong
Province. The index database was constructed using
Elasticsearch. The search framework was constructed
using front-end web technology. The experimental
data was analyzed to empirically verify the method
proposed by this article. The steps are as follows:
Step 1: Use Elasticsearch to retrieve the users’
map tile call log data.
Step 2: Filter the search results: Retain user
records with a Tilematrix value less than 13.
Step 3: Obtain the Tilematrix, Tilecol, TileRow of
the called map tile and calculate the center point
coordinates and range of the tile.
Step 4: Visualize data to form images, calculate
and , and add them to the score sorting of point of
interest search results.
Step 5: Compare the optimized point of interest
search method to the original method. Carry out
experiments with 6 different search terms, and each
term is searched 50 times. The returned results are
compared, and the corresponding precision ratios are
calculated.
4.2 Results and Analysis
Table 1 shows the comparison of experimental data
between the improved and the original
algorithms.After comparing the results between the
two methods, the improved method had a similar
precision ratio with the original algorithm when the
log data was empty. However, along with the
accumulation of the user's map tile browsing history
data, the outcome of the optimized search method was
significantly improved, whereas the accuracy of the
original method was basically unchanged. It
demonstrated that the method proposed in this article
can return the maximum relevant results based on the
user’s focal hotspot areas, which makes the search
more personalized and intelligent.
A Point of Interest Intelligent Search Method based on Browsing History
191