obtained. As shown in Figure 1, the top three
keyword phrases are extracted, and the top scores
are 0.5737, 0.5395 and 0.6565, respectively.
According to the length of the subject keyword, it is
found that the subject keyword is not a word, but a
phrase. By setting the length of different extracted
keywords, the keyword phrases are extracted.
Through the experimental comparison, Figure 2
shows that when the keyword phrase length is set to
2, the score is the highest, and the first scores are
0.6921, 0.7151 and 0.7797 respectively.
Table 1: Extraction result with keyword set to 1
introduction to
logistics keywords
Keyword extraction
professional
logistics service
supply
(logistics, 0.5737)
(railway transportation, 0.5546)
(service provider, 0.549)
network
advantage
(logistics, 0.5395)
Railway, 0.5367)
(transport, 0.4916)
modern logistics
system
(logistics, 0.6565)
(railway transportation, 0.6275)
(railway, 0.5503)
Table 2: Extraction result with keyword set to 2
introduction to
logistics keywords
Keyword extraction
professional
logistics service
supply
(logistics service provider,
0.6921)
(enterprise logistics, 0.6807)
(Railway special goods, 0.651)
Network
advantage
(logistics network, 0.7151)
(railway resources, 0.7147)
(railway logistics, 0.7066)
modern logistics
system
(logistics system, 0.7797)
(logistics mode, 0.7319)
(railway logistics, 0.73)
According to the keywords and contents, we can
see that some key words are artificially summarized
and not completely included in the contents.
Therefore, if we process the data of the keywords
and remove the manual summary, we can see that
the experimental results are obviously better.
The unsupervised keyword extraction method used
in this experiment uses single text data, which is
limited. In the next step, we will learn to use
supervised method to extract keywords.
REFERENCES
STERCKX L, CARAGEA C, Demeester T, et al. (2016) .
Supervised keyphrase extraction as positive unlabeled
learning[C]//Proceedings of the 2016 Conference on
Empirical Methodsin Natural Language Processing,
Austin, Nov 1- 4..Stroudsburg: ACL, 2016:
1924-1929.
Alzaidy R, Caragea C, Giles C L. (2019). Bi-LSTM-CRF
sequence labeling for keyphrase extraction froms
scholarly scholarly documents [C] //The World Wide
Web Conferenceon-WWW’19.New York: ACM Press,
2019: 2551-2557.
WEI H X, GAO G L, SU X D. (2016). LDA-Based Word
Image Representation for Keyword Spotting on
Historical Mongolian Documents[C]//Neural
Information Processing(ICONIP).Springer, 2016:
432-441.
Campos R, Vitor M, Pasouali A, et al. (2018). YAKE!
Collection In dependent Automatic Keyword
Extractor[C]//In Advances in Information
Retrieval-40th European Conference on Information
Retrieval. Springer ECIR 2018, Lecture Notes in
Computer Science, Grenoble, France. Cham, 2018:
806-810.
Boudin F. (2013). A Comparison of Centrality Measures
for Graph-Based Keyphrase Extraction.
[C]//Proceedings of the 6th International Joint
Conference on Natural Language Processing. Nagoya:
Asian Federation of Natural Language Processing,
2013: 834-838.
Bordolin M, Chatterjee P C, Biswas S K, et al. (2020).
Keyword extraction using supervised cumulative
TextRank [J] . Multimedia Tools and Applications, 79
(41 / 42):31467-31496.
Florescu C, Caragea C. (2017). A New Scheme for
Scoring Phrases in Unsupervised Keyphrase
Extraction[C]//Proceedings of the Advances in
Information Retrieval-39thEuropean Conference on
Information Retrieval.ECIR 2017, Lecture Notes in
Computer Science Aberdeen, UK.
Liu Z Y, Li P, Zheng Y B, et al. (2009). Clustering to find
exemplar terms for keyphrase
extraction[C]//Proceedings of the 2009 Conference on
Empirical Methods in Natural Lan-guage Processing.
Singapor: ACL Press: 257-266.
Gagliard I , Artese M T. (2020). Semantic unsupervised
automatic keyphrases extraction by integrating word
embedding with clustering method [J].Multimoda
Technologies and Interaction, 4 (2): 30.