
Authors: Wei Ding 1 ; Yongji Liu 1 and Jianfeng Zhang 2

Affiliations: 1 China Defense Science and Technology Information Center, China ; 2 National University of Defense Technology, China

Keyword(s): Chinese Keywords, Fuzzy Search, Extraction, Encrypted Documents.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Symbolic Systems

Abstract: Cloud storage for information sharing is likely to be indispensable to the future national defence library in China, e.g., for searching national defence patent documents, while security risks must be minimized through data encryption. Patent keywords are a high-level summary of a patent document, so efficiently extracting and searching keywords in patent documents is of practical significance. Owing to the particularities of Chinese keywords, most existing algorithms designed for English-language environments become ineffective in Chinese scenarios. Manual keyword extraction is impractical when the number of files is large, so an improved method based on term frequency–inverse document frequency (TF-IDF) is proposed to automatically extract keywords from the patent literature. The extracted keyword sets also help to accelerate keyword search by linking a finite set of keywords with a large number of documents. Fuzzy keyword search is introduced to further increase search efficiency in cloud computing scenarios compared with exact keyword search methods. Based on Chinese Pinyin similarity, a Pinyin-Gram-based algorithm is proposed for fuzzy search in an encrypted Chinese environment, and a keyword trapdoor search index structure based on the n-ary tree is designed. Both the search efficiency and the accuracy of the proposed scheme are verified through computer experiments.
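
For readers unfamiliar with the baseline the abstract builds on, the following is a minimal sketch of standard TF-IDF keyword scoring. It is not the improved variant proposed in the paper, whose details are not given on this page. The function names `tf_idf` and `top_keywords`, the sample tokens, and the assumption that documents arrive already segmented into words are all illustrative.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Score terms with plain TF-IDF.

    docs: list of token lists (assumed to be already word-segmented).
    Returns one {term: score} dict per document.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    scores = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            # tf = relative frequency in this document; idf = log(N / df).
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def top_keywords(doc_scores, k=3):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    ranked = sorted(doc_scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical, pre-segmented sample documents.
docs = [
    ["加密", "搜索", "专利", "加密"],
    ["专利", "检索", "关键词"],
    ["专利", "模糊", "搜索"],
]
scores = tf_idf(docs)
keywords = top_keywords(scores[0], k=2)
```

In this sample, a term that occurs in every document (here "专利") receives a score of zero, which is why TF-IDF tends to surface terms that are distinctive for a given document rather than common across the collection.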

CC BY-NC-ND 4.0


Paper citation in several formats:
Ding, W.; Liu, Y. and Zhang, J. (2015). Chinese-keyword Fuzzy Search and Extraction over Encrypted Patent Documents. In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - KDIR; ISBN 978-989-758-158-8; ISSN 2184-3228, SciTePress, pages 168-176. DOI: 10.5220/0005581001680176

@conference{kdir15,
author={Wei Ding and Yongji Liu and Jianfeng Zhang},
title={Chinese-keyword Fuzzy Search and Extraction over Encrypted Patent Documents},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - KDIR},
year={2015},
pages={168-176},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005581001680176},
isbn={978-989-758-158-8},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2015) - KDIR
TI - Chinese-keyword Fuzzy Search and Extraction over Encrypted Patent Documents
SN - 978-989-758-158-8
IS - 2184-3228
AU - Ding, W.
AU - Liu, Y.
AU - Zhang, J.
PY - 2015
SP - 168
EP - 176
DO - 10.5220/0005581001680176
PB - SciTePress