Figure 3: Results of the Set Expander tool for input: “Ein-
stein, Newton, Bach”. As one could expect, the common
category of this input is a general “last name”, as these peo-
ple did not share a profession that would point to a more
specific category. Surprisingly, there are also “craters” that
are named after the entered names, and so more caters have
been returned as a possible expansion.
2. to post-process the results of the expansion step
to filter out some entities that are less common,
have less “evidence” of their quality (currently,
we have taken a step in this direction by ordering
and selecting k first candidate entities by the num-
ber of “related” relationships between the candi-
dates and the seeds).
5 SUMMARY
In this paper, we reviewed the existing approaches to
the ESE problem, with a special focus on the most
recent ones. We have put forward a proposal of a
new knowledge-based system called Set Expander
that builds on semantic technologies and knowledge
graph resources. The system is available online and
upon configuration works in an interactive mode. The
tool described in this paper is flexible and extendable
and we plan to continue working on its improvements.
REFERENCES
Adrian, W. T., Alviano, M., Calimeri, F., Cuteri, B., Do-
daro, C., Faber, W., Fusc
`
a, D., Leone, N., Manna,
M., Perri, S., et al. (2018). The asp system dlv: ad-
vancements and applications. KI-K
¨
unstliche Intelli-
genz, 32(2):177–179.
Adrian, W. T. and Manna, M. (2018). Navigating online
semantic resources for entity set expansion. In Proc.
of PADL’18, pages 170–185.
Baader, F., Horrocks, I., and Sattler, U. (2004). Descrip-
tion logics. In Handbook on ontologies, pages 3–28.
Springer.
Chen, J., Chen, Y., Zhang, X., Du, X., Wang, K., and Wen,
J.-R. (2018). Entity set expansion with semantic fea-
tures of knowledge graphs. Journal of Web Semantics,
52:33–44.
Hu, W. and Jia, C. (2015). A bootstrapping approach to
entity linkage on the semantic web. Journal of Web
Semantics, 34:1–12.
Huang, J., Xie, Y., Meng, Y., Shen, J., Zhang, Y., and Han,
J. (2020). Guiding corpus-based set expansion by aux-
iliary sets generation and co-expansion. In Proceed-
ings of The Web Conference 2020, pages 2188–2198.
Kohita, R., Yoshida, I., Kanayama, H., and Nasukawa, T.
(2020). Interactive construction of user-centric dic-
tionary for text analytics. In Proceedings of the 58th
Annual Meeting of the Association for Computational
Linguistics, pages 789–799.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and
Dean, J. (2013). Distributed representations of words
and phrases and their compositionality. In Advances in
neural information processing systems, pages 3111–
3119.
Navigli, R. and Ponzetto, S. P. (2012). Babelnet: The au-
tomatic construction, evaluation and application of a
wide-coverage multilingual semantic network. Artifi-
cial Intelligence, 193:217–250.
P
´
erez, J., Arenas, M., and Gutierrez, C. (2009). Seman-
tics and complexity of sparql. ACM Transactions on
Database Systems (TODS), 34(3):1–45.
Rastogi, P., Poliak, A., Lyzinski, V., and Van Durme, B.
(2019). Neural variational entity set expansion for au-
tomatically populated knowledge graphs. Information
Retrieval Journal, 22(3-4):232–255.
Rong, X., Chen, Z., Mei, Q., and Adar, E. (2016). Egoset:
Exploiting word ego-networks and user-generated on-
tology for multifaceted set expansion. In Proceedings
of the Ninth ACM international conference on Web
search and data mining, pages 645–654.
Sarmento, L., Jijkuon, V., De Rijke, M., and Oliveira, E.
(2007). ” more like these” growing entity classes from
seeds. In Proceedings of the sixteenth ACM confer-
ence on Conference on information and knowledge
management, pages 959–962.
Shen, J., Wu, Z., Lei, D., Shang, J., Ren, X., and Han,
J. (2017). Setexpan: Corpus-based set expansion
via context feature selection and rank ensemble. In
Joint European Conference on Machine Learning and
Knowledge Discovery in Databases, pages 288–304.
Springer.
Wang, R. C. and Cohen, W. W. (2008). Iterative set expan-
sion of named entities using the web. In 2008 eighth
IEEE international conference on data mining, pages
1091–1096. IEEE.
Xiao, Z., Li, C., and Chen, H. (2020). Patternrank+ nn: A
ranking framework bringing user behaviors into entity
set expansion from web search queries. ACM Trans-
actions on the Web (TWEB), 14(3):1–15.
Yan, J., Wang, C., Cheng, W., Gao, M., and Zhou, A.
(2018). A retrospective of knowledge graphs. Fron-
tiers of Computer Science, 12(1):55–74.
Set Expander: A Knowledge-based System for Entity Set Expansion
499