the relevance of each recognized entity to the full-text
description of the listing. Furthermore, the ML model
for image recognition returns a number representing
the accuracy of the entity detection. In the case of the
features and amenities explicitly defined by the users
without the support of any ML model, it is considered
a confidence factor of 1.0.
Both the relevance factor and the confidence value
are hence taken into account to calculate the final
score. However, before computing the score and rank-
ing the results, a query expansion process is applied
to deal with synonymous tags, parent/child terms and
translations in different languages. This process is
carried out by means of tables with recursive relation-
ships and the WordNet lexical database
5
(this step
is still under development). Finally, the results are
ranked by the score.
5 DISCUSSION AND
CONCLUSION
Short-term rental removes long-term housing from
the market, forces rent prices up, saturates areas with
tourism, generates safety and liability concerns and
by and large, it is awash with likely illegal listings. In
this paper, we have presented MISTRuST (accoMmo-
datIon Short Term Rental Scanning Tool) a machine
learning based system aimed at uncovering whether
a given property is being listed in short-term rental
(STR) platforms. It enables users to monitor property
by scheduling automatic searches and checking list-
ings returned by the system against the target prop-
erty. During the development of MISTRuST valid-
ity considerations have been contemplated. Firstly,
the manual allocation of the relevant values of the
amenities and features is a time-consuming task. The
required time can be reduced by assigning default
values to the categories they belong. Secondly, we
are subjected to the terms and conditions of the STR
platforms For example, Airbnb bans the use of au-
tomated data scrapping tools but Alltherooms.com
doesn’t. Nonetheless, this may change in the fu-
ture. In addition, Alltheroms.com doesn’t currently
provide an API so we are using web scraping tech-
niques. Any future changes in the user interface de-
sign of Alltheroms.com may impact our tool. Lastly,
a large-scale evaluation, with many properties and in
different locations would be necessary in order to ob-
tain accurate results. However, no software will be
able to ensure a 100% the accuracy of its guesses due
to the dependence on completeness of the property
5
https://wordnet.princeton.edu/
data provided by the user and the uncertainty of the
ML algorithms themselves. Our next step is to apply
a machine learning model to improve the ranking al-
gorithm for the top N retrieved listings, to avoid man-
ually adjust the relevance’s value of each feature or
amenity. To accomplish that, a substantial amount of
training data is required. So we will introduce a user
feedback system to register the success or the failure
of the algorithm executions and so to train the ML
model. Finally, we plan to release the software as
open-source.
ACKNOWLEDGEMENTS
The work has been supported by the University of
Cadiz with grant ref. EST2018-167 and funds of its
Department of Computer Engineering.
REFERENCES
Cox, M. (2017). Inside Airbnb: adding data to the debate.
Accessed: 2019-09-11.
Filippas, A. and Horton, J. J. (2017). The tragedy of your
upstairs neighbors: When is the home-sharing exter-
nality internalized? Available at SSRN 2443343.
Gurran, N. and Phibbs, P. (2017). When tourists move in:
How should urban planners respond to Airbnb? Jour-
nal of the American Planning Association, 83(1):80–
92.
Katz, V. (2015). Regulating the sharing economy. Berkeley
Technology Law Journal, 30(4):1067–1126.
Lee, D., Hyun, W., Ryu, J., Lee, W. J., Rhee, W., and Suh,
B. (2015). An analysis of social features associated
with room sales of Airbnb. In Proceedings of the 18th
ACM Conference Companion on Computer Supported
Cooperative Work & Social Computing, pages 219–
222. ACM.
Ma, X., Hancock, J. T., Lim Mingjie, K., and Naaman, M.
(2017). Self-disclosure and perceived trustworthiness
of Airbnb host profiles. In Proceedings of the 2017
ACM Conference on Computer Supported Coopera-
tive Work and Social Computing, pages 2397–2409.
ACM.
Nieuwland, S. and van Melik, R. (2018). Regulating
Airbnb: how cities deal with perceived negative ex-
ternalities of short-term rentals. Current Issues in
Tourism, pages 1–15.
Oates, B. J. (2005). Researching information systems and
computing. Sage.
Quattrone, G., Proserpio, D., Quercia, D., Capra, L., and
Musolesi, M. (2016). Who benefits from the ”sharing”
economy of Airbnb? In Proceedings of the 25th Inter-
national Conference on World Wide Web , WWW ’16,
pages 1385–1394, Republic and Canton of Geneva,
WEBIST 2020 - 16th International Conference on Web Information Systems and Technologies
176