operations are not supported by these systems,
although in most cases aggregation and
displacement of the user generated content would be
necessary to achieve more readable maps.
In this paper, we try to demonstrate that the
already existing cartographic knowledge could be
used to automatically create maps showing the
sentiments towards places, which are more
appealing and more expressive than the usual maps
with markers. For this purpose we use well known
natural language processing and opinion mining
tools and generate maps of reviews for towns. These
maps consist of a simple base map and specially
designed point symbols, which represent for each
location the corresponding sentiment values by their
size and colour. If locations are too close to each
other the map symbols will be minimized and
slightly displaced. Thus, easily readable maps are
produced, which enable the user to capture at a
glance where attractive touristic locations are and
how many reviews have contributed to their ratings.
The rest of the paper is organized as follows: In
Section 2 the sentiment analysis method we utilized
is described and evaluated. Section 3 addresses the
design process for the map symbols representing
sentiments. A method to displace point symbols is
developed in Section 4. Afterwards, the map
symbols and the displacement are applied to real
world data and the results are presented in Section 5,
before Section 6 concludes the paper.
2 SENTIMENT ANALYSIS
We considered three methods for sentiment analysis,
namely SentiStrength (Thelwall et al., 2010),
Lexicon-Based Classifier (Paltoglou and Thelwall,
2012) and SO-Cal (Taboada et al., 2011), which
have been developed for informal web content. As
the latter performs best in preliminary tests, we will
only present and discuss its results.
We extracted 36,715 reviews about locations in
the USA from a travel social network site,
preprocessed them with the Brill Tagger (Brill,
1992) and classified them using SO-CAL. The
majority of them, 24,367 were classified as positive,
8,659 as negative and 2,017 as neutral. In addition,
500 randomly selected reviews have been manually
classified, in order to evaluate this analysis. The
classification task was to assign to each review
either a positive, a negative or a neutral value,
depending on the sentiments expressed with respect
to the location. Table 1 lists the resulting values for
precision, recall and the f-measure of these 500
reviews.
Table 1: Evaluation results for 500 randomly selected
reviews, considering only location specific sentiments.
# Precision Recall F-measure
Positive 304 0.86 0.90 0.88
Neutral 132 0.87 0.39 0.54
Negative 64 0.43 0.81 0.56
The result for positive reviews is satisfying,
whereas neutral reviews have a rather low recall and
negative reviews a low precision, resulting in a
disappointing f-measure for both classes. One reason
for this shortcoming of the method is that in a lot of
reviews not only a location is described and rated,
but also its historic background. Often the history is
connected to a war or a natural disaster,
consequently the text contains a lot of negative
expressions, which are misjudged as a negative
sentiment towards the corresponding location.
Additionally, neutral reviews, which rather express
facts then sentiments about a location, are seldom
written completely in a factual diction. Instead, they
quite often contain negative as well as positive
judgments on the facts. If the manual classification
task is modified, i.e., if the reviews should be
classified by considering all sentiments expressed in
the text, the results are significantly improved, as
Table 2 shows. Still the recall for neutral and the
precision for negative reviews are not as good as for
positive ones, but they are in accordance to the
results reported in (Taboada et al., 2011).
Table 2: Evaluation results for 500 randomly selected
reviews, considering all sentiments.
# Precision Recall F-measure
Positive 315 0.92 0.92 0.92
Neutral 77 0.90 0.70 0.79
Negative 108 0.81 0.87 0.84
Hence, the method seems to be appropriate for
our domain. Nevertheless, a preprocessing step,
which filters background information out of reviews
would be necessary, in order to get only the location
specific sentiments.
3 MAP SYMBOLS FOR
SENTIMENTS
According to the intended communication goal, the
GISTAM2015-1stInternationalConferenceonGeographicalInformationSystemsTheory,ApplicationsandManagement
130