now, so to speak. In Fig. 1, the skiing condition is
recorded for 4
th
April, but it hardly relevant for users
in July.
3 EXPERIMENTS
We next give overview of the data in the user
collection so far as on 25
th
October 2010. The
collection includes lots of test photos, and the
number of users is small, which may somewhat
skew the results. Nevertheless, some trends and
observations can be seen.
In total, there were 3589 photos of which most
are city views (839), then pictures of nature (801)
and other people (279). Few pictures are also taken
from events (90), documents (40) and animals (59).
In addition, there are photos that are counted as test
photos or failure pictures.
Another point of view is what kind of
descriptions has been typed in by the users. Due to
the experimental stage, a large amount of the photos
(27%) are without any description. The lack of
descriptions is also caused by the difficulty to type
by mobile phone, but descriptions can be added later
from the web interface.
Among the photos that have some kind of
description, significant amount of photos (35%)
have just garbage, some test word (Symbian_test), or
very generic object description (Mug, Wires, Mouse)
indicating test use. In total, 65% of all photos have a
meaningful description. Mostly documented
descriptions are travel photos of places (685), nature
(579), general objects (263), architecture (212)
people (210), and few general descriptions of events
and animals.
People are often described by their names, or by
their roles (runner, floorball player). Only few are
related to place (Untung / STMIK), age (Young
Andrei) or relationship to the person (my son Amir).
Events are significantly more often found in the
user description than could be concluded by content
analysis alone. In our case, events include mostly
work-related meetings described by their acronyms
(ecse, abi, mopsi meeting, ubiikki) but also running
competition (Åland half marathon) and actions
attached with feelings (quality time in skiing
elevator).
Another difference between content and user
description are travel photos. The location is not
easy to recognize from content but it could be
concluded from the positioning data. For example,
Clarke Quay, Geger beach, Suceava, Tahkovuori
and Aholansaari are locations whereas the following
descriptions include additional details: Petronas
Towers (building complex), Heureka (science
center), Singapore flier (Ferris wheel) and Olavin
linna (castle). The extreme case is Musta Pekka
mutkan takana (Black Pete behind the curve) where
Black Pete is the name of a particular slope in Tahko
skiing resort.
Table 1: Distribution of keywords (tags) used in Picasa
and Flickr, in comparison to the user descriptions of
MOPSI collection.
Description:
Picasa Flickr
MOPSI
All Real
Places --- 28% 21% 32%
Events and action 31% 17% 5% 7%
People 6% 7% 6% 10%
Objects --- 5% 8% 12%
Architecture and
nature
25% 21% 23% 37%
Animals --- 3% 2% 2%
Other 20% 16% --- 0%
Garbage 19% 2% 35% ---
Table 1 compares the textual description used in
MOPSI with two other photo sharing sites. The main
difference is that, in MOPSI, location is provided
automatically without any user interaction.
In Picasa, users provide the location by dragging
the photo on GoogleMap. Keywords and location
are thus provided explicitly as two different entities,
and consequently, users tend not to type any location
related keywords. Flickr has somewhat more
complicated interface based on Yahoo! Maps. Only a
predefined set of keywords are allowed, which
explains the quality of tags (only 2% garbage).
Despite the automatic positioning in MOPSI, it
does not reflect on the distribution of the type of
descriptions written. Unlike in Picasa, users still tend
to describe the location anyway for travel pictures,
probably because the position is not confirmed in the
device, but it happens hidden in the background.
Overall, the distribution of topics is rather similar to
that of Flickr. There are slightly more people and
objects described, but these could be just artifacts
from the system being at testing stage.
For photo collecting, two mobile applications
were developed (Java and Symbian C++). A large
number of failures were caused by the Java version,
which lacks several important features. Firstly, there
is an unavoidable delay from the click sound and
when the photo is actually taken. People tend to
move the camera right after they hear the sound and
before the actual picture will be taken. Secondly,
Auto-focus supported by Symbian helps a lot with
picture quality but it was not available in Java. Other
WEBIST 2011 - 7th International Conference on Web Information Systems and Technologies
416