IMPROVING HOW USERS ORGANIZE WEB INFORMATION
FOR GATHERING TASKS
Anwar Alhenshiri, Michael Shepherd and Carolyn Watters
Faculty of Computer Science, Dalhousie University, 6050 University Ave., Halifax, NS, Canada
Keywords: Web information retrieval, Gathering tasks, Organization and management.
Abstract: This paper presents a Visual Information Management System (VIMS) for the Web. In this prototype
system, the task of Web information gathering was taken into consideration with respect to how users locate
information for the task, organize task information, preserve and re-find task information, and compare
information for effective reasoning and decision making. VIMS was designed and built based on
recommendations from previous studies in a larger research. A user study was conducted to evaluate VIMS.
The results of the study show promising indications of the improvements achieved in VIMS with respect to
Web information gathering tasks.
1 INTRODUCTION
Research has studied user tasks in order to identify
task frameworks that would help with understanding
user interactions with the Web. Web tasks have
been classified into fact finding, navigation,
performing a transaction, and information gathering
(Broder, 2002, Kellar at al., 2007). The latter type
accounts for a large portion of the overall tasks on
the Web, representing between 51.7% (Broder,
2002) and 61.5% (Rose and Levinson, 2004).
A Web information gathering task is a composite
of subtasks/activities users perform while interacting
with the Web for accomplishing a goal described in
the task. User activities during Web information
gathering may involve finding sources of Web
information (Web documents), searching for
information on the sources located for the task,
finding related information to the already located
sources and information, comparing information for
reasoning and decision making, organizing task
information, and preserving and re-finding
information (Alhenshiri et al., 2010b).
Since information mismatching and overloading
are two significant problems while search engines
gather and present information (Teevan, 2008), it
becomes the user’s role to locate, compare, and
manage the required information in the task. A Web
search engine sees the sequences of activities in a
task as separate interaction steps. It also provides no
means for re-finding information, which is an
activity that represents one third of the user
interactions during information gathering tasks
according to Mackay and Watters (2008). Moreover,
search engines do not usually provide support for
representing task results according to the type of
information being sought in the task.
With regard to information gathering tasks,
information organization has been barely studied in
the context of the Web. In this article, a Web
information gathering and organization prototype
(VIMS) is presented. VIMS exploits visualization,
visual clustering, and several Web information
preserving, re-finding, and organization strategies
for Web information gathering tasks. The paper is
presented as follows. Section 2 discusses related
research work. Section 3 presents the research
motivations. Section 4 describes VIMS’ design in
details. Section 5 presents the evaluation user study.
Section 6 discusses the research findings. Section 7
concludes the paper.
2 RESEARCH RATIONALE
2.1 Visualizing Web Information
In addition to different prototypes that have been
investigated (Bonnel et al. 2006; Teevan et al.,
2009), there are several search tools on the Web that
use visualization, such as the search engine Viewz
(www.viewzi.com). Visualization of Web search
271
Alhenshiri A., Shepherd M. and Watters C..
IMPROVING HOW USERS ORGANIZE WEB INFORMATION FOR GATHERING TASKS.
DOI: 10.5220/0003308502710277
In Proceedings of the 7th International Conference on Web Information Systems and Technologies (WEBIST-2011), pages 271-277
ISBN: 978-989-8425-51-5
Copyright
c
2011 SCITEPRESS (Science and Technology Publications, Lda.)
results has also been investigated in several layouts
including the use of hyperbolic trees, Self-
Organizing Maps, and thematic maps such as in the
visual search engine Kartoo (www.kartoo.com).
Most of these approaches were intended for
improving how users find sources of Web
information. Exploring multiple features of Web
documents such as their content similarities, page
thumbnails, URLs, and document summaries in a
visualized approach should be investigated in Web
information gathering tasks. These featureswhen
visualized properlycan help users find sources of
information on the Web, find information in such
sources, compare information, and make more
effective and efficient decisions.
2.2 Clustering Web Information
In Web information retrieval, clustering has been
investigated in several prototypes such as in the
work of Alhenshiri et al. (2010a). Clustering has
also been implemented in conventional search
engines such as Clusty (www.clsuty.com), Gceel
(www.Gceel.com), and Google (in their “see
similar” feature and Google Wonder Wheel).
Although the performance of users with list
presentations of Web documents is comparable to
their performance with clustering-based
presentations, user preference usually comes in
favour of clustering-based methods (Carpineto et al.,
2009). In addition, there are indications that
clustering can even be more effective (Turetken and
Sharda, 2005). With the variety of information that
is gathered on the Web, clustering can play a
significant factor in Web information gathering
tasks.
2.3 Preserving and Re-finding
Web Information
Research has focused on enhancing re-finding Web
information locally on the Web browser. However,
re-finding strategies such as the back button,
favourites, and bookmarks can maintain limited
numbers of information sources (documents), and
they are useful only during a particular Web session.
Therefore, searching the Web for re-finding, also
known as re-searching (Teevan, 2008), has been
studied for assisting users in locating results of
interest from previous sessions. Research shows that
a great deal of Web search visitations is for
revisiting (Teevan, 2008). Consequently, Re-finding
is a common activity in Web information gathering
tasks accounting for 53.27% according to Mackay
and Watters (2008). For information gathering tasks
of multi-session nature, which may require a multi-
topic search, re-finding can play a significant role in
the effectiveness of tools designed for this type of
task. Re-finding should be focused not only on
preserving active Web pages in the browser but also
on preserving Web search results in the context of a
complete task.
2.4 Organizing Web Information
Research has focused on investigating how users
manage their information for re-finding (Elsweiler
and Ruthven, 2007; Jones et al., 2003; Mackay et al.,
2005). Strategies users follow to manage Web
information in order to be able to relocate and reuse
previously found information are discussed in the
work of Jones et al. (2003). Most users gather
information over multiple sessions (Mackay and
Watters, 2008), which indicates the need for
management strategies for preserving and re-finding
such information for reuse. The variety of finding,
re-finding, organizing, and management strategies
and approaches users follow while seeking and
gathering Web information can be related to the idea
that current Web tools lack important reminding,
integration, and organization schemes (Cutrell et al.,
2006). How users organize and manage information
during Web information gathering has had little
consideration. Since Web information gathering
tasks may take several sessions, involve looking at
information from different sources, and require
comparing information that may belong to varied
topics, investigating organizational and management
strategies users follow on the Web is necessary.
3 RESEARCH MOTIVATIONS
To further exploit the concepts of information
visualization, visual clustering, re-finding, and
organization, VIMS was designed. VIMS attempts
to utilize visualization and clustering to allow users
to find, compare, and relate information to the
already located sources of information more
effectively. It also aims to create a more effective
storing and re-finding environment. Re-finding is
done not only by searching a list of documents, but
also by using keyword search to re-find individual
documents, sessions, and entire tasks previously
stored by the user. Moreover, VIMS is intended to
provide effective organizational schemes for
information during information gathering tasks.
WEBIST 2011 - 7th International Conference on Web Information Systems and Technologies
272
4 VIMS DESIGN
The VIMS interface was designed using Java swing
components and the prefuse visualization toolkit
(http://prefuse.org/). The CloudGarden GUI builder
‘Jigloo’ was also used in the implementation of
VIMS. The VIMS interface is shown in Figure 1.
Four models were employed in the design and
implementation of VIMS: searching, clustering,
organizing, and re-finding models.
It has been shown that Web information
gathering tasks are largely search based (Alhenshiri
et al., 2011). VIMS combines the powers of the
Google and Yahoo Web search engines. Moreover,
VIMS uses intuitive visual clustering to render its
search results and its preserved task and session
documents. Clustering is performed based on one of
four criteria: network domain, country of origin,
genre, and content similarity (topical clustering).
Clusters are labelled using cluster-internal labelling.
The title of the document closest to the centroid of
the cluster is used as the label of the cluster.
Figure 1: VIMS Interface.
To assist users with organizing task information,
VIMS allows the user to store partial information
during a task by preserving current session
information. This is done by either preserving active
visual views of the current display or by selectively
preserving particular documents among the search
results. Preserved documents are grouped under a
task title (name) and sorted by date for later
retrieval. The user can continue working on the same
task over multiple sessions while adding and
eliminating documents. The user can also add
annotations to the preserved task information along
the way towards completing the task. In addition,
search results and task information (documents) can
be viewed either visually or in HTML format (as a
list of hits). The study in Alhenshiri et al. (2010a)
showed that users prefer to have both views during
information gathering.
For serving the process of re-finding
information, VIMS allows users to store complete
sessions and individual documents. It also allows
search within sessions and within tasks by either
selecting from a list of tasks/sessions or by keyword
search to further assist the user. The keyword search
matches the task name given previously by the user
and the annotations preserved alongside the task.
Moreover, VIMS allows users to email task
information including accumulated documents, task
subject and date, and task annotations. The emailing
strategy was recommended in the work of Jones et
al. (2003). However, VIMS adds the organization of
a task to the subject matter by submitting all the
aforementioned information items. With VIMS, the
user can follow the preserving and re-finding
strategy that suits their needs and accommodates the
task requirements.
5 EVALUATION
To evaluate VIMS, a small-scale study in a field
setting was conducted. While the number of
participants was limited, the results reflected how
early adopters perceive VIMS as a Web information
gathering and organization tool. In this study,
engagement (enjoyment) factors only were
considered in evaluating VIMS. The effectiveness
and efficiency of VIMS were not considered by
comparing VIMS to other systems; however, how
users perceived these factors was taken into
consideration in the survey items used for evaluating
VIMS.
5.1 Study Participants and Design
Twenty participants took part in the study. They
were students from Dalhousie University. VIMS was
given to the participants in field settings. The choice
of the field setting approach was due to the fact that
the researchers wanted to accumulate first
impressions about VIMS from early adopters, and to
allow users to provide their own judgment where no
control of the experiment was in place.
5.2 Study Procedure
Every participant was given a Web information
gathering task to perform at the time and place of
their convenience. Every task was divided into two
IMPROVING HOW USERS ORGANIZE WEB INFORMATION FOR GATHERING TASKS
273
parts to encourage participants to keep information
during the first part and to re-find information
during the second part.
Information gathering tasks were created to
satisfy the criteria described in the work of Kules
and Capra (2006). Every participant was given one
task in addition to a brief description of how VIMS
works. The description illustrated functionalities of
the interface and how each feature behaves. There
was no training for participants on how to use
VIMS. Participants were pointed to the website
where they could download VIMS. They were also
given the description file to read.
After a two week period, an evaluation form was
completed and returned to the principal investigator.
The form involved questions for participants that
concerned several items including: a) the user
satisfaction with VIMS; b) how helpful the visual
clustering of search results was to the user; c) how
interesting it was to use VIMS to complete the given
task; d) how the user perceived preserving and re-
finding Web information and Web documents using
VIMS; e) how the user rated the presentation of Web
search results on VIMS; f) whether or not the self-
emailing feature was useful to the user during the
task; g) whether the user used VIMS to accumulate
and manage task information in the information pool
provided in VIMS; h) what the user liked most about
VIMS and what they thought should be improved;
and i) any other comments the user wanted to add.
Likert scales were used for answering the survey
questions.
5.3 Study Results
With respect to user satisfaction with VIMS, 75% of
the participants showed complete satisfaction.
Twelve percent indicated that they were neither
satisfied nor dissatisfied with VIMS. The remaining
participants (13%) showed dissatisfaction with
VIMS. They indicated that they either did not like
the limit of 120 result hits on VIMS, or that they
prefer a list of hits in a textual presentation such as
in Google’s. The limit of a maximum of 120 results
per display was to prevent clutter, a purpose that was
not discussed with the participants. The z-test shows
a significant difference between the proportion of
participants who were satisfied with VIMS and those
who were dissatisfied (z=2.84, p< 0.003).
VIMS’ helpfulness with the given task was also
evaluated as one of the engagement factors
considered in the study. Most users (80%) used all
the features provided in VIMS. The results show that
69% of the users found VIMS to be helpful while
12% found VIMS to be of no help. Participants who
found VIMS not to be helpful indicated that they
needed further training on some of the features in
VIMS and how to use those features. The remaining
participants (19%) indicated that they either did not
use VIMS for the given task or that they did not use
all of VIMS’ features.
One of the most interesting organization features
in VIMS is how users preserve and re-find
documents and information about a particular task or
session on the Web. All participants used and
evaluated this feature. The results show that 85% of
the participants think that VIMS provided effective
preserving and re-finding capabilities. The
remaining 15% thought that the features were
neither effective nor ineffective. None of the
participants rated these features negatively. The z-
test shows that there was a significant difference
between the proportion of participants who found
VIMS to be effective and those who did not evaluate
the feature of preserving and re-finding task
information (z= 4.11, p<0.0001).
One of the features in VIMS is the ability to send
information to one’s email. Information may include
the name and date of the current task, documents
located for search queries over one or multiple
sessions during the task, and user annotations on the
task. The results showed that only 50% of the users
had the chance to use this feature. One interpretation
of the results in this case is that users tended to
finish the whole task in one session. In future
studies, this feature will be tested during information
gathering over multiple sessions in a more controlled
environment. Of the remaining 50% of users, only
10% found the feature to be of no usefulness. In long
term information gathering tasks such as writing a
report, which may be conducted on multiple
machines over multiple sessions, this feature is
expected to be more effective and more useable.
Another important feature that was evaluated in
VIMS is the task organization capability. This is the
ability to accumulate task information including
documents found for search queries in addition to
annotations added by the user regarding the task at
hand. The feature allows the user to add and remove
documents from the information pool visualized on
the display (see Figure 1) over multiple sessions
while using the task name and date of creation for
access. The results show that 70% of the participants
used and evaluated the feature. Of those, 73% think
that the feature was either helpful or very helpful.
Only 10% think that it was of no help during the
task. Of the participants, 17% did not evaluate this
feature. The z-test shows a significant difference
WEBIST 2011 - 7th International Conference on Web Information Systems and Technologies
274
between the proportion of participants who found
VIMS’ organizational feature to be helpful and those
who found it of no help (z= 3.83, p< 0.00001). The
results are shown in Figure 2.
Figure 2: User Ratings of the Info. Accumulation Feature.
The last engagement factor the study tested was
the presentation of Web search results to the user.
As shown by Alhenshiri et al. (2011), Web
information gathering tasks are largely search based.
Hence, it was necessary to evaluate the presentation
of Web search results on VIMS with the use of
visualization and clustering. The results showed that
67% of the participants think that the VIMS
presentation was effective. Twenty five percent
think that the visual presentation in VIMS is as
effective as a list-of-hits presentation. Only 8% of
the participants think that the presentation of search
results was somewhat cluttered and confusing. The
z-test shows a significant difference between
participants who found the presentation of Web
documents on VIMS to be effective and those who
did not (z=1.87, p<0.04).
Finally, the survey asked participants about
features they liked or found effective in VIMS. The
participants left 39 comments. The comments
belonged to four main categories which are: a)
organizing and managing task results, b) presenting
and exploring search results, c) preserving and re-
finding Web task information, and d) clustering task
results. Most of the comments were in regard to the
organization of task information. Organizing and
managing task information included storing, re-
finding, annotating, and comparing task information.
The results of this item are shown in Figure 3.
Examples of user comments are:
the colour coding of clustering was really
good
The presentation of the search results
grouped by the country it came from was kind
of neat
I loved how I can keep accumulating task
information over multiple sessions’.
Users left important feedback regarding how
VIMS should be improved. User comments covered
different issues but were hard to categorize into
specific and distinct groups. Nonetheless, most of
the comments concerned interface issues such as the
choices of colors and the size of the glyphs that
represent Web documents. Clutter was also an issue
that was mentioned several times. The total number
of comments regarding this criterion was 11. Some
users (10%) indicated that they would like to see
clustering based on different criteria happen on the
fly instead of selecting the clustering criterion prior
to sending search queries. Of the participants, one
user indicated that the search box should be
enlarged; one participant suggested that the user
should have the ability to see the ranks of documents
inside each cluster; and another participant indicated
that there should be more clustering criteria. None of
the comments occurred more than twice in the
provided set of comments.
Figure 3: Categorized Subjective Comments.
6 DISCUSSION
VIMS is a prototype system for testing information
organization and management features intended for
improving Web information gathering tasks. A
relatively small-scale user study was conducted to
examine the feasibility of a larger research study and
to highlight possible improvements to VIMS. In the
evaluation approach, participants were meant to
provide feedback about VIMS as recommended in
the works of Scott (2009), and to suggest further
possible improvements. Users were given two weeks
to work on VIMS and return the evaluation forms.
The longer evaluation period was intended to further
ensure that the study participants had enough time to
use and examine VIMS. The aim of the study was
to identify usability, acceptability, and user
satisfaction issues with VIMS.
IMPROVING HOW USERS ORGANIZE WEB INFORMATION FOR GATHERING TASKS
275
The results of the study indicate that VIMS
provided: a) an excellent level of user satisfaction;
b) significant help to users with their tasks; c)
effective information organization features; d)
effective and useful presentation of task results; and
e) effective preserving and re-finding features. Not
only did users provide specific answers to questions
related to evaluating each feature, but they also
provided their own comments that showed the
degree to which VIMS improved how Web
information gathering is performed. The results of
the study reflect how early adopters perceived
VIMS.
The purpose of the prototype testing using only
students as participants was to decide on the
feasibility of a larger study to evaluate VIMS and to
derive practical design recommendations. Further
studies in this research will use experiments with
larger populations. A larger complete factorial user
experiment will be conducted to examine the
organizational and management criteria identified in
VIMS.
7 CONCLUSIONS
This paper presented VIMS, a prototype system for
improving how users manage and organize Web
information during gathering tasks. The current state
of Web information gathering necessitates studying
challenges users encounter during this type of task.
VIMS was designed based on three previous studies
(Alhenshiri et al., 2010a; Alhenshiri et al., 2010b;
Alhenshiri et al., 2011). Our previous studies
revealed several questions regarding which
visualization, clustering, re-finding, and organizing
factors would improve the process of Web
information gathering. The initial evaluation of
VIMS showed good indications of its usability and
user satisfaction with VIMS as a tool for Web
information organization.
REFERENCES
Alhenshiri, A, Brooks, S., Shepherd, M., and Watters, C.
2010a. Augmenting the Visual Presentation of Web
Search Results. In proceedings of the 5
th
International
Conference on Digital Information Management,
Thunder Bay, ON, Canada, 101-107.
Alhenshiri, A, Shepherd, M., Watters, C., and Duffy, J.
2010b. Web Information Gathering Tasks: A
Framework and Research Agenda. In proceedings of
the International Conference on Knowledge Discovery
and Information Retrieval (KDIR2010), Valencia,
Spain, 131-140.
Alhenshiri, A, Shepherd, M., and Watters, C., 2011. User
Behaviour during Web Search as a Part of Information
Gathering. In proceedings of the Hawaii International
Conference on System Sciences (HICSS 2011),
Hawaii, USA.
Bonnel, N., Lemaire, V., Cotarmanac’h, A., and Morin, A.
2006. Effective Organization and Visualization of
Web Search Results. In Proceedings of the 24
th
IASTED International Multi-Conference on Internet
and Multimedia Systems and Applications, Innsbruck,
Austria, 209-216.
Broder, A. 2002. A Taxonomy of Web Search. ACM
SIGIR Forum, vol. 36, issue 2, 2-10.
Carpineto, C., Osiński, S., Romano, G., and Weiss, D.
2009. A Survey of Web Clustering Engines. ACM
Computing Surveys, vol. 41, issue 3, Article No. 17.
Cutrell, E., Dumais, S. T., and Teevan, J. 2006. Searching
to Eliminate Personal Information Management.
In Communications of the ACM (Special Issue:
Personal Information Management), 49(1), 58-64.
Elsweiler, D., and Ruthven, I. 2007. Towards Task-based
Personal Information Management Evaluations. In
proceedings of the 30
th
Annual International ACM
SIGIR Conference on Research and Development in
Information Retrieval, Amsterdam, The Netherlands,
23-30.
Jones, W., Bruce, H., and Dumais, S. 2003. How do
People Get Back to Information on the Web? How
Can They Do It Better? In Proceedings of the 9
th
IFIP
TC13 International Conference on Human-Computer
Interaction, Zurich, Switzerland.
Kellar, M., Watters, K., and Shepherd, M. 2007. A Field
Study Characterizing Web-based Information-Seeking
Tasks. Journal of the American Society for
Information Science and Technology, vol. 58, issue 7,
999-1018.
Kules, B. and Capra, R.: What Do Exploratory Searchers
Look at in a Faceted Search Interface? In: 9th
ACM/IEEE-CS Joint Conference on Digital Libraries,
Austin, TX, USA, 2009.
Mackay, B., Kellar, M., and Watters, C. 2005. An
Evaluation of Landmarks for Re-finding Information
on the Web. In Proceedings of the 2005 ACM
Conference on Human Factors in Computing Systems,
Portland, Oregon, USA, 1609 - 1612.
Mackey, B., and Watters, C. 2008. Exploring Multi-
session Web Tasks. In Proceedings of the 2008 ACM
Conference on Human Factors in Computing Systems,
Florence, Italy, 4273-4278.
Rose, D., and Levinson, D. 2004. Understanding User
Goals in Web Search. In Proceedings of the 13th
International Conference on World Wide Web, New
York, NY, USA, 13-19.
Scott K. M. 2009. Is Usability Obsolete? Interactions, vol.
16, issue 3, 6-11.
Teevan, J. 2008. How People Recall, Recognize, and
Reuse Search Results. ACM Transactions on
Information Systems, vol. 26, issue 4. Article No. 19.
WEBIST 2011 - 7th International Conference on Web Information Systems and Technologies
276
Teevan, J., Cutrell, E., Fisher, D., Drucker, S. M., Ramos,
G., Andre, P., and Hu, C. 2009. Visual
Snippets: Summarizing Web Pages for Search and
Revisitation. In Proceedings of the 27
th
International
Confserence on Human Factors in Computing
Systems, Boston, MA, USA, 2023-2032.
Turetken, O., and Sharda, R. (2005). Clustering-based
Visual Interfaces for Presentation of Web Search
Results: An Imperical Investigation. Information
Systems Frontier, 7(3), 273-297.
IMPROVING HOW USERS ORGANIZE WEB INFORMATION FOR GATHERING TASKS
277