2.4 Improve Ranking Mechanisms
One very clear result from our survey was the
immense importance of ranking algorithms: People
are quite unhappy with today’s search engine’s
result ranking methods being only partly transparent
and only poorly adaptable to the user. For innovation
professionals, especially a ranking according to up-
to-dateness and regional aspects showed up to be
important. An information acquisition tool for
innovation professionals thus should provide
adequate possibilities. The determination of up-to-
dateness, of course, is a challenging task and heavily
depends on the document being analysed: Certain
documents, like blog entries, patents, or press
releases usually contain appropriate metadata. If
explicitly marked, e.g. using an appropriate HTML
or XML-tag, identifying them is trivial. Sometimes,
however, they have to be extracted from the text –
which is rather straight-forward by using regular
expressions. In case of arbitrary web documents, the
task is much more difficult: though timestamps may
be found in the document it is not always obvious if
it denotes the point in time the information in the
text refers to. If no timestamp is given at all, the
freshness might be estimated by comparing the
content with another version of the content that has
been indexed the last time the page was visited. Web
monitoring tools do exactly that: they visit a web
page in certain intervals and detect certain changes.
By integrating carefully selected web monitoring
patterns, a search engine might consider such
documents that have been recently changed in the
ranking method.
2.5 Support Long-term
Information Needs
Search queries are often classified as being either
1. navigational (searching one or more specific
document(s)),
2. informational (seeking information for a given
topic), or
3. transactional (perform a particular action),
depending on the user’s intention (Broder, 2002,
Manning, Raghavan and Schütze 2007).
Almost 50 percent of all queries that general
purpose search engine users utter belong to the first
class (Broder, 2002). Consequently, current general
purpose search engines mainly address navigational
information needs. Nevertheless, the information
needs of professional end-users (like market
researchers, innovation professionals, etc.) often fall
into the second or third categories. Springer (2006)
stated that „the further development of tools to
enable the detection of trends and the finding of
information in the Internet can account for
improving the innovation performance of
companies”.
Recurring searches are widely used within
professional information management (cf. Finzen et
al., 2009). The possibility to save and automatically
repeat complex queries thus should be combined
with effective ways to notify the user on newly
found results. According to our study’s results,
techniques like RSS-feeds are still considered less
important than more traditional communication
channels like e-mail. Nevertheless we expect that the
acceptance of such techniques will grow in the
future, as they offer good means of integrating
search results with further applications (e.g.
knowledge management systems).
2.6 Offer Advanced Interaction
and Visualization Concepts
Common use cases in innovation management
include patent mining, competitor observation, and
trend monitoring. To support these use cases
information must be either extracted from one or
more documents, or interpolated given an amount of
documents. Such information needs do not only
require special result presentation techniques, but
also affect the query frontend: the users have to
clarify that they do not want to be confronted with a
list of documents but rather with, e.g., information
extracted from documents (“new products of
competitor X”) or statistic information (“pie chart
comparing the positive and negative utterances of
forum users who wrote about company X in the last
month”, “bar chart showing the trend for a recent
search topic for selected companies”). Search tools
for professional end-users thus require suitable
navigation concepts and powerful user interfaces for
both, search query formulation and result
interpretation.
2.7 Foster Integration
and Collaboration
Web-based information gathering is a very
individual task even in professional information
work. However, with the size of an organisation the
need to exchange search artifacts rises. It is quite
common for larger companies to outsource search
tasks to a special department (Finzen et al., 2009).
With people searching for information together, the
INNOVATION MINING - Supporting Web Mining in Early Innovation Phases
243