Ranking Web Services using Centralities and Social Indicators

Tilo Zemke

, Jos

e Ignacio Fern

andez-Villamor

and Carlos

A. Iglesias

Technische Universit

at Chemnitz, Straße der Nationen 62, 09111 Chemnitz, Germany

Universidad Polit

ecnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain

Keywords:

Web Service Composition, Ranking.

Abstract:

Nowadays, developers of web application mashups face a sheer overwhelming variety and pluralism of web

services. Therefore, choosing appropriate web services to achieve speciﬁc goals requires a certain amount of

knowledge as well as expertise. In order to support users in choosing appropriate web services it is not only

important to match their search criteria to a dataset of possible choices but also to rank the results according

to their relevance, thus minimizing the time it takes for taking such a choice. Therefore, we investigated six

ranking approaches in an empirical manner and compared them to each other. Moreover, we have had a look

on how one can combine those ranking algorithms linearly in order to maximize the quality of their outputs.

1 INTRODUCTION

Over the past years, the number of web services

that offer an API to access their functionalities

has risen rapidly. As of February 2012, Pro-

grammableWeb.com

as one of the most important

directories for APIs holds over 5,000 APIs in its

database. This sheer overwhelming plurality of

web services that are available to the community of

mashup creators does not only provide a huge amount

of possibilities to mash up the World Wide Web but

also requires a certain level of expertise when one

wants to create a mashup. Therefore, choosing ap-

propriate web services to accomplish a speciﬁc goal

can still be a time-consuming task scaring off poten-

tial developers that are not that experienced in mashup

creation.

In order to overcome this issue, there are different

approaches pursued by current research. Typically,

a user request gets semantically matched against a

web service’s description and a ranking is produced

by providing a list of web services descending in sim-

ilarity scores. Other approaches employ mechanisms

of crowd computing, such as tagging web service de-

scriptions, for example.

Focusing on how to bring order in the variety of

web services, we investigated six simple ranking ap-

proaches, that work independently from a user re-

quest, and analysed them by means of the quality of

http://www.programmableweb.com

their outputs. Furthermore, we investigated the possi-

bility of linearly combining them to compound rank-

ing functions that provide quality enhancements.

The rest of this paper is structured as follows:

First, we give a brief introduction on the the model

we employed for our rankings in section 2, followed

by the details of the implemented ranking functions as

well as our approach to combine them. Afterwards,

the methodology used for evaluating and comparing

the implemented ranking functions and their respec-

tive outputs is described in section 3. Section 4 sum-

marizes the related work and ﬁnally, conclusions of

our work are drawn in section 5 along with a glimpse

on the future work.

2 RANKING MODEL

A metadirectory consisting of web services, mashups

as well as widgets served as the starting point for our

work. This metadirectory makes use of the Linked

Mashups Ontology (LiMOn) (Jos

e I. Fern

andez-

Villamor and Tilo Zemke and Carlos

A. Iglesias,

2012), a uniﬁed model for those components, which

integrates information that are available from cur-

rent repositories in the web and covers trust, business

as well as technical aspects. Formalizing this, our

dataset consists of a set of web services S and a set

of mashups M .

In order to support a developer in choosing the

right web services a two-step-methodology was used:

156

Zemke T., Ignacio Fernández-Villamor J. and Á. Iglesias C..

Ranking Web Services using Centralities and Social Indicators.

DOI: 10.5220/0003997101560160

In Proceedings of the 7th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE-2012), pages 156-160

ISBN: 978-989-8565-13-6

 2012 SCITEPRESS (Science and Technology Publications, Lda.)

• First, we ﬁltered S for potential component can-

didates using her query. This ﬁltering provides a

subset of S called S

query

• Afterwards, a ranking function f ∈ F is applied on

query

and returns a speciﬁc permutation of S

query

i.e. a ranking of the web services s

∈ S

query

, by

assigning each service s

a ranking score r

Focusing on the ranking part, the ﬁltering was done

by selecting only web services from the metadirectory

whose names, textual descriptions and/or tags con-

tained a speciﬁc search term. Hence, for example,

the subset S

image

contains all web services that have

the term ”image” in their respective names, textual

descriptions and/or tags.

Our goal was to ﬁnd indicators for a web service’s

relevance and therefore, six different simple rank-

ing function were chosen and implemented, namely

four different types of centrality and two indicators

of social activity. Dealing with centralities, we de-

ﬁned an undirected and bipartite graph G = (V,E)

letting the set of vertices V = S ∪ M be the union

of the set of web services and the set of mashups in

our dataset. The set of edges was deﬁned as E =

{(s,m) | Mashup m uses service s.} using the prop-

erty uses of LiMOn. Figure 1 illustrates the structure

of this graph.

Figure 1: Illustration of the Mashup API Graph.

2.1 Simple Ranking Functions

In particular, the following ranking functions have

been investigated:

• C

: Degree Centrality, i.e. in our scenario the

number of mashups that use a certain web service,

which directly reﬂects its popularity.

• C

: Betweenness Centrality is a more complex ap-

proach that considers the number of shortest paths

between two vertices v 6= u a web service s lies

on. This metric is an important technique in so-

cial network analysis and can be determined with

the help of Brandes’ algorithm (Brandes, 2001).

• C

: Closeness Centrality: A vertex v is ranked

higher the shorter the geodesic distances between

itself and other vertices are, i.e. the closer it is to

other vertices. Closeness centrality is also an im-

portant technique in social network analysis and

can be determined with the help of Brandes’ al-

gorithm as well - even as a side product of calcu-

lating C

. In order to be working with our graph

structure we implemented it with a modiﬁcation

(Opsahl et al., 2010) proposed.

• C

: Eigenvector Centrality is a very established

and successful approach to rank documents in

other domains, e.g. PageRank (Page et al., 1998)

for web resources. The central idea behind it is

that a web service gets ranked higher the more im-

portant the mashups are that use it and vice versa.

• PUR: The score ranging from 0 to 5 each web

service has on Programmable Web’s user rating

functionality which measures the degree of satis-

faction the users had when working with a speciﬁc

API.

• GSO: The amount of hits the Google Search En-

gine

returned querying it for the web service’s

name and limiting the results to the domain of

StackOverﬂow

, a question-and-answer website

specialized on programming topics, is an indica-

tor of how widespread a web service is among de-

velopers. ”Twitter site:stackoverﬂow.com” could

serve as an example for such a search engine

query.

2.2 Compound Ranking Functions

In addition to the mentioned simple functions, we in-

vestigated on how one could combine them linearly

in order to create a new, compound ranking function,

which can result in a different permutation of S

query

Such a linear combination F can be described as in

equation 1.

F (S

query

) =

∑

f ∈F

f (S

query

) (1)

The following example will illustrate the idea behind

this: Having three web services in our subset, s

, s

and s

, as well as two simple ranking functions, f

and f

which produce the following ranking scores r

• Function f

ranks service s

as the most relevant

one with a score of r

= 10, s

in second place

scoring r

= 5 and s

in third place with a score

of r

= 1.

• Function f

places s

= 8) ﬁrst, s

= 3)

second and s

last with a score of r

= 1.

http://www.google.com

http://www.stackoverﬂow.com

RankingWebServicesusingCentralitiesandSocialIndicators

157

In order to create a new ranking, we can now combine

with f

using λ

= λ

= 0.5. The resulting ranking

of our linear combination F would be the following:

• Service s

scores 0.5 ∗ 8 + 0.5 ∗ 5 = 6.5

• Service s

scores 0.5 ∗ 10 + 0.5 ∗ 1 = 5.5

• Service s

scores 0.5 ∗ 1 + 0.5 ∗ 3 = 2.0

Table 1: Illustration of the example scenario for a linear

combination of ranking functions (λ

= λ

= 0.5).

Pos f

) f

) F (r

)

1 s

(10) s

(8) s

(6.5)

2 s

(5) s

(3) s

(5.5)

3 s

(1) s

(2)

3 EVALUATION AND RESULTS

The metadirectory contains over 10,000 web services

and 7,000 mashups after crawling ProgrammableWeb

and Yahoo Pipes

in July 2011.

Adapting the methodology of relevance judge-

ments (K

uster and K

onig-Ries, 2009), a group of

three relevance judges, i.e. experienced mashup de-

velopers, was formed. Moreover, three different sub-

sets of our dataset’s web services, i.e. S

twitter

, S

voice

and S

image

containing 32, 19 and 18 web services re-

spectively, were chosen. First of all, each relevance

judge had to individually rate each web service ac-

cording to three different criteria, functional scope,

technical variety and support as well as trust in the

service and its provider. In a second step, the rele-

vance judges met and conﬂicts, that occurred when

two or more judges did not give the same rating on

a certain criterium for a speciﬁc web service, were

discussed until all judges agreed on a uniform rating.

Using this uniform rating to produce gain quantiﬁca-

tions G

, which reﬂect the relevance, for each web ser-

vice s

, the Normalized Discounted Cumulated Gain

(nDCG

) (J

arvelin and Kek

ainen, 2002) metric has

been applied to each simple ranking function. The

nDCG

metric is based on DCG

which is deﬁned as

follows in our scenario:

DCG

(

, i = 1

DCG

i−1

+ G

/log2(i), otherwise

(2)

The higher the position of web services with high gain

quantiﬁcations are in a speciﬁc ranking, the better the

evaluational score of the ranking itself. This leads to

a very intuitive sight on the quality of the rankings

produced by the simple ranking functions. We chose

http://pipes.yahoo.com

sharp gain quantiﬁcations, i.e. powers of 2, as well as

a discounting factor of 2 and we only compared the re-

sults up to the 15th position in the rankings (nDCG

)

thus modelling a rather impatient developer that needs

quality results in the beginning of his results list.

Figure 2: Results of the evaluation in S

voice

Figure 3: Results of the evaluation in S

image

Figure 4: Results of the evaluation in S

twitter

Figures 2, 3 and 4 show the results of the evalua-

tion done. As can be seen, the ranking functions pro-

duce results of considerably similar quality except the

ProgrammableWeb user rating PU R. An explanation

for PUR’s lack of quality may be the lack of votes

and therefore missing reliability. Moreover, the rea-

son for the similarity between the centrality measures

is their strongly-related nature and the structure of

our dataset’s graphical representation. For example,

the more mashups use a certain web service (C

) the

higher is the probability of being part of a shortest

path in G (C

) and the higher the number of mashups

or APIs close to it (C

ENASE2012-7thInternationalConferenceonEvaluationofNovelSoftwareApproachestoSoftwareEngineering

158

Although runtime performance has not yet been

taken into consideration, our experiments showed that

degree centrality as well as eigenvector centrality de-

liver the best cost-beneﬁt ratios among the analysed

ranking approaches. While betweenness and close-

ness centrality suffer from their algorithmic complex-

ity (O(S M )), the trafﬁc caused by GSO does not im-

ply a practical use.

In addition to that we analysed nearly 325,000

possible linear combinations for each subset of web

services that was evaluated and checked whether or

not nDCG

could be improved. The results show that

there are slight improvements possible in our scenario

with the most remarkable one found in S

image

achiev-

ing an nDCG

of 0.9715 for a combination of C

GSO and PUR, i.e.

C +

GSO +

PUR, over

0.9307, the best score of a simple ranking function

) in this speciﬁc subset. Table 2 shows the scores

of the most successful linear combinations (nDCG

)

compared to the most successful elementary ranking

functions for each evaluated set.

Table 2: This table shows the score of linear combinations

of the elementary ranking functions that maximize the qual-

ity of the overall ranking output.

Set max(nDCG

( f

)) nDCG

∗

)

image

0.9307 (C

) 0.9715

voice

0.8991 (C

) 0.9049

twitter

0.8338 (C

) 0.8418

As can be seen, the improvements achieved by lin-

early combining ranking functions, especially for

twitter

and S

voice

, are not very high. This is a result

of the the likewise nature of our elementary ranking

functions and therefore the similarity of the rankings

they produce.

4 RELATED WORK

In this paper, we presented different approaches for

ranking web services independently of how they are

matched against a user query. It has to be noted, that

all of the previously mentioned ranking functions do

not take the user request into consideration. Some

of the presented ranking functions use centralities as

indicators for a web service’s relevance while others

employed social activities. Using degree, between-

ness and closeness centrality in order to analyse the

network of Programmable Web, (Wang et al., 2009)

also draw conclusions on the importance of a cer-

tain web service with the help of a user-api-network

and the degree centralities of a service’s neighbour-

hood. Introducing the serviut rank (Ranabahu et al.,

2008) present a composite ranking functionality for

web services that - inter alia - makes use of popular-

ity scores. Moreover, they use Alexa trafﬁc rankings

in order to determine the popularity of a web service.

Futhermore, (Elmeleegy et al., 2008) use estimations

of conditional probabilities that a certain concept is

added to a given mashup input as basis for the ranking

component of their mashup advisor. WSColab (Gaw-

inecki et al., 2010) introduces the concept of struc-

tured collaborative tagging in the context of web ser-

vice matchmaking. While succeeding at JGDEval

S3 Contest in 2009 their rankings are build upon sim-

ilarity scores for web services’ interfaces and func-

tional behaviour. Another approach is presented by

(Goarany et al., 2010) by predicting mashup patterns

using social tagging. Also exploiting the structure

folksonomies, (Hotho et al., 2006) adapted the idea

behind the popular PageRank and created FolkRank

demonstrating their results in the social bookmark-

ing domain. (Skoutas et al., 2010) also propose a

methodology of ranking web services based on domi-

nance relationships between web services where mul-

tiple criteria can be integrated.

5 CONCLUSIONS AND FUTURE

WORK

Throughout this paper we showed that the presented

ranking algorithms can produce quality rankings.

Moreover, we showed that ranking functions can be

linearly combined in order to improve those rank-

ings. Due to the similarity between the analysed rank-

ings, those improvements were mostly rather mini-

mal. Therefore, other ranking approaches, such as,

for example, semantic similarity scores for the web

services’ descriptions to the user’s search query or

QoS of a web service, should be taken into account

as well. During this work a query interface, called

rOMking for end-users has been implemented, where

the presented concepts are provided.

Future work will also involve further analysing

the performance of the presented ranking functions as

well as the process of efﬁciently optimizing the rank-

ings with the help of linear combinations. Enhancing

the capabilities of the minimalistic ﬁltering process is

planned, too.

http://fusion.cs.uni-jena.de/professur/jgdeval/

RankingWebServicesusingCentralitiesandSocialIndicators

159

ACKNOWLEDGEMENTS

This research project was funded by the European

Commission under the R&D project OMELETTE

(FP7-ICT-2009-5).

REFERENCES

Brandes, U. (2001). A faster algorithm for between-

ness centrality. Journal of Mathematical Sociology,

25(2):163–177.

Elmeleegy, H., Ivan, A., Akkiraju, R., and Goodwin, R.

(2008). Mashup Advisor: A Recommendation Tool

for Mashup Development. 2008 IEEE International

Conference on Web Services, pages 337–344.

Gawinecki, M., Cabri, G., Paprzycki, M., and Ganzha, M.

(2010). Wscolab : Structured collaborative tagging

for web service matchmaking. Proceedings of the WE-

BIST Conference, pages 70–77.

Goarany, K., Kulczycki, G., and Blake, M. B. (2010). Min-

ing Social Tags to Predict Mashup Patterns. Informa-

tion Systems Journal, (Smuc):71–77.

Hotho, A., J

aschke, R., Schmitz, C., and Stumme, G.

(2006). Information retrieval in folksonomies: Search

and ranking. The Semantic Web Research and Appli-

cations, 4011(28of33[):411–426.

arvelin, K. and Kek

ainen, J. (2002). Cumulated gain-

based evaluation of ir techniques. ACM Transactions

on Information Systems, 20(4):422–446.

Jos

e I. Fern

andez-Villamor and Tilo Zemke and Carlos

Iglesias (2012). A semantic metadirectory of services

based on web mining techniques linked mashups on-

tology ( limon ). In Proceedings of the AAAI 2012

Spring Symposium on AI, pages 27–33.

uster, U. and K

onig-Ries, B. (2009). Relevance judgments

for web services retrieval - a methodology and test col-

lection for sws discovery evaluation. In Proceedings

of the 7th IEEE European Conference on Web Services

(ECOWS09), Einhoven, The Netherlands.

Opsahl, T., Agneessens, F., and Skvoretz, J. (2010). Node

centrality in weighted networks: Generalizing degree

and shortest paths. Social Networks 32 (3), pages 245–

251.

Page, L., Brin, S., Motwani, R., and Winograd, T. (1998).

The page rank citation ranking: Bringing order to the

web. In Proceedings of the 7th International World

Wide Web Conference, pages 161–172.

Ranabahu, A., Nagarajan, M., Sheth, A. P., and Verma, K.

(2008). A Faceted Classiﬁcation Based Approach to

Search and Rank Web APIs. 2008 IEEE International

Conference on Web Services, pages 177–184.

Skoutas, D., Sacharidis, D., Simitsis, A., and Sellis, T.

(2010). Ranking and Clustering Web Services Using

Multicriteria Dominance Relationships. IEEE Trans-

actions on Services Computing, 3(3):163–177.

Wang, J., Chen, H., and Zhang, Y. (2009). Mining user

behavior pattern in mashup community. 2009 IEEE

International Conference on Information Reuse Inte-

gration, pages 126–131.

ENASE2012-7thInternationalConferenceonEvaluationofNovelSoftwareApproachestoSoftwareEngineering

160