A FRAMEWORK FOR INFORMATION DIFFUSION OVER

SOCIAL NETWORKS RESEARCH

Outlining Options and Challenges

Juan Yao and Markus Helfert

School of Computing, Dublin City University, Dublin, Ireland

Keywords: Information Diffusion, Social Networks, Social Influence.

Abstract: Information diffusion is a phenomenon in which new ideas or behaviours spread contagiously through

social networks in the style of an epidemic. Recently, researchers have contributed a plethora of studies,

approaches and theoretical contributions related to various aspects of the diffusion phenomenon. There are

many options and approaches. However there are only rare research articles consolidating and reviewing the

various options. In this paper, we aim to contribute an overview of the most prominent approaches related to

the studies of the diffusion phenomenon. We present a framework and research overview for this area. Our

framework can assist researchers and practitioners to identify suitable solutions and understand the

challenges in the information diffusion over social networks research.

1 INTRODUCTION

Information diffusion within social networks has

known an increased interest in recent years. It is a

ubiquitous process in human social networks.

Recently, the emergence of mobile, email and online

social networking have gradually transformed

communication among people. Social interaction

through these platforms leaves extensive digital

traces by its very nature. The availability of such

rich data has lead to a number of empirical and

theoretical studies on information diffusion process

at a more precise, quantitative level. These studies

have shed light on several important principles of

such phenomenon. The objective of this paper is to

review various recent studies related to information

diffusion process in social networks, summarize

their findings and possible extensions, and present

directions for future research.

2 MODELS & FORMAL

DESCRIPTION

While the outcomes of the diffusion processes are

easily visible, their inner workings have remained

elusive. Models of diffusion process aim to provide

a clear picture of how such dynamic process unfolds

by defining the way information flows between

individuals. Here we consider a collection of

probabilistic and game-theoretic diffusion models;

summarize some of the challenges faced when

modelling the diffusion phenomenon and common

approaches to address these challenges.

2.1 The Dynamics of Adoption Process

One of the most common approaches of modelling

the adoption process is to explicitly represent the

step-by-step dynamics of adoption. Typically, it

assumes the dynamic process unfolds in discrete

time unit with each individual following certain rule

when making adopting decision. A set of individuals

are chosen to be initial active set which corresponds

to the early adopters of the behaviour. Starting with

the initial active set the process then unfolds as

follows: at each time step, individuals who were

active at previous time step remain active; an

individual will be activated according to certain

rules such as a threshold number of his neighbours

have been activated. The dynamic process continues

until no more activation is possible.

In the economics literature, diffusion models

have been studied from a game-theoretic perspective

(Morris, 2000). This approach builds on work

investigating how a new technology A might be

spread through a social network of individuals who

491

Yao J. and Helfert M. (2010).

A FRAMEWORK FOR INFORMATION DIFFUSION OVER SOCIAL NETWORKS RESEARCH - Outlining Options and Challenges.

In Proceedings of the 5th International Conference on Software and Data Technologies, pages 491-494

DOI: 10.5220/0002926204910494

 SciTePress

are currently users of technology of B. The diffusion

process is modelled as the dynamic of a coordination

game played on the social network, in which the

adoption of a common strategy between players has

a higher payoff. In particular, in every time step,

each player in a social network has two available

choices A and B. Each player receives a positive

payoff for each of his neighbours that has the same

choice as he does, in addition to an intrinsic benefit

that he derives from his choice.

2.2 Modelling Social Influence

An underlying premise many diffusion models build

on is that people are influenced by their neighbours

in the social network when making adopting

decision. Social influence determines to a large

extent what we adopt and when we adopt it. One

simple way to capture this effect is to assign each

individual in the network a threshold value

(Granovetter, 1978). The threshold value indicates

the personal tendency of an individual to adopt the

behaviour when his neighbours do. In addition to the

number of adopted friends, how those friends are

connected to one another could also have an impact

on individual’s propensity of adopting. It is argued

that (Burt, 2005) if two actors related to the same

individual are also related to each other, they have

greater power over that individual than if they were

unrelated. A primitive approach (Katona, Zubcsek

and Sarvary, 2009) to incorporate this idea is to

postulate the adoption likelihood of individual

increases as a function of the density of relationships

among their adopted neighbours.

Another way to encode neighbour influence is by

using infection rate, inspired by the epidemic

models. In the Independent Cascade Model

(Goldenberg, Libai and Mullen, 2001), every time

an individual contacts with an active neighbour, he

has a constant chance of getting activated.

Obviously, a constant infection rate seems not

accurate enough. Kempe, Kleinberg and Tardos

(2003) refined the Independent Cascade Model to

interpret the idea that an individual's receptiveness to

influence depends on the past history of interactions

with his neighbours. Contrary to a constant rate, an

individual's probability of being activated is a

function of the set of neighbours have already tried

and failed to influence him.

It’s commonly accepted that social influence

affects adopting decision. However, it is not the only

factors that drive information diffusion. Van den

Bulte and Stremersch (2004) argue that S-shaped

diffusion curves can also result from heterogeneity

in the intrinsic tendency to adopt. In the context of

new product diffusion, individuals are different with

respect to subject matter expertise, strength of

opinion, personality traits, media exposure or

perceived adopting costs. Efforts have been made to

capture this effect. For instance, Hartline, Mirrokni

and Sundararajan (2008) model a buyer’s decision to

buy an item is influenced by the set of other buyers

that own the item and the price at which the item is

offered.

2.3 Modelling Diffusion of Competing

Technologies

Most models discussed above typically focus on the

diffusion of one behaviour or technology. What is

often ignored is that several different behaviours

may coexist in a system at the same time and

possibly compete with each other. Some may have a

better chance to survive and spread than others. In

fact, this scenario frequently arises in the real world.

In consumer market, producers of consumer

technologies often must introduce a new product

into a market where a competitor will offer a

comparable product, which makes them vie for sales

with competing word-of-mouth cascades. Thus it is

important to examine the diffusion of competing

technologies in social networks.

As mentioned earlier, such phenomena can be

modelled as a coordination game played on the

edges of the social network with multiple equilibria.

An influential paper by Morris (Morris, 2000)

provided a set of elegant game-theoretic

characterizations for when these qualitatively

different types of equilibria arise in terms of the

underlying network topology and the quality of

technology A relative to technology B. In recent

work, Immorlica et al. (2007) incorporates

compatibility between technologies and discuss how

this effects the diffusion. Their results show that in

some cases, for one technology to survive the

introduction of another, the cost of adopting both

technologies must be balanced with a narrow,

intermediate range.

3 EMPIRICAL STUDIES &

FINDINGS

Leskovec, Adamic and Huberman’s (2006) argues

that while above models address the question of how

influence spreads in a network, they are based on

assumed rather than measured influence effects. The

ICSOFT 2010 - 5th International Conference on Software and Data Technologies

492

wide variety of rules theoretical diffusion models

posed on individuals’ behaviour, even if plausible,

are often lacking empirical support. Furthermore,

most of the diffusion models take a single snapshot

of the evolving network and then build upon this

static network topology. As such, it becomes unclear

how accurately existing models render real-world

diffusion phenomena. Cointet and Roth (2007)

suggest that future investigations of the diffusion

mechanisms should begin with adequate empirical

protocols, then propose adapted modelling

frameworks. Most of the recent empirical studies on

information diffusion focus on addressing the

following fundamental questions:

 What are the characteristics of social influence?

 What are the patterns of information cascade?

 How does network structure affect diffusion

process?

3.1 Characteristics of Social Influence

If we view the diffusion process as a cascade of

social influence, a natural starting point is to

understand the local mechanism of the influence.

Social influence can be described as the actions of

an individual can induce his or her friends to behave

in a similar way. Backstrom et al. (Backstrom et al.,

2006) investigated the membership problem in

online communities and measured how propensity of

individuals to join a community depends on friends

already within the community. Specifically, they

measured the joining probability as a function of the

number of friends already in the community. The

adoption curve exhibits kind of a diminishing returns

pattern in which it continues increasing, but more

and more slowly, even for large numbers of friends.

This kind of diminishing returns pattern has also

been observed in many other studies. For example,

Christakis and Fowler (Christakis and Fowler, 2007)

studied the spread of obesity in large social network

over 32 years and found out that people were most

likely to become obese when friends became obese.

However, Kleinberg (Kleinberg, 2007) argued that

the dependence of adopting probability on the

number of friends adopted expressed in this way

reflects an aggregate property of the full population,

and does not imply anything about an particular

individual’s response to their friends’ behaviour.

Anagnostopoulos et al. (Anagnostopoulos et al.,

2008) argues that while these studies have

established the existence of correlation between user

actions and social affiliations, they do not address

the source of the correlation. There are factors such

as homophily (Mcpherson, Lovin and Cook, 2001)

or unobserved confounding variables that can induce

statistical correlation between the actions of friends

in a social network. Homophily is the tendency for

people to choose relationships with people who are

similar to them, and hence perform similar actions.

Recently, Anagnostopoulos et al. (Anagnostopoulos

et al., 2008) proposed a statistical test to distinguish

social influence from correlation using time series

data of user actions. Crandall et al. (Crandall et al.,

2008) developed techniques for identifying and

modelling the interactions between social influence

and social selection process and found an elaborate

interplay between the two factors.

3.2 Patterns of Information Cascade

Although above studies have shed light on the

mechanisms of social influence, the overall patterns

by which the influence spreads through social

networks have been a mystery. Several recent

studies have been conducted to illustrate the

existence of cascade and observe patterns of

cascading behaviour. In particular, Leskovec et al.

(Leskovec, Singh and Kleinberg, 2006) consider

information cascades in a recommendation network.

According to their observation, the distribution of

cascade sizes can be approximated by a heavy-tailed

distribution. Generally cascades are shallow but

occasional large bursts also occur. Another recent

study on cascading behaviour in large blog network

(Leskovec et al., 2007) found that blog posts do not

have a bursty behaviour; they only have a weekly

periodicity.

Liben-Nowell and Kleinberg (Liben-Nowell and

Kleinberg, 2008) traced the information cascade

process on a global scale by using methods to

reconstruct the propagation of massively circulated

Internet chain letters. Contrary to predictions that

large-scale information spreads widely and reaches

many people in very few steps, their results show

that the progress of these chain letters proceeds in a

narrow but very deep tree-like pattern, continuing

for several hundred steps.

3.3 Role of Network Structure

Researchers have long emphasized the important

role played by the network structure in determining

properties of information diffusion. However, the

way such dynamic process is affected by network

structure is still poorly understood. How widely

does information spread? Does it spread only in

local region? Does it spread quickly on a dense

network? Several studies on real-world network data

A FRAMEWORK FOR INFORMATION DIFFUSION OVER SOCIAL NETWORKS RESEARCH - Outlining Options

and Challenges

493

have been conducted to address questions like these.

Granovetter (Granovetter, 1973)’s weak ties

hypothesis states that weak ties typically act as

connectors between different communities or circles

of friendship. Using mobile call records, Onnela et

al. (Onnela et al., 2007) have observed a coupling

between interaction strengths and the network’s

local structure, confirming the weak tie hypothesis.

Specifically, they found that weak ties appear to be

crucial for maintaining the network’s structural

integrity, but strong ties play an important role in

maintaining local communities. In addition, they

investigated how the dynamics of different tie

strengths influence the spread of information in the

network. They show that the coupling between tie

strength and network structure significantly slows

the diffusion process, resulting in dynamic trapping

of information in communities and find that both

weak and strong ties have a relatively insignificant

role as conduits for information.

4 CONCLUSIONS

In this paper we summarize three major challenges

faced when modelling diffusion phenomenon. We

review recent studies that measure diffusion

phenomenon empirically. We believe our framework

can assist researchers and practitioners to understand

the challenges in studying information diffusion

over social networks and identify suitable solutions

to address those challenges.

REFERENCES

Anagnostopoulos, A., Kumar, R., and Mahdian, M., 2008.

Influence and correlation in social networks. In KDD

'08, pages 7-15, New York, NY, USA. ACM.

Burt, R. S., 2005. Brokerage and Closure: An Introduction

to Social Capital. Oxford University Press, USA.

Backstrom, L., Huttenlocher, D., Kleinberg, J., and Lan,

X., 2006. Group formation in large social networks:

membership, growth, and evolution. In KDD '06,

pages 44-54, New York, NY, USA. ACM.

Cointet, J. P. and Roth, C., 2007. How realistic should

knowledge diffusion models be? Journal of Artificial

Societies and Social Simulation, 10(3).

Christakis, N. A. and Fowler, J. H., 2007. The spread of

obesity in a large social network over 32 years. N Engl

J Med, 357(4):370-379.

Crandall, D., Cosley, D., Huttenlocher, D., Kleinberg, J.,

and Suri, S., 2008. Feedback effects between

similarity and social influence in online communities.

In KDD '08, pages 160-168, New York, NY, USA.

ACM.

Granovetter, M. S., 1973. The strength of weak ties. The

American Journal of Sociology, 78(6):1360-1380.

Granovetter, M., 1978. Threshold models of collective

behavior. The American Journal of Sociology,

83(6):1420-1443.

Goldenberg, J., Libai, B., and Mullen, E., 2001. Talk of

the network: a complex systems look at the underlying

process of word-of-mouth. Marketing Letters,

12(3):211-223.

Hartline, J., Mirrokni, V., and Sundararajan, M., 2008.

Optimal marketing strategies over social networks. In

WWW '08, pages 189-198, New York, NY, USA.

ACM Press.

Immorlica, N., Kleinberg, J., Mahdian, M., and Wexler,

T., 2007. The role of compatibility in the diffusion of

tecfhnologies through social networks. In Proc. 8

EC,

pages 75-83.

Kempe, D., Kleinberg, J., and Tardos, Ã., 2003.

Maximizing the spread of influence through a social

network. In KDD '03, New York, NY, USA. ACM.

Kleinberg, J., 2007. Cascading Behavior in Networks:

Algorithmic and Economic Issues. Cambridge

University Press.

Katona, Z., Zubcsek, P. P., and Sarvary, M., 2009.

Network effects and personal influence: Diffusion of

an online social network. Working paper.

Leskovec, J., Adamic, L. A., and Huberman, B. A., 2006.

The dynamics of viral marketing. In EC '06:

Proceedings of the 7th ACM conference on Electronic

commerce, pages 228-237, New York, NY, USA.

ACM.

Leskovec, J., Singh, A., and Kleinberg, J., 2006. Patterns

of influence in a recommendation network. pages 380-

389.

Leskovec, J., McGlohon, M., Faloutsos, C., Glance, N.,

and Hurst, M., 2007. Cascading behavior in large blog

graphs.

Liben-Nowell, D. and Kleinberg, J., 2008. Tracing

information flow on a global scale using internet

chain-letter data. Proceedings of the National

Academy of Sciences, 105(12):4633-4638.

Morris, S., 2000. Contagion. Review of Economic Studies,

67:57-68.

Mcpherson, M., Lovin, L. S., and Cook, J. M., 2001. Birds

of a feather: Homophily in social networks. Annual

Review of Sociology, 27(1):415-444.

Onnela, J. P., Saramäki, J., Hyvönen, J., Szabó, G., Lazer,

D., Kaski, K., Kertész, J., Barabási, A.-L., 2007.

Structure and tie strengths in mobile communication

networks. Proceedings of the National Academy of

Sciences, 104(18):7332-7336.

Van den Bulte, C., Stremersch, S., 2004. Social Contagion

and income heterogeneity in new product diffusion: A

meta-analytic test. Marketing Science, 23(4):530-544.

ICSOFT 2010 - 5th International Conference on Software and Data Technologies

494