A Machine Learning Approach to Identify Dependencies Among

Learning Objects

Carlo De Medio

, Fabio Gasparetti

, Carla Limongelli

, Filippo Sciarrone

and Marco Temperini

Sapienza University of Rome, Dept. of Computer, Control and Management Engineering,

Via Ariosto, 25 - 00189 Rome, Italy

Engineering Department, Artiﬁcial Intelligence Laboratory, Roma Tre University,

Via della Vasca Navale, 79, 00146 Rome, Italy

Keywords:

E-learning, Data Mining, Wikipedia.

Abstract:

Selecting and sequencing a set of Learning Objects (LOs) to build a course may turn out to be quite a challeng-

ing task. In this paper we focus on such an aspect, related to the veriﬁcation and respect of the relationships of

pedagogical dependence existing between two LOs added to a course (meaning that if a given LO has another

one as “pre-requisite”, then any sequencing of the LOs in the course will need to have the latter LO taken by

the learners before of the former). In our approach the sequencing of LOs in the course can still be managed

by the instructor, basing on her/his taste and preferences, yet s/he can also be helped by a set of suggestions,

related to the pre-requisite relationships existing among the LOs selected for the course. Such suggestions

(such relationships, in effect) can be computed automatically and provide the instructor with signiﬁcant help

and guidance. We show a light-weight formalization of the LO, and how it can be “represented” by a set of

WikiPedia Pages (“topics”); then we show how such set of topics, together with a set of relevant hypotheses we

previously deﬁned, can help establish the dependence relationship existing between two LOs. In this endeavor

we exploit the classiﬁcation in categories available for the WikiPedia topics, and obtain interesting results for

our framework, in terms of precision and recall of the dependence relationships.

1 INTRODUCTION

When an instructor is faced with the task of building

or maintaining a web-based course, her/his work can

be complex, in several respects. On the one hand,

the availability of an ever increasing amount of edu-

cational material on the web offers a problem of sheer

quantity of possible choices. An idea of how vast and,

possibly, confusing the selection task for an instruc-

tor can be, is given by simply limiting the consider-

ation to learning material available under the formal-

ized shape of standardized Learning Objects (LOs),

from Learning Object Repositories (LORs) such as

Connexion

, Ariadne

, or Merlot

. In this kind of

task, recommender and ﬁltering tools might be of

help (Revilla Mu˜noz et al., 2015; Limongelli et al.,

2010; Limongelli et al., 2012; Limongelli et al., 2015;

Connexions is a Learning Object Repository, available

at http://www.cnx.org (Accessed July 24 2015).

Ariadne Foundation, available at http://ariadne-eu.org

(Accessed July 24 2015).

Merlot is a Learning Object Repository, available at

http://www.merlot.org (Accessed July 24 2015)

Limongelli et al., 2016).

On the other hand, one of the main responsibilities

of the instructor, while assembling a course through

LOs, is to ensure the fundamentalpedagogicalaspects

of the course, such as the preservation of the exist-

ing relationships of dependence between two LOs in

the course: in other words, to ensure that a LO

the course is depending on another object LO

in the

course (i.e. “LO

has LO

as a prerequisite”), then

will precede LO

in every admissible sequencing

of the course’s LOs.

How the instructor sequences the learning mate-

rial strongly depends on the actual learning content

of the LOs, as well as on a teacher’s taste/preference.

Nevertheless, having automated suggestions on how

certain LOs should be necessarily sequenced, in or-

der to preserve dependency relationships, can be of

great help for the instructor, since it can ease a part

of the selection and sequencing task, and allow the

instructor to focus on less automatable aspects.

During course construction the instructor usually

labels each selected LO with a set of pre-requisite

concepts (knowledge and/or skills that will deﬁnitely

Medio, C., Gasparetti, F., Limongelli, C., Sciarrone, F. and Temperini, M.

A Machine Learning Approach to Identify Dependencies Among Learning Objects.

In Proceedings of the 8th International Conference on Computer Supported Education (CSEDU 2016) - Volume 1, pages 345-352

ISBN: 978-989-758-179-3

345

need be covered prior to the learner engaging the LO).

These pre-requisites statements provide a set of con-

straints that have to be veriﬁed during the course se-

quencing deﬁnition. Because of the intrinsic depen-

dence of such constraints on the LOs’ contents, nei-

ther their deﬁnition, nor their evaluation is a trivial

task, the former possibly being quite time-consuming

and error-prone.

In a previous work (Gasparetti et al., 2015b) we

have proposed six hypotheses that help to decide,

given two LOs, their prerequisite relationship. We

proposed and validated the hypotheses (described in

Section 3) and showed that, although these are statis-

tically signiﬁcant, we get a low precision, at most 0.4.

In this paper, starting from the results of the pre-

vious work, we propose a different study based on

a traditional machine learning approach (Mitchell,

1997). We expand the set of data on which the anal-

ysis is done, and identify some features derived from

the above assumptions and feed the training system

(Weka).

The protocol of application of the criteria/features,

proceeds by passing the learning objects of a course

through Wikipedia Miner (Milne and Witten, 2013),

collect data and then feed Weka (Hall et al., 2009) in

such a way to obtain a set of suggestions about the

possible dependence relationships among LOs and

wiki pages. The results show good precision and re-

call values, increasing with the number of LOs em-

ployed in the course. This led us to think that our pro-

tocol can provide the teacher with valuable sugges-

tions about the possible sequencing of their learning

material.

In the following section we present some related

work. Then, in Section 3 we illustrate the six revised

hypotheses on which we base the inference of depen-

dence relationships; in this section we also provide

a lightweight formal background to discuss the char-

acteristics of a LO, based on its representation as set

of Wikipedia Pages (topics). Here we also discuss

the quantitative features computed for a LO during

the relationships inference. Section 4 presents the re-

sults obtained through the use of Weka (Hall et al.,

2009), that we trained with ﬁve different sets of LOs.

Section 5 comments on the experimental results and

proposes how to reinforce and generalize the present

results.

2 RELATED WORK

Associating learning material, of diverse origin, in

one’s own course material is a delicate task, since

the learning material is often not to be treated as a

mere additive on the activities proposed to students,

yet the new resources have to undergo some pedagog-

ical adaptation.

Curriculum Sequencing is an interesting challenge

in the educational research area: research in this

ﬁeld aims to automatically produce a personalized se-

quence of didactic materials or activities (Brusilovsky

and Vassileva, 2003), or to allow a sequence of in-

structional material ”on demand”, by means of sys-

tems that deliver training to workers when and where

they need it (Capuano et al., 2009). Our proposed ap-

proach focuses on the possibility that the learning ob-

jects through which the teacher makes her selection

have already been labeled as to their possible depen-

dency relationships. The teacher has full control of

such dependency determinations: s/he can still accept

or discard such suggestions. On the other hand, such

hints can also simply add information that the teacher

had not noticed, resulting even more fruitful. Such as-

sistance is admittedly speciﬁc, even narrow, yet it can

still be of good use and let the teacher save time to be

dedicated to other pedagogical aspects.

Resources from the Web, and in particular from

Wikipedia, have been investigated as a source for

enrichment of the learning contents of a LMS or

LOR (Parker and Chao, 2007; Cole, 2009; Stuurman

et al., 2012; Gentili et al., 2001; Sciarrone, 2013).

Other kinds of resources have also been considered

for automated treatment and inclusion in the learning

resources of a course, such as the podcasts (Cebeci

and Tekdal, 2006), and the use of wikis also as means

for contribution from students into the course’s mate-

rial (Allen and Tay, 2012; Sun and Qiu, 2014).

The task of including such external material in a

course remains, however, a hard one, both on a tech-

nical and pedagogical level.

Wikipedia provides a wealth of information and

possibly learning resource documents, so the idea of

using such resources to enrich a course’s learning ma-

terial is compelling. In addition Wikipedia is instru-

mented on the web, so that the analysis and evalua-

tion of semantic relationships between documentsand

concepts is supported, down to the comparison with

words or text excerpts (Strube and Ponzetto, 2006;

Gabrilovich and Markovitch, 2009; Milne and Wit-

ten, 2008; Biancalana et al., 2013). This can allow

for an effective treatment of the wikipedia contents in

advance of their uses to support learning in a formal

environment.

In this paper we propose a method to exploit the

informative resources of weak-semantic taxonomies,

in particular Wikipedia, to allow sequencing LOs with

external resources, and to make the LOs enrichment

possible, by annotating them through the external re-

CSEDU 2016 - 8th International Conference on Computer Supported Education

346

sources.

To our knowledge our approach is novel, yet we

have found correspondences in literature.

An approach to the identiﬁcation of prerequisite

relationships among “knowledge components” is in

(Scheines et al., 2014), where causal discovery is used

on components represented as latent (unmeasured)

variables. To validate the approach, simulated data

are used, representing a dataset of student-skills mea-

sures.

Voung et al. (Vuong et al., 2011) propose to an-

alyze large-scale assessment to determine the depen-

dency relationships between knowledge units. Given

sufﬁcient user data, the authors prove that prerequi-

sites for each instructional unit can be identiﬁed. On

the contrary, the methodology cannot be applied to

new curriculums, that is, units to which student per-

formances have not been extensively evaluated.

Recently (Gasparetti et al., 2015c) proposed an

early attempt to exploit Wikipedia as a source of

learning materials. Analyzing the links present in

the Wikipedia pages, they build courses based on the

Grasha teaching styles and on a social didactic ap-

proach. A further early attempt to exploit Wikipedia

for the sequencing task has been proposed in (Gas-

paretti et al., 2015a).

3 HYPOTESES AND FEATURES

FOR COMPARING LOS

In the previous work (Gasparetti et al., 2015b) we

have proposed an approach for extracting prerequi-

site relations from textual LOs by means of wikipedia

miner (Milne and Witten, 2013) that is a tool to access

to Wikipedia’s structure and content.

The process goes from information extraction out

of the involved LOs, through determination of the

WikiPedia pages (topics henceforth) that are associ-

ated to each one of the LOs, up to the analysis of a set

of features of such topics, allowing to establish the

possible dependency relationships between the LOs.

It can be summarized as follows:

• given the learning objects LO

and LO

, the

Wikipedia Miner Toolkit is activated on them;

• for each LO the textual content is extracted

and analyzed, so as to pair portions of it

with annotations, relating to categories in the

Wikipedia taxonomy (made by metatags inside

the Wikipedia’s pages and perfected by the com-

munity of Wikipedia, it is the division into cat-

egories of the information; the system generates

a graph of the categories that can be questioned

without fetch the whole page);

• for each LO, the set of annotations is used to relate

the LO to a set of topics;

• then we apply certain criteria of evaluation to the

two sets of topics associated to LO

and LO

;

• we infer the existence of dependencyrelationships

on the basis of a set of hypotheses.

The dependency relation of prerequisites is expressed

as LO

→ LO

meaning that LO

is a prerequisite for

In the following we provide a revised set of hy-

potheses and explain the features of the topics asso-

ciated with the LOs, which we use to test such Hy-

potheses.

if the distance that connects the Wikipedia two

categories to which LO

and LO

belong respec-

tively (even through a common ancestor category)

is less than a given threshold, the prerequisite re-

lation exists. The threshold is set to 2, with val-

ues higher than 2 the execution of the program

slows down dramatically and the performance is

not improved. The hypothesis is inspired by the

semantic similarity deﬁned over a IS-A taxonomy

proposed by Resnik (Resnik, 2011).

more general topics need longer discussions to be

described, compared to very speciﬁc ones.

if a Wikipedia author has found the necessity

to deﬁne a concept by means of other concepts,

probably the former one is more speciﬁc than the

latter.

H4 more general concepts create several connections

to speciﬁc ones, especially if the author wants to

give an overview of the domain leaving it up to the

reader to deepen discussions on specialized pages.

H5 In this hypothesis we extract the nouns from the

two articles and assume that nouns in a Wikipedia

article correspond to concepts. Therefore, articles

dealing with multiple concepts should be consid-

ered more general.

H6 The last hypothesis analyzes the length of LOs in

terms of number of words included in the descrip-

tion (i.e., ﬁrst paragraph) of the articles. If the

number of words of the article associated with LO

is much greater than the ones of LO

, LO

→ LO

exists. Again, we follow a similar rationale of

the hypotheses H4 and H5, but the computational

complexity is limited in this case.

A Machine Learning Approach to Identify Dependencies Among Learning Objects

347

In the proposed approach, the features the hypotheses

are based upon are considered as an input of a ma-

chine learning algorithms. Eleven features represent-

ing relevant aspects of each LO’s are deﬁned. Before

describing the features we introduce some deﬁnitions.

3.1 Formal Background

Following a trend used in several sources, a

WikiPedia Page associated to a LO will be named

”topic”.

In a topic anchors are spread, to connect such re-

source to other topics. Such references will be named

“links”.

For our purposes a link is then a URI (Uniform

Resource Identiﬁer, i.e. a usual web address). Notice

that another signiﬁcant link is the topic address.

As a web resource, the topic comprises multime-

dia, yet in our approach we will consider only its tex-

tual contents; such textual contents are basically com-

posed by all the topic’s text and links to other topics.

We differentiate between simple word and a nouns ,

they will be analyzed separately as suggested in H5.

Moreover, a Wikipedia page usually comprises a

ﬁrst section, containing a summary of relevant aspects

of the page.

Here are some deﬁnitions for understanding the

formal background:

T the set of all the topics t associated to a LO

L the set of all the links to other topics in t: t.L

W the set of all the words used in t: t.W

N the number of the links to other topics in t: t.N

addr the topic address (another link): t.addr

FS the ﬁrst section of the topic in its usual web-

publication: t.FS. It comprises sets of links,

nouns and words, that we consider subsets of,

resp., the links, words and nouns of the topic:

- set of links in the ﬁrst section: t.FS.L ⊆ t.L

- set of words in the ﬁrst section: t.FS.W ⊆ t.W

- set of nouns in the ﬁrst section: t.FS.N ⊆ t.N

3.2 Features of a LO

Given two learning objects LO

and LO

, the features

can be formalized as follows:

1. avgLen(LO

): The average length of the text of the

Wikipedia topics associated to LO

deﬁned in terms of

words obtained by a text tokenization process.

avgLen(LO

) =

∑

t∈T

|t.W|

2. avgLen(LO

): Similar to avgLen(LO

) but evaluated on

avgLen(LO

) =

∑

t∈T

|t.W|

3. f sl(LO

): number of links in the ﬁrst section of the

Wikipedia topics associated with LO

fsl(LO

) = t. FS.L

4. f sl(LO

): number of links in the ﬁrst section of the

Wikipedia topics associated with LO

fsl(LO

) = t. FS.L

5. avgNL(LO

): The average number of links in the topics

associated to LO

avgNL(LO

) =

∑

∀t∈T

t.L

|T|

6. avgNL(LO

): Similar to avgNL(LO

) but evaluated on

7. nouns(LO

): Number of distinct nouns in LO

extracted

by the part-of-speech tagger.

8. nouns(LO

): Similar to nouns(LO

) but on LO

9. nounsIntersect(LO

, LO

): Given the two sets of nouns

and N

extracted from LO

and LO

nounsIntersect(LO

, LO

) = |N

∪ N

10. avgFsLen(LO

): The average length of the text of the

Wikipedia topics associated to LO

deﬁned in terms of

words obtained by the tokenization process limited to

the ﬁrst section of the topics

avgFsLen(LO

) =

∑

∀t∈T

t.FS.W

|T|

11. avgFsLen(LO

): Similar to avgFsLen(LO

) but evalu-

ated on LO

All the features are represented by real or integer

numbers.

4 EXPERIMENTAL RESULTS

The system, programmed to operate on the features

described above, was tested through WEKA (Waikato

Environment for Knowledge Analysis), a free soft-

ware developed by the University of Waikata in new

Zealand which allows to apply Machine Learning al-

gorithms on large data-sets. In our case the training

set consists of 5 courses structured as shown in Ta-

ble 1.

The courses were chosen in order to have sam-

ples differing both in terms of number of LOs and in

terms of content and materials. Some of the LOs con-

tent presents a very formal language (e.g.: Program-

ming Language (Java) and Advanced Computer Sci-

ence) and other LOs show historical references and

CSEDU 2016 - 8th International Conference on Computer Supported Education

348

Table 1: Description of test courses.

Course N. of LOs N. of expected

Name dependences

Advanced 85 2256

Computer Science

Programming 18 41

Language (Java)

Economy 5 10

Futurism 4 5

Basic 4 5

Mathematics

descriptive content (e.g.: Futurism). The expected de-

pendencies are the relationships between prerequisite

and successor concepts represented by LOs. The LOs

are represented by text ﬁles containing the entire text

of the lessons; the system is implemented so as to ac-

cept

html

pages, automatically retrieved (or not) by

the network. They are parsed and return the textual

ﬁle automatically. The entire data-set was classiﬁed

by the algorithm J48, an open source implementation

of the algorithm C4.5, which in the ﬁrst tests was the

most encouraging one. Other algorithms are appli-

cable to data-sets but require larger data sets to see

signiﬁcant results. The measures taken from the per-

formance analysis of the implemented system are the

following:

• π Precision in [0..1]: high precision means that

an algorithm returned more relevant results than

irrelevant results;

• ρ Recall in [0..1]: high recall means that an algo-

rithm returned most of the relevant results;

• F1 F-measure is a combination of precision and

recall, namely the harmonic mean;

• ROC Is the area under the ROC curve normalized

between [0..1] and it is equal to the probability

that a classiﬁer will rank a randomly chosen posi-

tive instance higher than a randomly chosen neg-

ative one.

In addition, given a pair of LOs: LO

and LO

distinguish two classes results for the classiﬁer:

• Class 1 (Cl1) Set of all pairs: LO

, LO

for which

there is the prerequisite relation LO

→ LO

;

• Class 2 (Cl2) Set of all pairs: LO

, LO

for which

there isn’t any prerequisite relation.

We carried out tests divided into courses; for each

test we show the measures described above and the

confusion matrix. This matrix is a speciﬁc table lay-

out that allows to visualize the performance of an al-

gorithm: each column of the matrix represents the

instances classiﬁed by the system for a given class,

while each row represents the instances in the ex-

pected class. The classiﬁer works on the two classes

described above, so the corresponding confusion ma-

trix will be 2 X 2; on the primary diagonal we will

ﬁnd the number of instances correctly classiﬁed: True

Positives (a prerequisite relation does exist and it is

found) and True Negatives (a prerequisite relation

does not exist and it is not found). On the secondary

diagonal the classiﬁer errors are reported: False Pos-

itives (a prerequisite relation does not exist while it

is found) and False Negatives (a prerequisite relation

does exist and it is not found). The general confusion

matrix is shown in table 2.

Table 2: Confusion Matrix.

1 2 class

True Positives False Positives 1

False Negatives True Negatives 2

With the course of Advanced Computer Science,

composed of 85 LO , The System has responded with

good values of precision and recall as is shown in Ta-

ble 3.

Table 3: Results for the course Advanced Computer Sci-

ence.

instances n. Correct Wrong

1037 877 160

Cl 1 Cl 2

Precision 0,839 0,853

Recall 0,858 0,833

F1 0,848 0,843

ROC 0,894 0,894

1 2 class

448 74 1

86 429 2

The results for the course Programming Language

(Java) reported in Tab. 4 are lower than the course

of Advanced Computer Science because the second

course has more LOs than the ﬁrst one, even if both

belong to a branch of science that is well categorized

in Wikipedia.

Table 4: Results for the course Programming Language

(Java).

instances n. Correct Wrong

289 256 33

Cl 1 Cl 2

Precision 0,633 0,915

Recall 0,463 0,956

F1 0,535 0,935

ROC 0,743 0,743

1 2 class

19 22 1

11 237 2

A Machine Learning Approach to Identify Dependencies Among Learning Objects

349

The Futurism course’s results are illustrated in Ta-

ble 5. The low number of training instances does not

allow the system to reach acceptable levels of preci-

sion and recall on Class1.

Table 5: Results for the course Futurism.

instances n. Correct Wrong

25 19 6

Cl 1 Cl 2

Precision 0,333 0,818

Recall 0,2 0,9

F1 0,25 0,857

ROC 0,53 0,53

1 2 class

1 4 1

2 18 2

The Economy course was created by taking the

LOs text directly from the Wikipedia pages in order

to see how the system works in an ideal situation. The

performance reported in the Table 6 shows good re-

sults despite the few LOs in the course.

Table 6: Results for the course Economy.

instances n. Correct Wrong

25 21 4

Cl 1 Cl 2

Precision 0,75 0,923

Recall 0,9 0,8

F1 0,818 0,857

ROC 0,87 0,87

1 2 class

9 1 1

3 18 2

In the Basic Mathematics course, results are re-

ported in Table 7. All the zero values are explained

by the fact that the system is unable to ﬁnd a sufﬁ-

cient number of relationships between pairs of LOs.

This is due to the mathematical formalism expressed

with the use of formulas in the text: Wikipedia Miner

service fails to ﬁnd Wikipedia pages associated to the

mathematical formulas and doesn’t associate topics to

concepts.

To see how the system works in a real situation,

with LO taken from courses of different types, a

dataset consisting of all instances of Advanced Com-

puter Science, Programming Language (Java) and Fu-

turism was created. We left out the mathematics

course owing to the low results obtained, as shown

above. The tests are shown with and without the ideal

course of Economy so we can see the peak present

at low numbers of LO’s components in the course.

Table 7: Results for the course Basic Mathematics.

instances n. Correct Wrong

5 4 1

Cl 1 Cl 2

Precision 0 0,8

Recall 0 1

F1 0 0,889

ROC 0,5 0,5

1 2 class

0 1 1

0 4 2

The test performed on the data set composed of 1-

2-3 courses highlights how the training stabilizes the

system levels of precision and recall around the 0.848

using a big training set.

Table 8: Results for the course course 1-2-3.

instances n. Correct Wrong

5498 4660 838

Cl 1 Cl 2

Precision 0,835 0,859

Recall 0,845 0,85

F1 0,84 0,855

ROC 0,895 0,895

1 2 class

2199 403 1

435 2461 2

Here is the trend of results sorted by number of

instances in the dataset of the course.

Figure 1: On the x-axis the courses are ordered by LO nu-

merousness, on the y-axis the value of the precision and

recall.

As expected, the results show an improvement in

system performance with the growth of the size of

the data set used as a training set. Plus, an analysis

of decision trees generated by the algorithm J48 for

each iteration shows that initially the features with a

higher content of information are the two measures

associated with the amount of nouns in the wikipedia

CSEDU 2016 - 8th International Conference on Computer Supported Education

350

page associated to the LO. Just below these measures,

in the decision tree, two other measures are close to

the root: the average length of the initial parts of

wikipedia pages related to the LO, and the number

of the internal links. Including in the graph the results

of the Economy course we have a peak in the graph at

a low level of instances as shown in Fig. 2.

Figure 2: On the x-axis the courses are ordered by LO nu-

merousness, on the y-axis the value of the precision and

recall.

5 CONCLUSIONS

The composition of web based instructional courses,

especially if personalization and adaptivity are sup-

ported (Sterbini and Temperini, 2009; De Marsico

et al., 2013) can result in a burdensome task for the

teacher, encompassing both the selection of suitable

learning objects, and the control on their sequencing.

In this paper we have presented an approach for sup-

porting the teacher in the management of the relation-

ships of dependencies between learning objects: such

relationships can be suggested/discovered automati-

cally, so as to allow the teacher to adopt or change

them. An effective automated determination of such

relationships can also be very useful in contexts of

personalized e-learning, where the learner is proposed

learning objects that are automatically sequenced.

Experimental results presented in this article have

conﬁrmed the suitability of an approach based on the

data, namely a machine learning approach that pro-

vides precious indications that strengthen our work-

ing hypothesis. Obviously, since this approach is data

driven, the provided information may be domain-

dependent.

In order to produce results as independent domain

as possible, in future we will consider resorting to dif-

ferent machine learning approaches (neural networks,

Bayesian networks, etc.) and to substantiate the va-

lidity of our work hypotheses, on a theoretical level

too.

REFERENCES

Allen, M. and Tay, E. (2012). Wikis as individual student

learning tools: The limitations of technology. Int. J.

Inf. Commun. Technol. Educ., 8(2):61–71.

Biancalana, C., Gasparetti, F., Micarelli, A., and Sansonetti,

G. (2013). Social semantic query expansion. ACM

Trans. Intell. Syst. Technol., 4(4):60:1–60:43.

Brusilovsky, P. and Vassileva, J. (2003). Course sequencing

techniques for large-scale web-based education. In-

ternational Journal of Continuing Engineering Edu-

cation and Life-long Learning, 13:75–94.

Capuano, N., Gaeta, M., Orciuoli, F., and Ritrovato, P.

(2009). On-demand construction of personalized

learning experiences using semantic web and web

2.0 techniques. In Advanced Learning Technologies,

2009. ICALT 2009. Ninth IEEE International Confer-

ence on, pages 484–488.

Cebeci, Z. and Tekdal, M. (2006). Using podcasts as au-

dio learning objects. Interdisciplinary Journal of E-

Learning and Learning Objects, 2(1):47–57.

Cole, M. (2009). Using wiki technology to support student

engagement: Lessons from the trenches. Computers

& Education, 52(1):141 – 146.

De Marsico, M., Sterbini, A., and Temperini, M. (2013).

A framework to support social-collaborative person-

alized e-learning. In Kurosu, M. (Ed.) Human-

Computer Interaction (Part II), 15th Int. Conf. on

Human-Computer Interaction, LNCS 8005, pages

351–360. Springer.

Gabrilovich, E. and Markovitch, S. (2009). Wikipedia-

based semantic interpretation for natural language

processing. J. Artif. Int. Res., 34(1):443–498.

Gasparetti, F., Limongelli, C., and Sciarrone, F. (2015a).

A content-based approach for supporting teachers

in discovering dependency relationships between in-

structional units in distance learning environments.

In Human-Computer Interaction. Theories, Methods,

and Tools - 17th International Conference, HCI Los

Angeles, CA, U.S.A., August 2-7, 2015. To appear.

Gasparetti, F., Limongelli, C., and Sciarrone, F. (2015b).

Exploiting wikipedia for discovering prerequisite re-

lationships among learning objects. In Proc. IEEE Int.

Conf. on Information Technology Based Higher Edu-

cation and Training, ITHET2015.

Gasparetti, F., Limongelli, C., and Sciarrone, F. (2015c).

Wiki course builder: a system for retrieving and se-

quencing didactic materials from wikipedia. In Proc.

IEEE Int. Conf. on Information Technology Based

Higher Education and Training, ITHET2015.

Gentili, G., Marinilli, M., Micarelli, A., and Sciarrone, F.

(2001). Text categorization in an intelligent agent for

ﬁltering information on the web. International Jour-

nal of Pattern Recognition and Artiﬁcial Intelligence,

15(3):527–549.

Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann,

P., and Witten, I. H. (2009). The weka data min-

ing software: An update. SIGKDD Explor. Newsl.,

11(1):10–18.

A Machine Learning Approach to Identify Dependencies Among Learning Objects

351

Limongelli, C., Lombardi, M., Marani, A., Sciarrone, F.,

and Temperini, M. (2016). A recommendation mod-

ule to help teachers build courses through the moodle

learning management system. New Review of Hyper-

media and Multimedia, 22(1-2).

Limongelli, C., Miola, A., Sciarrone, F., and Temperini,

M. (2012). Supporting teachers to retrieve and select

learning objects for personalized courses in the moo-

dle ls environment. In Proc. 14th IEEE Int. Conf. on

Advanced Learning Technologies, Workshop SPEL,

pages 518–520. IEEE Computer Society.

Limongelli, C., Sciarrone, F., Starace, P., and Temperini, M.

(2010). An ontology-driven olap system to help teach-

ers in the analysis of web learning object repositories.

Information Systems Management, 27(3):198–206.

Limongelli, C., Sciarrone, F., and Temperini, M. (2015).

A social network-based teacher model to support

course construction. Computers in Human Behavior,

51:1077–1085.

Milne, D. and Witten, I. H. (2008). An effective, low-

cost measure of semantic relatedness obtained from

wikipedia links. In Proceeding of AAAI Workshop

on Wikipedia and Artiﬁcial Intelligence: an Evolving

Synergy, pages 25–30. AAAI Press.

Milne, D. and Witten, I. H. (2013). An open-source toolkit

for mining wikipedia. Artif. Intell., 194:222–239.

Mitchell, T. M. (1997). Machine Learning. McGraw-Hill,

Inc., New York, NY, USA, 1 edition.

Parker, K. and Chao, J. (2007). Wiki as a teaching tool.

Interdisciplinary Journal of E-Learning and Learning

Objects, 3(1):57–72.

Resnik, P. (2011). Semantic similarity in a taxonomy:

An information-based measure and its application to

problems of ambiguity in natural language. CoRR,

abs/1105.5444.

Revilla Mu˜noz, O., Alpiste Penalba, F., and

Fern´andez S´anchez, J. (2015). The skills, com-

petences, and attitude toward information and

communications technology recommender system:

an online support program for teachers with person-

alized recommendations. New Review of Hypermedia

and Multimedia. Published online - Article in Press.

Scheines, R., Silver, E., and Goldin, I. (2014). Discovering

prerequisite relationships among knowledge compo-

nents. In Stamper, J., Pardos, Z., Mavrikis, M., and

McLaren, B., editors, Proceedings of the 7th Interna-

tional Conference on Educational Data Mining, pages

355–356. European Language Resources Association

(ELRA).

Sciarrone, F. (2013). An extension of the q diversity met-

ric for information processing in multiple classiﬁer

systems: A ﬁeld evaluation. International Journal

of Wavelets, Multiresolution and Information Process-

ing, 11(6).

Sterbini, A. and Temperini, M. (2009). Collaborative

projects and self evaluation within a social reputation-

based exercise-sharing system. In Workshop SPEL,

for the IEEE/WIC/ACM Int. Joint Conferences on Web

Intelligence and Intelligent Agent Technologies, WI-

IAT’09, Vol. 3, pages 243–246.

Strube, M. and Ponzetto, S. P. (2006). Wikirelate! com-

puting semantic relatedness using wikipedia. In Pro-

ceedings of the 21st National Conference on Artiﬁcial

Intelligence - Volume 2, AAAI’06, pages 1419–1424.

AAAI Press.

Stuurman, S., van Eekelen, M., and Heeren, B. (2012).

A new method for sustainable development of open

educational resources. In Proceedings of Second

Computer Science Education Research Conference,

CSERC ’12, pages 57–66, New York, NY, USA.

ACM.

Sun, Z. and Qiu, X. (2014). Evaluating the use of wikis for

eﬂ: A case study of an undergraduate english writing

course in china. Int. J. Inf. Technol. Manage., 13(1):3–

14.

Vuong, A., Nixon, T., and Towle, B. (2011). A method for

ﬁnding prerequisites within a curriculum. In Pech-

enizkiy, M., Calders, T., Conati, C., Ventura, S.,

Romero, C., and J. Stamper, J., editors, The 4th In-

ternational Conference on Educational Data Mining

(EDM 2011), pages 211–216.

CSEDU 2016 - 8th International Conference on Computer Supported Education

352