ular, starting from a baseline with direct links, we
tried learning models that included rules for the di-
rect links and rules for the co-citation links. The non-
MLN algorithms used the extra information to im-
prove (e.g., from 52.7% to 69.5% for ICA
C
), but the
best MLN algorithm (MCMC) actually lost accuracy
(from 56.6% to 45.0%). Thus, given data with com-
plex links, MLNs may sometimes outperform other
techniques, but at other times may struggle with learn-
ing based on a complex rule set.
These observations suggest that MLN learning re-
mains a challenging problem, at least for CC and
likely for other tasks as well. Future work should fur-
ther consider the impact of training set size on MLN
learning, explore the effects of MLN structure learn-
ing, and evaluate other recent weight learning algo-
rithms (e.g., (Huynh and Mooney, 2009)) for MLNs.
ACKNOWLEDGEMENTS
Thanks to David Aha, Bryan Auslander, Ryan Rossi,
and the anonymous referees for comments that helped
to improve this work. Portions of this analysis were
conducted using Proximity, an open-source software
environment developed by the Knowledge Discov-
ery Laboratory at the University of Massachusetts
Amherst (http://kdl.cs.umass.edu/proximity/). This
work was supported in part by the U.S. National Sci-
ence Foundation under award number 1116439.
REFERENCES
Chakrabarti, S., Dom, B., and Indyk, P. (1998). Enhanced
hypertext categorization using hyperlinks. In Proc. of
the ACM SIGMOD International Conference on Man-
agement of Data.
Chechetka, A., Dash, D., and Philipose, M. (2010). Rela-
tional learning for collective classification of entities
in images. In AAAI-10 Workshop on Statistical Relat.
AI.
Crane, R. and McDowell, L. K. (2011). Evaluating markov
logic networks for collective classification. In Proc.
of the 9th MLG Workshop at the 17th ACM SIGKDD
Conference on Knowledge Discovery and Data Min-
ing.
Dhurandhar, A. and Dobra, A. (2010). Collective vs inde-
pendent classification in statistical relational learning.
In Submitted for publication.
Gallagher, B., Tong, H., Eliassi-Rad, T., and Faloutsos,
C. (2008). Using ghost edges for classification in
sparsely labeled networks. In Proceeding of the 14th
ACM SIGKDD International Conference on Knowl-
edge Discovery and Data Mining.
Huynh, T. and Mooney, R. (2009). Max-margin weight
learning for markov logic networks. In Proc. of the
European Conf. on Machine Learning and Knowledge
Discovery in Databases, ECML PKDD.
Jensen, D., Neville, J., and Gallagher, B. (2004). Why col-
lective inference improves relational classification. In
Proc. of the 10th ACM SIGKDD International Confer-
ence on Knowledge Discovery and Data Mining.
Kok, S. and Domingos, P. (2010). Learning markov logic
networks using structural motifs. In Proc. of the 27th
International Conference on Machine Learning.
Kok, S., Sumner, M., Richardson, M., Singla, P., Poon,
H., and Domingos, P. (2006). The Alchemy system
for statistical relational AI (Technical Report). Depart-
ment of Computer Science and Engineering, Univer-
sity of Washington, Seattle, WA.
Lin, F. and Cohen, W. W. (2010). Semi-supervised classifi-
cation of network data using very few labels. In Proc.
of the Int. Conference on Advances in Social Network
Analysis and Mining (ASONAM).
Lowd, D. and Domingos, P. (2007). Efficient weight
learning for markov logic networks. In In Proc. of
the Eleventh European Conference on Principles and
Practice of Knowledge Discovery in Databases.
Macskassy, S. and Provost, F. (2007). Classification in net-
worked data: A toolkit and a univariate case study.
Journal of Machine Learning Research, 8:935–983.
McDowell, L., Gupta, K., and Aha, D. (2009). Cautious col-
lective classification. The Journal of Machine Learn-
ing Research, 10:2777–2836.
McDowell, L. K., Gupta, K. M., and Aha, D. W. (2007).
Cautious inference in collective classification. In
Proc. of the 22nd AAAI Conference on Artificial In-
telligence.
Mihalkova, L. and Mooney, R. (2007). Bottom-up learning
of Markov logic network structure. In Proc. of the
24th International Conference on Machine Learning.
Mihalkova, L., Moustafa, W., and Getoor, L. (2011). Learn-
ing to predict web collaborations. In Workshop on
User Modeling for Web Applications (UMWA-11).
Neville, J. and Jensen, D. (2000). Iterative classification in
relational data. In AAAI Workshop on Learning Statis-
tical Models from Relational Data.
Neville, J. and Jensen, D. (2007). Relational dependency
networks. Journal of Machine Learning Research,
8:653–692.
Poon, H. and Domingos, P. (2006). Sound and efficient
inference with probabilistic and deterministic depen-
dencies. In Proc. of the 21st AAAI Conference on Ar-
tificial Intelligence.
Richardson, M. and Domingos, P. (2006). Markov logic
networks. Machine Learning, 62(1-2):107–136.
Riedel, S. and Meza-Ruiz, I. (2008). Collective seman-
tic role labelling with markov logic. In Proc. of the
Twelfth Conference on Computational Natural Lan-
guage Learning, CoNLL ’08.
Sen, P., Namata, G., Bilgic, M., Getoor, L., Gallagher, B.,
and Eliassi-Rad, T. (2008). Collective classification in
network data. AI Magazine, Special Issue on AI and
Networks, 29(3):93–106.
Singla, P. and Domingos, P. (2005). Discriminative training
of Markov logic networks. In Proc. of the National
Conference on Artificial Intelligence.
INVESTIGATING MARKOV LOGIC NETWORKS FOR COLLECTIVE CLASSIFICATION
15