and make it, and the datasets that we use, publicly
available. Our MOR framework is much more com-
putationally efficient compared to a binary classifica-
tion approach. Note that although in this study we
are primarily concerned with the automated reason-
ing domain, our method is general enough to be ap-
plicable to ranking tasks in bioinformatics, natural
language processing, information retrial, etc. Third,
we compare our framework with existing premise se-
lection algorithms on three different datasets. The
experiments show that our method significantly out-
performs the existing algorithms, in particular on the
harder problems.
In the future, we will first extract and then uti-
lize feature representations of the premises in order
to improve the ranking performance of the proposed
algorithm. So far, we were only concerned with rel-
atively small problems. Our biggest dataset had only
1020 distinct premises. Eventually, we would like to
use our algorithm efficiently over datasets with tens
or even hundreds of thousands of premises. Our fi-
nal goal is to incorporate the developed algorithm into
open source ATP systems which hopefully leads to
notable benefits both in terms of accuracy and effi-
ciency.
ACKNOWLEDGEMENTS
We acknowledge support from the Netherlands Orga-
nization for Scientific Research, in particular Learn-
ing2Reason and a Vici grant (639.023.604).
REFERENCES
Agarwal, S. (2005). A study of the bipartite ranking prob-
lem in machine learning. PhD thesis, University of
Illinois at Urbana-Champaign, Champaign, IL, USA.
AAI3182211.
Bancerek, G., Endou, N., and Sakai, Y. (2001). On the char-
acterizations of compactness. Journal of Formalized
Mathematics, 13(4):733–738.
Carlson, A., Cumby, C., Rizzolo, N., and
Rosen, J. (1999). SNoW user manual.
http://scholar.google.com/scholar?hl=en&btnG=Searc
h&q=intitle:SNoW+User+Manual#1.
Cortez, C. and Mori, M. (2004). AUC optimization vs. error
rate minimization. Advances in Neural Information
Processing Systems, 16:313.
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer,
T. K., and Harshman, R. (1990). Indexing by latent
semantic analysis. Journal of the American Society
for Information Science, 41(6):391–407.
Dem
ˇ
sar, J. (2006). Statistical comparisons of classifiers
over multiple data sets. The Journal of Machine
Learning Research, 7:1–30.
D’Silva, V., Kroening, D., and Weissenbacher, G. (2008).
A survey of automated techniques for formal soft-
ware verification. Computer-Aided Design of Inte-
grated Circuits and Systems, IEEE Transactions on,
27(7):1165–1178.
Fuernkranz, J. and Huellermeier, E. (2011). Preference
Learning. Springer Berlin Heidelberg, Berlin, Hei-
delberg.
Gonthier, G. (2008). Formal Proof – The Four-Color The-
orem. Notices of the American Mathematical Society,
55(11):1382–1393.
Hales, T. (2005). A proof of the Kepler conjecture. Annals
of Mathematics, 162(3):1065–1185.
Hales, T. (2008). A Special Issue on Formal Proof. Notices
of the American Mathematical Society, 55(11).
Hurd, J. (2003). First-order proof tactics in higher-order
logic theorem provers. Technical report, Design
and Application of Strategies/Tactics in Higher Order
Logics, number NASA/CP-2003-212448 in NASA
Technical Reports.
Klein, G., Elphinstone, K., Heiser, G., Andronick, J., Cock,
D., Derrin, P., Elkaduwe, D., Engelhardt, K., Kolan-
ski, R., Norrish, M., Sewell, T., Tuch, H., and Win-
wood, S. (2009). seL4: Formal Verification of an
OS Kernel. In Anderson, T., editor, Proceedings of
the CICM Workshop on Empirically Successful Au-
tomated Reasoning in Mathematics, pages 207–220,
Big Sky, USA. ACM Press.
McCune, W. (1997). Solution of the Robbins Problem.
Journal of Automated Reasoning, 19(3):263–276.
Meng, J. and Paulson, L. C. (2008). Translating Higher-
order Problems to First-order Clauses. Journal of Au-
tomated Reasoning, 40(1):35–60.
Phillips, J. and Stanovsky, D. (2008). Automated Theo-
rem Proving in Loop Theory. In Sutcliffe, G., Colton,
S., and Schulz, S., editors, Proceedings of the CICM
Workshop on Empirically Successful Automated Rea-
soning in Mathematics, pages 42–54, Birmingham,
United Kingdom.
Poggio, T. and Smale, S. (2003). The Mathematics of
Learning : Dealing with Data. Notices of the Amer-
ican Mathematical Society, 50(5):537–544.
Rifkin, R., Yeo, G., Poggio, T., Rifkin, R., Yeo, G., and
Poggio, T. (2003). Regularized Least-Squares Classi-
fication, chapter 131-154. IOS Press.
Roederer, A., Puzis, Y., and Sutcliffe, G. (2009). Divvy:
An ATP Meta-system Based on Axiom Relevance Or-
dering. Automated Deduction - CADE-22, pages 157–
162.
Schoelkopf, B., Herbrich, R., Williamson, R., and Smola,
A. J. (2001). A Generalized Representer Theorem. In
Helmbold, D. and Williamson, R., editors, Proceed-
ings of the 14th Annual Conference on Computational
Learning Theory, pages 416–426, Berlin, Germany.
Shawe-Taylor, J. and Cristianini, N. (2004). Kernel Meth-
ods for Pattern Analysis, volume 47. Cambridge Uni-
versity Press.
KDIR 2011 - International Conference on Knowledge Discovery and Information Retrieval
50