a PLR model based on clinical knowledge regarding
the relationship between IOP and the rate of glauco-
matous progression (AGIS Investigators, 2010; Col-
laborative Normal-Tension Glaucoma Study Group,
1998; Satilmis, M. et al., 2003). Our method can
also deal with the external heterogeneity and medical-
data-structure-difficulty by incorporating a collective
method. Therefore, our method is a novel extension
of previous collective methods from both theoretical
and practical aspects, which increases prediction ac-
curacy. Similarly, other methods (Maya, S. et al.,
2014; Murata, H. et al., 2014) are expected to be im-
proved by incorporation of our method.
Medical datasets are commonly plagued by high
levels of heterogeneity, and we have here proposed
a new method that shows good performance in over-
coming this heterogeneity in a glaucoma dataset for
effective predictions of disease progression. We be-
lieve that our method can be extended to tackle simi-
lar difficulties in other medical datasets and we have
provided standardized directions for such analyses.
ACKNOWLEDGEMENTS
We thank Mr. Fujino and Ms. Taketani at the De-
partment of Ophthalmology,The University of Tokyo,
for their useful advice. This work was supported by
CREST, JST.
REFERENCES
AGIS Investigators (2010). The advanced glaucoma in-
tervention study (AGIS): 7. the relationship between
control of intraocular pressure and visual field de-
terioration. American Journal of Ophthalmology,
130:429–440.
Akaike, H. (1973). Information theory and an extension
of the maximum likelihood principle. In Proceedings
of the 2nd International Symposium on Information
Theory, pages 267–281.
Collaborative Normal-Tension Glaucoma Study Group
(1998). Comparison of glaucomatous progression be-
tween untreated patients with normal-tension glau-
coma and patients with therapeutically reduced in-
traocular pressures. American Journal of Opthalmol-
ogy, 126(4):487–497.
Fujino Y., Murata H., Mayama C., and Asaoka R. (2015).
Applying “lasso” regression to predict future visual
field progression in glaucoma patients. Investigative
Ophthalmology & Visual Science, 56(4):2334–2339.
Holmin, C. and Krakau, C. E. T. (1982). Regression anal-
ysis of the central visual field in chronic glaucoma
cases. Acta Ophthalmologica, 60(2):267–274.
Kingman, S. (2004). Glaucoma is second leading cause of
blindness globally. Bulletin of the World Health Or-
ganization, 82(11):887–888.
Liang, Z., Tomioka, R., Murata, H., Asaoka, R., and Ya-
manishi, K. (2013). Quantitative prediction of glau-
comatous visual field loss from few measurements.
In Proceedings of the 2013 IEEE 13th International
Conference on Data Mining 2013, pages 1121–1126.
Maya, S., Morino, K., and Yamanishi, K. (2014). Predicting
glaucoma progression using multi-task learning with
heterogeneous features. In Proceedings of the 2014
IEEE International Conference on Big Data, pages
261–270.
Maya, S., Morino, K., and Yamanishi, K. (2015). Discov-
ery of glaucoma progressive patterns using hierarchi-
cal MDL-based clustering. In Proceedings of the 21st
ACM SIGKDD Conference on Knowledge Discovery
and Data Mining, pages 1979–1988.
Mehta, M., Rissanen, J., and Agrawal, R. (1995). MDL-
based decision tree pruning. In Proceedings of the
1st ACM SIGKDD Conference on Data Mining, pages
216–221.
Morino, K., Hirata, Y., Tomioka, R., Kashima, H., Ya-
manishi, K., Hayashi, N., Egawa, S., and Aihara,
K. (2015). Predicting disease progression from short
biomarker series using expert advice algorithm. Sci-
entific Reports, 5:8953.
Murata, H., Araie, M., and Asaoka, R. (2014). A new ap-
proach to measure visual field progression in glau-
coma patients using variational bayes linear regres-
sion. Investigative Ophthalmology & Visual Science,
55:8386–8392.
Natarajan, R. and Pednault, E. (2002). Segmented regres-
sion estimators for massive data sets. In Proceedings
of the 2nd SIAM International Conference on Data
Mining, pages 566–582.
Quigley, H. A. and Broman, A. T. (2006). The number of
people with glaucoma worldwide in 2010 and 2020.
British Journal of Ophthalmology, 90(3):262–267.
Rissanen, J. (1986). Stochastic complexity and modeling.
Annals of Statistics, 14(3):1080–1100.
Robnik-
ˇ
Sikonja, M. and Kononenko, I. (1998). Pruning re-
gression trees with MDL. In Proceedings of the 13th
European Conference on Artificial Intelligence, pages
455–459.
Satilmis, M., Org¨ul, S., Doubler, B., and Flammer, J.
(2003). Rate of progression of glaucoma correlates
with retrobulbar citation and intraocular pressure.
American Journal of Ophthalmology, 135(5):664–
669.
Schwarz, G. (1978). Estimating the dimension of a model.
Annals of Statistics, 6(2):461–464.
Taketani Y., Murata H., Fujino Y., Mayama C., and Asaoka
R. (2015). How many visual fields are required to pre-
cisely predict future test results in glaucoma patients
when using different trend analyses? Investigative
Ophthalmology & Visual Science, 56(6):4076–4082.
Vogel, D. S., Asparouhov, O., and Scheffer, T. (2007). Scal-
able look-ahead linear regression trees. In Proceed-
ings of the 13th ACM SIGKDD Conference on Knowl-
edge Discovery and Data Mining, pages 757–764.
Zhu, H., Russell, R. A., Saunders, L. J., Ceccon, S.,
Garway-Health, D. F., and Crabb, D. P. (2014). De-
tecting changes in retinal function analysis with non-
stationary weibull error regression and spatial en-
hancement (ANSWERS). PLOS ONE, 9(1):e85654.