Hinton, G., Osindero, S., and Teh, Y.-W. (2006). A fast
learning algorithm for deep belief nets. Neural com-
putation, 18(7):1527–1554.
IDG Enterprise (2014). Big data survey.
Jolliffe, I. (2005). Principal component analysis. Wiley
Online Library.
Kroeger, T. and Long, D. D. E. (2001). Design and imple-
mentation of a predictive file prefetching algorithm.
In USENIX Annual Technical Conference, pages 105–
118.
Leonardi, P. M., Huysman, M., and Steinfield, C. (2013).
Enterprise social media: Definition, history, and
prospects for the study of social technologies in or-
ganizations. In Journal of Computer-Mediated Com-
munication.
Linden, G., Smith, B., and York, J. (2003). Amazon. com
recommendations: Item-to-item collaborative filter-
ing. Internet Computing, 7(1):76–80.
Nagori, R. and Aghila, G. (2011). LDA based integrated
document recommendation model for e-learning sys-
tems. In International Conference on Emerging
Trends in Networks and Computer Communications
(ETNCC).
Ngiam, J., Chen, Z., Bhaskar, S. A., Koh, P. W., and Ng,
A. Y. (2011). Sparse filtering. In Advances in Neural
Information Processing Systems, pages 1125–1133.
Office365 (2015). Microsoft office 365. http://en.
wikipedia.org/wiki/Office
365.
Ovsjanikov, M. and Chen, Y. (2010). Topic modeling for
personalized recommendation of volatile items. In
The European Conference on Machine Learning and
Principles and Practice of Knowledge Discovery in
Databases.
Paris, J.-F., Amer, A., and Long, D. D. E. (2003). A stochas-
tic approach to file access prediction. In International
Workshop on Storage Network Architecture and Par-
allel I/Os (SNAPI).
Rendle, S. (2010). Factorization machines. In IEEE Inter-
national Conference on Data Mining (ICDM).
Salakhutdinov, R., Mnih, A., and Hinton, G. (2007). Re-
stricted boltzmann machines for collaborative filter-
ing. In ACM International Conference on Machine
Learning.
Salesforce (2015). Salesforce.com. http://
www.salesforce.com/.
scikit-learn (2015). scikit-learn Machine Learning in
Python. http://scikit-learn.org/.
Song, Q., Kawabata, T., Ito, F., Watanabe, Y., and Yokota,
H. (2014). File and task abstraction in task workflow
patterns for file recommendation using file-access log.
In IEICE Transactions on Information and Systems.
Van der Maaten, L. J., Postma, E. O., and van den Herik,
H. J. (2009). Dimensionality reduction: A compara-
tive review. Journal of Machine Learning Research,
10(1-41):66–71.
Wang, C., Viswanathan, K., Choudur, L., Talwar, V., Sat-
terfield, W., and Schwan, K. (2011). Statistical tech-
niques for online anomaly detection in data centers.
In IFIP/IEEE International Symposium on Integrated
Network Management, pages 385–392.
Whittle, G. A. S., Paris, J.-F., Amer, A., Long, D. D. E.,
and Burns, R. (2003). Using multiple predictors to
improve the accuracy of file access predictions. In
International Conference on Massive Storage Systems
and Technology (MSST), pages 230–240.
Xia, P., Feng, D., Jiang, H., Tian, L., Xia, P., Feng, D.,
Jiang, H., Tian, L., and Wang, F. (2008). Farmer: A
novel approach to file access correlation mining and
evaluation reference model for optimizing peta-scale
file systems performance. In The International ACM
Symposium on High-Performance Parallel and Dis-
tributed Computing (HPDC).
Yeh, T., Long, D. D. E., and Brandt, S. A. (2001a). Per-
forming file prediction with a program-based succes-
sor model. In Modeling, Analysis and Simulation
of Computer and Telecommunication Systems (MAS-
COTS).
Yeh, T., Long, D. D. E., and Brandt, S. A. (2001b). Using
program and user information to improve file predic-
tion performance. In International Symposium on Per-
formance Analysis of Systems and Software (ISPASS).
Yeh, T., Long, D. D. E., and Brandt, S. A. (2002). Increas-
ing predictive accuracy by prefetching multiple pro-
gram and user specific files. In Annual International
Symposium on High Performance Computing Systems
and Application (HPCS).
AccessPredictionforKnowledgeWorkersinEnterpriseDataRepositories
161