proficiency. In particular, the learning journals also
contributes to the prediction accuracy of the
students’ TEM-4 achievement level.
Based on the research results, English teachers
might need to pay more attention to those students
who come from ordinary high schools and from
underdeveloped provinces. Due to their weak
English foundation, they might face more pressure
in language learning and are in need of more help,
more encouragement and more flexible learning
methods fit for them. In addition keeping regular
learning journals is highly recommended, as it can
help students to develop the habit of self-reflection
and help them regulate their English learning. It can
also help teacher to keep track on students’ learning
and emotional fluctuations at students’ report in
interviews.
5 CONCLUSIONS
In the present study, substantial data from English
majors were collected to develop classification
models to predict students’ TEM-4 performance.
Our methodology differs from priori ones in two
aspects. Firstly, previous researches are focused on
the relationship between students’ characteristics
and their TEM 4 scores rather than the accuracy of
prediction. Our study employed Naive Bayes to
predict whether students will pass the TEM-4 and
whether they will obtain excellent scores. The
accuracy of the established model is up to 98%.
Secondly, a richer set of data was collected in the
current study, including students’ demographics and
family socioeconomic status, learning related
achievement, motivation and learning journals.
What’s more, we integrate the data. A contributing
finding of this study is that students’ written learning
journals have been verified in the improvement of
the accuracy of the prediction models. Our tentative
suggestions for the English teaching are to try to
understand the students’ learning background and
emotional fluctuations, and teaching in accordance
with individual’s aptitude. The English majors and
the like are advised to keep English learning diaries,
which help know one’s own strengths and
weaknesses, and have a positive attitude towards
English learning and life.
Due to the imbalanced data in this study between
those failed and those reached excellent in TEM-4,
the findings of this study needs further validation by
a much larger sampling, and should be generalized
or used with caution.
ACKNOWLEDGEMENTS
We would very much like to thank Prof. Xiangdong
Gu’s and Prof. Qing Zhou’s team and the external
reviewers for their insightful feedback. The current
study is supported by “The Short-term International
Academic fund” of Chongqing University,
Fundamental Research Funds for the Central
Universities (Grant No. 106112015CDJSK04JD02)
in Chongqing University, National Natural Science
Foundation Project of CQ CSTC (Grant No.
cstc2016jcyjA0276), Postgraduate Education and
Teaching Reform Research Project in Chongqing
Province (Grant No. yjg153023), Degree and
Postgraduate Education Research (Grant No. C-
2015Y0415-128).
REFERENCES
Chuanyi Li. (2012). The criterion-related validation of
TEM4 based on test-takers' self-assessment, 2012
IEEE Fifth International Conference on Advanced
Computational Intelligence (ICACI), Pages: 856 – 858.
Elliot, A. J., and McGregor, H. A. (2001). A 2× 2
achievement goal framework. Journal of personality
and social psychology, 80(3), 501.
Elliot, A. J., and Murayama, K. (2008). On the
measurement of achievement goals: Critique,
illustration, and application. Journal of Educational
Psychology, 100(3), 613.
Hall, M. A. (1999). Correlation-based feature selection for
machine learning. The University of Waikato.
Li, L. (2013). Foreign Language Aptitude Components
and Different Levels of Foreign Language Proficiency
Among Chinese English Majors. In Pacific Rim
Objective Measurement Symposium (PROMS) 2012
Conference Proceeding, pp. 179-196. Springer Berlin
Heidelberg.
Rish, I. (2001). An empirical study of the naive Bayes
classifier. In IJCAI 2001 workshop on empirical
methods in artificial intelligence. Vol. 3, No. 22, pp.
41-46. IBM New York.
Sun, J. Y., (2014). “Jieba Chinese text segmentation.”
https://github.com/fxsjy/jieba, accessed July 21, 2014.
Wen, Q. F., and Wang, L. (2009). Validation of TEM 4-
Oral. Journal of PLA University of Foreign Languages,
5, 009.
Xi, C. (2012). A Linear Regression Analysis on TEM4.
Journal of Civil Aviation Flight University of China, 6,
020.
Zhou Q, Mou C, Yang D. (2015) Research Progress on
Educational Data Mining: A Survey. Ruan Jian Xue
Bao/ Journal of Software, 26(11): 3026-3042(in
Chinese).