The difference in learning characteristics is
influenced by individual characteristics as well. In
addition to behaviour, personal characteristics play a
major role in the learning characteristics. Personal
characteristics show special behaviour in each
individual. Various studies have revealed the
importance of understanding student characteristics
on the effectiveness of the learning process
(Kauffman, 2015; VanSickle et al., 2015; Apple,
Duncan and Ellis, 2016). However, there is still not
much literature that explore student characteristics by
applying data mining technique. According to (Sin
and Muthu, 2015), data mining techniques can be
used to improve academic quality, including
predicting student performance in learning, data
visualization, detecting student failures in learning
and even investigating student behaviour in learning.
Therefore, data mining techniques are also potential
to be used to segment student characteristics in order
to understand their learning behaviour. This study
aims to identify the learning characteristics pattern of
engineering students using data mining clustering
technique. The cluster resulted from this research can
be used to figure out the existing differences among
cluster and provide faculty members with some
insight of their student characteristics.
2 METHOD
This study uses an online questionnaire to collect
data. The online questionnaire is administered via
Universitas Negeri Malang Academic Information
System (SIAKAD) in April - May 2018. The target
respondents are all registered students in Faculty
Engineering at Universitas Negeri Malang, which are
approximately 5,300 students. Among of those
registered students, only 2,934 students fill out the
questionnaire (55.34% participation rates). The data
mining clustering model is built following the
SEMMA procedure, which are Sample, Explore,
Modify, Model, and Assess.
The first step, sample is conducted by determining
the target object of the study, which are all registered
students in Universitas Negeri Malang. The explore
step aims to understand the nature of collected data,
which is performed by plotting the collected data. The
modify step includes data preparation and data
transformation when needed. Data preparation steps
include data cleaning and data imputation. Data
cleaning aims to delete uncompleted responses and
outlier responses. Data imputation is performed to
impute missing responses with the mode or mean
responses. After data preparation steps, only 1,914
responses (65.23% usable rate) are complete and can
be used for further analysis. To identify the learning
characteristics pattern, this study uses three clustering
data mining technique. The clustering models built in
this study are K-means cluster, Kohonen cluster
analysis, and Two step cluster analysis. The last step
is to determine how to assess the model performance
(accuracy). Regarding the model accuracy, this study
use Silhouette index as suggested by (Pereda and
Estrada, 2018). Silhouette measures distance of an
element to its own cluster (cohesion) and compares it
to other clusters (separation). The higher value
indicates that the element is well matched to its own
cluster and poorly matched to other defined clusters.
The questionnaire that is used to collect data
contains 4 questions about respondent’s profile
(gender, level of study period, study program, and
GPA). In addition, the questionnaire contains 18
closed-ended question that ask about the learning
characteristic of respondent. Briefly, the item list of
the questions in the questionnaire is shown in Table
1.
Table 1: Item list in the questionnaire.
Characteristics of
discussion activity
Understanding during
learning process
Characteristics of
independent study
Table 2: Descriptive statistics of the respondent’s profile.
Identification of Learning Characteristics Pattern of Engineering Students using Clustering Techniques
275