not take into account such ways of guessing.
Consequently, we are facing the challenge to find a
scoring method, that is able to recognize and properly
penalize guessing. Previously proposed algorithms
suffer from imbalance and skewness as we show in 2.
The task to find the scoring method can be divided
into two steps: to find a method to determine points for
the correctly marked options(1) and to find a method
to determine the penalty for the incorrectly marked
options(2). For the first part a reasonable approach
was proposed by Ripkey (Ripkey and Case, 1996).
Thus our research aims to provide a method for the
second part (determining penalties). We propose a
general approach and a mathematical model, that takes
into account the most common ways of guessing and
behaves balanced at the same time.
Our concept is based on the assumption, that scor-
ing can be based on the guessing level of the question.
By guessing level we mean here (in partial scoring)
the probability to obtain more than zero points. Each
question is associated with a difficulty to guess a (par-
tially) correct answer. To accommodate the difficulty
level of guessing in the scoring method, we propose to
determine the penalty only when a student marks more
options, than the actual number of correct ones. We
argue that our approach can be added as an option, or
even as a replacement of manual designation of penal-
ties. We claim that our algorithm behaves better, than
existing ones and prove that with both synthetic and
real experiments.
The paper is structured as follows: first, we discuss
existing algorithms for scoring MMQs, then we de-
scribe our approach on conceptual and mathematical
levels and finally we show and discuss the results of
synthetic and real-life experiments.
2 RELATED WORK
There are several existing platforms, that use multiple-
mark type of questions as well as several approaches to
score them. We collected such approaches to describe,
discuss and compare them. Existing approaches for
scoring the multiple-mark questions implement four
base concepts. In the section we describe the basic
ideas, advantages and disadvantages of these concepts.
2.1 Dichotomous Scoring
This method is often used in paper-based quizzes,
where the good quality of quizzes allows teacher to
be more strict when score the results. As the aim of
e-based quizzes is not only to score the results, but to
catch the gaps of knowledge, the scoring of partially
correct responses shows the actual knowledge of the
student better. Also, dichotomous scoring does not
show the accurate progress of the student. However,
when dealing with multiple-mark questions dichoto-
mous scoring almost excludes the possibility of guess-
ing, that is why we use it as a standard of reference
when evaluating our approach with real users.
2.2 Morgan Algorithm
One of the historically first methods for scoring the
MMQs was described in the 1979 by Morgan (Mor-
gan, 1979). For our experiments we use the improved
algorithm, in accordance to which the scores are deter-
mined by the following algorithm:
1.
for each option chosen which the setter also consid-
ers correct, the student scores
+(p
max
/n)
, where
n
is a number of correct options
2.
for each option chosen which the setter consid-
ers to be incorrect, the student scores
−(p
max
/k)
,
where k is a number of distractors.
3.
for each option not chosen no score, positive or
negative, is recorded regardless of whether the set-
ter considers the response to be correct or incorrect.
However, the experiments show a large dependence
between number of options (correct and incorrect) and
amount of penalty, that indicates the skewness of the
method (see 4.1).
2.3 MTF Scoring
Multiple-mark questions can be scored with the ap-
proaches developed for multiple true-false items.
Tsai (Tsai and Suen, 1993) evaluated six different im-
plementations of the approach. Later his findings were
confirmed by Itten (Itten and Krebs, 1997). Although
both researches found partial crediting to be superior
to dichotomous scoring in a case of MTFs, they do not
consider any of the algorithms to be preferable. This
fact allows us to use the most base of them for our
experiments.
MTF scoring algorithms imply that any item has
n
options and a fully correct response is awarded with
full amount of points
p
max
. If the user did not mark a
correct option or marked a distractor, she is deducted
with the penalty
s = p
max
/n
points. Thus a student
receives points for not-choosing a distractor as well as
for choosing a correct option. This point does not fit
perfect to multiple-mark questions because of the dif-
ferences between two types (Pomplun and Omar, 1997;
Cronbach, 1941; Frisbie, 1992). Our experiments (see
4.1) confirm the studies and show the skewness of the
concept when deal with MMQs.
CSEDU2013-5thInternationalConferenceonComputerSupportedEducation
412