the ISO/IEC 25000 (SQuaRE) standard (IEC, 2005),
whose structure formalizes the interpretation of mea-
sures and the relationship among them. It allows the
representation of several attributes and the definition
of how the measures should be aggregated, as well as
what procedures have to be used to homogenize their
values. It is possible to define one quality model for
each attribute, and then, these different perspectives
can be aggregated following a hierarchical structure.
Furthermore, it allows the configuration of thresholds,
weights and operators. The next subsections present
two Quality Model representing e-commerce system
components, such as the component for Interface and
for the whole system.
3.1 Interface Trustworthiness Quality
Model
Like any part of a software product, measuring inter-
face quality is important because it helps to under-
stand deficiencies and guides improvements in this
field. The work of Casare et al. (Casare et al., 2021)
presented 25 interface metrics formalized, as follows:
4 sub attributes composing Learnability (Easy of
Learning, Navigation, Coherent Buttons and Coher-
ent Menus); 4 composing Efficiency (General Flex-
ibility, Environment Flexibility, Responsive, Perfor-
mance); 4 composing Perceivability (Simple Screens,
Colors and Fonts, Perception of System Status, Per-
formance); 4 composing Operability (Back Button,
Perceivable Focus, Broken Links, Affordable); 2
composing Safety in Use (Failure Handling, Rate of
Failures); 5 composing User Experience (Company
Information, Company Reputation, Privacy Policies,
Customer Opinion and Padlock). Furthermore, Sat-
isfaction and Usefulness are not composed of other
metrics. More details about these metrics can be
found in the work of Casare et al. (Casare et al.,
2021).
Based on this interface-based metrics formaliza-
tion, we designed an Interface Quality Model so that
an interface trustworthiness score can be calculated
based on the identified attributes. To preserve the
readability, Figure 1 shows only the main composite
attributes of the interface QM. It presents three levels
of this QM and partially presents the fourth and fifth
levels (in fact, we detailed only the Efficiency sub at-
tributes in the fourth level and Performance PageUp
in the fifth level). It is important to note that there is
a common sub attribute between Efficiency and Per-
ceivability (i.e., Performance), which means that this
measurement is used to calculate the score of both
composite attributes.
Due to the strong subjectivity of the interface at-
tributes, the scores for the majority of the attributes
are obtained through questionnaires, which are sup-
posed to be answered after a test with the users (e.g.,
the users interact with the website to perform some
usual functionalities and after answering the question-
naires). Only 4 attributes (Broken Links, Affordable,
Performance Page Up and Responsive) are less sub-
jective and can be measured through automatic tools.
Leaf attributes represent metric definitions with as-
sociated scores based on the measures collected by
the system monitoring process. They can be nor-
malized (using the limit values NormalMin and Nor-
malMax) ensuring that operators aggregate values at
same scales and they are compared against thresholds
(ThresholdMin and ThresholdMax) assuring that only
relevant and valid values are considered. The values
for each attribute i are influenced by an adjustable
weight (Wi), which specifies a preference over one or
more attributes of the system, according to predefined
requirements. For example, in the context of Figure
1, Usability (W
1
=35%) and Accessibility (W
2
=35%)
have the same importance to compose the Interface
Trustworthiness score, while User Experience has a
bit less importance (W
3
=30%) in this composition.
The final score is computed using the aggrega-
tion of the attribute values, starting from the leaf-
level attributes towards the root one, using the Opera-
tors (OPn), which describe the relation between them.
Different types of operators may be used to define
the conditions under which composite attributes are
aggregated, such as neutrality (combination of simul-
taneous satisfaction requirements with replaceability
capability); simultaneity (all requirements must be
satisfied); replaceability (used when one of the re-
quirements has a higher priority replacing the remain-
ing requirements). In Figure 1, Environment Flexibil-
ity is a Efficiency sub attribute, which, in turn, com-
poses Usability attribute. This is a subjective sub at-
tribute and it must be obtained by applying a ques-
tionnaire that must be answered by users. The ques-
tionnaire uses a 7-point Likert scale, with questions
that helps to evaluate if the e-commerce website un-
der test is flexible to be used in different browsers
and devices. The measurement score is obtained by
the weighted average of each question answered by
all users. With the answers of the questionnaires, the
weighted average is obtained considering the Likert
scale (1 to 7) and the total of responses for each of
these points (n1 – total responses as “Strongly Dis-
agree” to n7 – total responses as “Strongly Agree”),
after counting the answers of all participants. Then,
the standard deviation must be evaluated allowing bet-
ter analysis of the perception score.
Using a Quality Model to Evaluate User Interface Trustworthiness of e-Commerce Systems: Scoring Strategies and Preliminary Results
211