tomation is done in the form of Visualization Recom-
mendation Systems. As a result, there has been a sig-
nificant rise in the research and development of sys-
tems in recent years (Vartak et al., 2015)(Hu et al.,
2019)(Luo et al., 2018)(Krause et al., 2016). These
recommendation systems process the data and pro-
vide a set of visualizations providing insights which
could be helpful for the intended task of the analyst.
Most of the recommendation systems are designed
for the sole purpose of selecting the most relevant
features in the data and therefore, support a limited
number of visualization techniques. For example,
SeeDB (Vartak et al., 2015) aids the users to iden-
tify interesting visualizations using a deviation-based
metric, yet it only uses bar charts to display the result
and other recommendations whereas DEEPEYE (Luo
et al., 2018) only uses four common visualization
techniques namely bar charts, line charts, pie charts,
and scatter charts. SeekAView (Krause et al., 2016),
also has a fixed number of visualization techniques to
select the useful types of trends from, including fre-
quency plots, scatter plots and parallel coordinates.
The drawback of using a low number of visualization
techniques is that it is not possible to cover every type
of task using the same type of visualization technique.
The existing visualization recommendation sys-
tems that are proposed by the current literature face
several shortcomings. These shortcomings range
from the number of dimensions the recommenda-
tion system can handle to the number of visualiza-
tion techniques adopted. Moreover, most of these
systems require inputs from expert users having do-
main specific knowledge. Visualization recommenda-
tion systems that use a recommendation engine based
on supervised machine learning or neural networks
(Hu et al., 2019)(Luo et al., 2018) also suffer from
the problem of overfitting. Training the model for the
recommendation engine relies mainly on the training
data and a lack of available training data results in an
ineffective model that may be biased towards the data
similar to the training data.
To address these challenges, in this paper we pro-
pose a novel rule-based visualization recommenda-
tion system. We show how impartial and effective vi-
sualizations are recommended by using a knowledge-
based rule engine which is designed and developed
as a part of our work. The recommendation system
uses key factors such as data characteristics, intended
task and user feedback along with the knowledge-
base to decide the best suitable type of visualization
technique to be used. Furthermore, the recommen-
dations generated are ranked qualitatively based on
several statistical properties of the data.
Our contributions in this paper are summarized
below:
1. Classification of Data into Characteristics and
Proposing a Formal Visual Taxonomy:
The data is categorized based on several factors
such as the type of data (discrete, continuous) or
its format (e.g. Numerical, Categorical). These
factors are required as they influence the type of
visualization technique to be used and are there-
fore essential for the development of the rule en-
gine. A formal visual taxonomy is proposed as
well, which provides the theoretical foundation
for the construction of the knowledge base.
2. Mapping User Tasks to Visual Structures and
Creation of the Task based Visual Taxonomy:
The intended user task, required to generate vi-
sualization recommendations, is abstracted in the
form of a task based visual taxonomy which in
turn is mapped on to the type of visualization tech-
niques in order to generate the recommendations.
3. Creation of Rules for Knowledge-based Rule
Engine:
After the categorization of data based on its char-
acteristics, knowledge base rules are generated to
provide recommendations. The input factors for
the rules, apart from data categories, include as-
pects such as the intended task of the user and the
number of dimensions to be visualized. Based on
the input factors, the rule engine then decides the
best suitable visualization techniques in the form
of recommendations.
4. Ranking of Visualization Recommendations:
The recommendation system generates a set of vi-
sualizations as multiple rules could be applicable.
Therefore, a ranking algorithm is implemented
based on task dependant statistical measures so
that the visualizations are sorted in a descending
order based on their scores ensuring that the most
useful visualizations are displayed first to the user
resulting in an efficient process.
5. Evaluate the Designed System:
We test a sample scenario by using a real-world
dataset in order to evaluate the usefulness of the
generated and ranked recommendations and com-
pare the results with a popular visualization tool.
2 RELATED WORK
As discussed in the previous section, there has been
significant progress in the research and development
of visualization recommendation systems in recent
years. According to (Kaur and Owonibi, 2017), these
KDIR 2021 - 13th International Conference on Knowledge Discovery and Information Retrieval
58