generated for each data packet that can provide infor-
mation about activating and deactivating a data packet
and the amount of data packets that are transferred in
both directions between the mobile and the Control
Node.
5.1 Call Logs Database on TETRA
Network
The database contain the real logs generated on
TETRA network in the Emilia Romagna region. This
database is a relational database of more than 30GB
size containing multiple information on each teth-
ered call. The database was built using the Firebird
RDBMS and has the following features:
• 124 tables;
• 2000 attributes ;
• 20 tables for storing log information;
• 70 attributes for each log table (on average);
• 30 Gigabytes of data, of disk occupation;
• 132.165.089 tuples.
We have first identified a fact of interest in order
to define a Data Mart for OLAP analysis. We have
chosen to locate logs for calls that generated alerts,
that is, those for which the call was interrupted or dis-
connected for external cause.
5.2 Logical Model of the OLAP Cube
The source database is very heterogeneous and con-
tains many unnecessary data for the analysis to be
carried out. For this reason, it was crucial, through
ETL procedures, to extract the data log of interest. In
order to perform these transformations, it is necessary
to define in the measures of interest, dimensions and
hierarchies that we want to analyze. The conceptual
model used to represent the information schema is the
Dimensional Fact Model, due to some privacy con-
straints, we limited our analysis to number of calls
measures.
As regards the analysis dimensions, we identified
the following:
• CALL END REASON (cardinality 23). Indi-
cates the reason for call ending, this type of
information is already classified in the original
database in a table that lists 23 reasons why a
TETRA network call may end;
• DISCONNECT CAUSE (cardinality 203). in-
dicates the reason why an user has been discon-
nected from the network, differently from the rea-
son for ending a call, this dimension states why it
Figure 4: Dimensional Fact Model for OLAP Analisys on
Tetra Network.
is disconnected from the TETRA network, it con-
tains 203 causes of disconnection;
• SCN (cardinality 2). as the data comes from a
log database on TETRA Network of Emilia Ro-
magna, there is no reason to have a territorial di-
mension, except for the SCN to which the inter-
rupted call refers. Analysis of the source database
we found that all data refers only to two SCNs:
one located in the municipality of Bologna and
one in the municipality of Faenza;
• TEMPORARY DIMENSION (cardinality 8760
as this number refers to the hours in one year).
logs are stored in detail per millisecond, but there
is no practical need to aggregate the data till such
finer granularity then we decided to support the
analysis of the temporal dimension hierarchically
by hours, days and months. The year does not
make sense in our experimental analysis as the
data refer to 2010. Also we want to be able to ag-
gregate for: days in week and weekend, day of the
week, time zone and day type. By analyzing the
data based on this model we will be able to under-
stand on which days network problems occurred,
for what reasons user have been disconnected, for
what reasons the calls have been terminated and
on what SCN, for which time slot, for which days
of the week and so on.
The Dimensional Fact Model referring to the data
model for this OLAP analysis is shown in 4.
Using ETL procedures, the table of facts and di-
mensions was extracted from the original database
and uploaded also to MySQL DBMS for the sake of
comparison. The Star Schema, representing the DFM
for TETRA logs on MySQL, is shown in 5
5.2.1 Experimental Settings
Using MonOLAP, with the Schema Workbench
PLUS component on the schema described in Figure
5, the following schemes are obtained:
• Star Schema on MySQL
• Flat Schema on MonetDB
KomIS 2017 - Special Session on Knowledge Discovery meets Information Systems: Applications of Big Data Analytics and BI -
methodologies, techniques and tools
310