5.1 Threats to Validity
The users, user behaviors, and devices/apps that they
used may not represent all users. We tried to minimize
this threat by collecting data from 58 test subjects
over a 30 day period. CD-CPT was tested on a limited
set of 11 sequences of real-world context events from
users and was optimized specifically for this dataset
to improve its performance. These optimizations in-
cluded adjusting the CPT-prediction scores for each
predicted context event after it was predicted, encour-
aging the model to explore. Other optimizations in-
volved adjusting the CPT prediction scores for other
context events based on their probability of occur-
rence after specific context events.To mitigate these
threats to validity, the data was cleaned for redun-
dancy and transformed to ensure compatibility with
the algorithm’s input format. Additionally, future re-
search may examine testing the model on larger and
more diverse dataset to better assess generalization of
this research.
6 CONCLUSIONS AND FUTURE
WORK
In this paper, we investigate various sequence predic-
tion algorithms, such as AKOM, TDAG, PPM, DG,
CPT, and CPT+, to predict real-world context data for
smartphones, and propose a new method called CD-
CPT (Context Data Compact Prediction Tree) for im-
proved performance. The results show that AKOM
and TDAG had the highest F-1 score of 54.4% with
a look-up window of four, while PPM had an F-1
score of 39.0% and performed the best with a look-
up window of two. CPT+ had a lower F-1 score of
8.6 compared to the other algorithms. CD-CPT, our
proposed method, was able to predict sequences of
real-world context data from users with an F-1 score
of 11.36% using only the training model. Overall, the
findings suggest that AKOM and TDAG are more ac-
curate for single event predictions and CD-CPT was
better at predicting full sequences of context data. Fu-
ture work may use AKOM or TDAG during software
testing to monitor different patterns of context events.
Researchers may further investigate the use of CD-
CPT for prediction and compare full sequences of
context data from users. The study highlights the im-
portance of choosing appropriate algorithms for pre-
dicting context data on smartphones, as this can sig-
nificantly impact the performance and user experience
of various applications
Future work may examine fault finding and ef-
fectiveness of integrating context event sequences
into automated testing processes. Future work may
also explore CD-CPT applied to domains such as
smart watches, healthcare devices, various Internet of
Things (IoT) devices and autonomous vehicles.
ACKNOWLEDGEMENTS
This work was supported by NSF grant #2149969.
REFERENCES
(2022). Android Releases — Android Developers.
Bosmans, S., Mercelis, S., Denil, J., and Hellinckx, P.
(2019). Testing iot systems using a hybrid simulation
based testing approach. Computing, 101:857–872.
Cleary, J. and Witten, I. (1984). Data compression using
adaptive coding and partial string matching. IEEE
transactions on Communications, 32(4):396–402.
Data.ai (12 Jan 2022). Number of Mobile App Downloads
Worldwide from 2016 to 2021 (in Billions). Statistica.
Dey, A. K. (2001). Understanding and using context. Per-
sonal and ubiquitous computing, 5:4–7.
Fournier-Viger, P., Gomariz, A., Gueniche, T., Soltani, A.,
Wu, C.-W., Tseng, V. S., et al. (2014). Spmf: a java
open-source pattern mining library. J. Mach. Learn.
Res., 15(1):3389–3393.
Fournier Viger, P. e. a. (2016). The SPMF Open-Source
Data Mining Library Version 2. Proc. 19th European
Conference on Principles of Data Mining and Knowl-
edge Discovery (PKDD 2016) Part III, pages 36–40.
Gellert, A., Precup, S.-A., Pirvu, B.-C., Fiore, U., Zam-
firescu, C.-B., and Palmieri, F. (2021). An empirical
evaluation of prediction by partial matching in assem-
bly assistance systems. Applied Sciences, 11(7):3278.
Goyal, P., Khan, M. K., Steil, C., Martel, S. M., and Bryce,
R. (2023). Smartphone context event sequence pre-
diction with poermh and tke-rules algorithms. In
2023 IEEE 13th Annual Computing and Communica-
tion Workshop and Conference (CCWC), pages 0827–
0834. IEEE.
Gueniche, T., Fournier-Viger, P., Raman, R., and Tseng,
V. S. (2015). Cpt+: Decreasing the time/space com-
plexity of the compact prediction tree. In Advances in
Knowledge Discovery and Data Mining: 19th Pacific-
Asia Conference, PAKDD 2015, Ho Chi Minh City,
Vietnam, May 19-22, 2015, Proceedings, Part II 19,
pages 625–636. Springer.
Gueniche, T., Fournier-Viger, P., and Tseng, V. S. (2013).
Compact prediction tree: A lossless model for ac-
curate sequence prediction. In Advanced Data Min-
ing and Applications: 9th International Confer-
ence, ADMA 2013, Hangzhou, China, December 14-
16, 2013, Proceedings, Part II 9, pages 177–188.
Springer.
Hochreiter, S. and Schmidhuber, J. (1997). Long short-term
memory. Neural computation, 9(8):1735–1780.
IoTBDS 2024 - 9th International Conference on Internet of Things, Big Data and Security
172