ference in the network and computing consumption,
which can cause a bottleneck in the Data Station net-
work. The case study worked adequately with a sim-
ple aggregation algorithm, so we believe that our sys-
tem can alleviate the IT infrastructure constraints that
the healthcare organizations can have to ensure the
PHT execution while respecting the principles of the
PHT approach.
Future research should be performed to test our
solution with other use cases, by including machine
learning algorithms in the Train or dependent transac-
tions, for instance, to experience idle moments wait-
ing for input data. Other Trains with different interac-
tion mechanisms such as APIs, queries, and messages
should be created and then tested with extensions of
our system. We also propose some future work to as-
sess the solution developed in this research, integrat-
ing the implementation to existing proof of concepts
developed by organizations in the PHT project. Some
of these implementations already have deployed a
vast majority of the PHT workflow and have elabo-
rated more robust case studies. It would beneficial to
combine these efforts and evaluate how our solution
behaves as well as other metrics like performance and
execution time from when the end-user dispatches the
Train until the results are made available.
REFERENCES
Amazon (2020). Navigating GDPR Compliance on
AWS. https://docs.aws.amazon.com/whitepapers/
latest/navigating-gdpr-compliance/welcome.html.
Accessed: 2021-09-09.
Beyan, O., Choudhury, A., van Soest, J., Kohlbacher, O.,
Zimmermann, L., Stenzhorn, H., Karim, M. R., Du-
montier, M., Decker, S., da Silva Santos, L. O. B.,
et al. (2020). Distributed analytics on sensitive medi-
cal data: The personal health train. Data Intelligence,
2(1-2):96–107.
Choudhury, A., van Soest, J., Nayak, S., and Dekker, A.
(2020). Personal health train on fhir: A privacy pre-
serving federated approach for analyzing fair data in
healthcare. In Bhattacharjee, A., Borgohain, S. K.,
Soni, B., Verma, G., and Gao, X.-Z., editors, Ma-
chine Learning, Image Processing, Network Security
and Data Sciences, pages 85–95, Singapore. Springer
Singapore.
Christudas, B. (2019). Practical Microservices Architec-
tural Patterns: Event-Based Java Microservices with
Spring Boot and Spring Cloud. Apress.
Deist, T. M., Dankers, F. J., Ojha, P., Marshall, M. S.,
Janssen, T., Faivre-Finn, C., Masciocchi, C., Valen-
tini, V., Wang, J., Chen, J., et al. (2020). Distributed
learning on 20 000+ lung cancer patients–the personal
health train. Radiotherapy and Oncology, 144:189–
200.
Erder, M. and Pureur, P. (2016). Chapter 3 - getting started
with continuous architecture: Requirements manage-
ment. In Erder, M. and Pureur, P., editors, Contin-
uous Architecture, pages 39–62. Morgan Kaufmann,
Boston.
ISO (2011). ISO/IEC 25010:2011, Systems and software
engineering — Systems and software Quality Require-
ments and Evaluation (SQuaRE) — System and soft-
ware quality models.
Jin, B., Sahni, S., and Shevat, A. (2018). Designing Web
APIs: Building APIs That Developers Love. O’Reilly
Media, Inc.
Karim, M., Nguyen, B.-P., Zimmermann, L., Kirsten, T.,
L
¨
obe, M., Meineke, F., Stenzhorn, H., Kohlbacher,
O., Decker, S., Beyan, O., et al. (2018). A dis-
tributed analytics platform to execute fhir-based phe-
notyping algorithms. In Proceedings of the 11th Inter-
national Conference Semantic Web Applications and
Tools for Life Sciences (SWAT4HCLS 2018). http:
//ceur-ws.org/Vol-2275/.
Kumar, S. and Singh, M. (2019). Big data analytics for
healthcare industry: impact, applications, and tools.
Big Data Mining and Analytics, 2(1):48–57.
Mathew, S. (2021). Overview of Amazon Web Ser-
vices. https://docs.aws.amazon.com/whitepapers/
latest/aws-overview/introduction.html. Accessed:
2021-09-09.
Morris, K. (2016). Infrastructure as code: managing
servers in the cloud. O’Reilly Media, Inc.
Rieke, N., Hancox, J., Li, W., Milletari, F., Roth, H., Al-
barqouni, S., Bakas, S., Galtier, M. N., Landman,
B., Maier-Hein, K., et al. (2020). The future of dig-
ital health with federated learning. arXiv preprint
arXiv:2003.08119.
Voigt, P. and Von dem Bussche, A. (2017). The eu general
data protection regulation (gdpr). A Practical Guide,
1st Ed., Cham: Springer International Publishing.
Walonoski, J., Klaus, S., Granger, E., Hall, D., Gregorow-
icz, A., Neyarapally, G., Watson, A., and Eastman,
J. (2020). Synthea
TM
novel coronavirus (covid-19)
model and synthetic data set. Intelligence-based
medicine, 1:100007.
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Apple-
ton, G., Axton, M., Baak, A., Blomberg, N., Boiten,
J.-W., da Silva Santos, L. B., Bourne, P. E., et al.
(2016). The FAIR guiding principles for scientific data
management and stewardship. Scientific data, 3(1):1–
9.
Yang, C., Huang, Q., Li, Z., Liu, K., and Hu, F. (2017).
Big data and cloud computing: innovation opportuni-
ties and challenges. International Journal of Digital
Earth, 10(1):13–53.
WEBIST 2021 - 17th International Conference on Web Information Systems and Technologies
144