...
CP:Whatbringsyou
heretoday?
P:Ihavepaininmy
leftear.
CP:Ahsoyourear
hurts.Doyoufeelthat
youalsocanhearless
thanusual?
P:No.
...
hasSymptom
Patient
Head
Right
Ear
Left
Ear
Instance:
Left Ear
Instance:
Pain
Symptom
Hearing
Loss
Drainage
Pain
hasSymptom
Figure 3: Example showing part of a transcription of the CP - patient dialogue (left), the resulting PMG (middle) and the
sentence plan for report generation (right).
back end runs primarily on .NET Core and is writ-
ten in C#. The analyzers are written in Python, using
gRPC for communication between services/modules.
Google Cloud Speech-to-Text service transcribes the
audio and linguistic annotation is handled by Python-
Frog. For video analysis the OpenCV and the YOLO
libraries are used. Medical guidelines are modeled in
PROforma. Prot
´
eg
´
e facilitates ontology development
and triples are stored and managed with StarDog.
5 RESEARCH OUTLOOK
So far, we presented our grand vision and the imple-
mentation of our basic ideas in the first C2R proto-
type. To reach our proposed objectives, we need to
overcome several challenges i.a. in the development
of a robust architecture that is independent of input
technology, in the semantic interpretation of input that
deviates between hospitals on terminology and pro-
cedures, and in striking a balance between required
expressiveness and computational demands in con-
structing a formal representation of the transcriptions.
Our future research will focus on device integra-
tion for high-quality multimodal recognition (stage 1
in Fig. 1), on methods to build and populate the PMG
(stage 2 in Fig. 1), and on methods to filter out irrel-
evant information from medical consultations (stage
3 in Fig. 1). Our preliminary research and results
encouraged us that our ambitious goal of fully auto-
mated medical reporting is achievable.
ACKNOWLEDGEMENTS
We thank the students of the software project teams
Ki
´
eli and KettleHawks, Marjan van den Akker,
Lennart Herlaar and Sabine Molenaar for their sup-
port.
REFERENCES
Ajami, S. (2016). Use of speech-to-text technology for
documentation by healthcare providers. The National
Medical Journal of India, 29(3):148–152.
Androutsopoulos, I., Lampouras, G., and Galanis, D.
(2013). Generating natural language descriptions
from OWL ontologies: the NaturalOWL system.
JAIR, 48:671–715.
Antoniou, G., Groth, P., van Harmelen, F., and Hoekstra,
R. (2012). A Semantic Web Primer. The MIT Press,
Cambridge, Massachusetts, 3 edition.
Chiu, C.-C., Tripathi, A., Chou, K., Co, C., Jaitly, N., Jaun-
zeikare, D., Kannan, A., Nguyen, P., Sak, H., Sankar,
A., et al. (2017). Speech recognition for medical con-
versations. arXiv preprint arXiv:1711.07274.
Klann, J. G. and Szolovits, P. (2009). An intelligent listen-
ing framework for capturing encounter notes from a
doctor-patient dialog. BMC Medical Informatics and
Decision Making, 9(1).
Klock, S., van der Werf, J. M., Guelen, J. P., and Jansen, S.
(2017). Workload-based clustering of coherent feature
sets in microservice architectures. In ICSA, pages 11–
20.
Luchies, E., Spruit, M., and Askari, M. (2018). Speech
technology in Dutch health care: A qualitative study.
In BIOSTEC, volume 5, pages 339–348.
Maynard, D. W. and Heritage, J. (2005). Conversation anal-
ysis, doctor–patient interaction and medical commu-
nication. Medical Education, 39(4):428–435.
Milian, K., Hoekstra, R., Bucur, A., ten Teije, A., van
Harmelen, F., and Paulissen, J. (2015). Enhancing
reuse of structured eligibility criteria and supporting
their relaxation. Journal of Biomedical Informatics,
56:205–219.
Peleg, M. (2013). Computer-interpretable clinical guide-
lines: a methodological review. Journal of Biomedical
Informatics, 46(4):744–763.
Portet, F., Reiter, E., Gatt, A., Hunter, J., Sripada, S., Freer,
Y., and Sykes, C. (2009). Automatic generation of
textual summaries from neonatal intensive care data.
AI, 173(7-8):789–816.
Rohloff, K., Dean, M., Emmons, I., Ryder, D., and Sumner,
J. (2007). An evaluation of triple-store technologies
Automated Medical Reporting: From Multimodal Inputs to Medical Reports through Knowledge Graphs
513