average since none of them contain another concept
(similar to the argument above).
We experimented with the NN model with one
and multiple hidden layers. We found that deepening
NNs did not further improve the performance
compared with simple structure NNs. A possible
explanation for this is that for deeper NNs to yield
better performance, more training data from larger
datasets is required. We will revisit those experiments
using larger datasets in the future.
Future Work: In the next stage of our project, as a
remedy to the shortcomings described in Limitations
Section, we plan to take the following actions. In
Phase 1, we will insert into the Initial CIT (ICIT)
these components:
1. Existing abbreviations in Cardiology
(Heart.org, u.d; Utah, u.d) and in Medicine in
general (Wikipedia, 2015).
2. Numbers from the range expected in
Cardiology EHRs.
3. Verbs with different tenses
(worldclasslearning, u.d).
4. Medications used in Cardiology (Heart.org,
u.d).
5. Common forms of negation (learngrammar,
u.d).
6. We will enrich the low synonym coverage of
CCS concepts (0.606) by migrating synonyms
from UMLS(Bodenreider, 2004).
As a result, we expect a more accurate
highlighting of EHRs.
5 CONCLUSIONS
We describe a research project to curate a Cardiology
Interface Terminology (CIT) dedicated for
highlighting EHRs of patients. The purpose is to
highlight all and only the important content of an
EHR note which a clinician need to review.
Highlighted EHRs will enable healthcare
professionals to read only the highlighted important
information of an EHR note rather than cursorily
review it, risking missing critical medical
information. Machine Learning techniques are
utilized for the design of CIT for the Cardiology
specialty. Transfer Learning will be used to design
interface terminologies for other specialties. As the
training data required for machine learning, an early
version of CIT (Koohi H. Dehkordi et al., 2023)
designed with a semi-automatic mining method rather
than slow manual mining is used. The results
demonstrate significant progress over highlighting
with SNOMED CT and with the early version of CIT.
We discussed ideas to further improve the coverage
of highlighting the important content of EHR to
achieve a satisfactory highlighting.
REFERENCES
Agarap, A. F. (2018). Deep learning using rectified linear
units (relu). arXiv preprint arXiv:1803.08375.
Agrawal, A., He, Z., Perl, Y., Wei, D., Halper, M., Elhanan,
G., & Chen, Y. (2013). The readiness of SNOMED
problem list concepts for meaningful use of electronic
health records. Artif Intell Med, 58(2), 73-80.
https://doi.org/10.1016/j.artmed.2013.03.008
Alimova, I., & Tutubalina, E. (2020). Multiple features for
clinical relation extraction: A machine learning
approach. Journal of biomedical informatics, 103,
103382.
Alsentzer, E., Murphy, J. R., Boag, W., Weng, W.-H., Jin,
D., Naumann, T., & McDermott, M. (2019). Publicly
available clinical BERT embeddings. arXiv preprint
arXiv:1904.03323.
Apathy, N. C., Rotenstein, L., Bates, D. W., & Holmgren,
A. J. (2023). Documentation dynamics: note
composition, burden, and physician efficiency. Health
Services Research, 58(3), 674-685.
Bodenreider, O. (2004). The unified medical language
system (UMLS): integrating biomedical terminology.
Nucleic acids research, 32(suppl_1), D267-D270.
Cui, S., Luo, J., Ye, M., Wang, J., Wang, T., & Ma, F.
(2022). MedSkim: Denoised Health Risk Prediction via
Skimming Medical Claims Data. 2022 IEEE
International Conference on Data Mining (ICDM),
Dashboard., F. A. E. R. S. F. P. https://fis.fda.gov/sense/
app/95239e26-e0be-42d9-a960-9a5f7f1c25ee/sheet/45
beeb74-30ab-46be-8267-5756582633b4/state/analysis
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2018).
Bert: Pre-training of deep bidirectional transformers
for language understanding. arXiv preprint
arXiv:1810.04805.
Dymek, C., Kim, B., Melton, G. B., Payne, T. H., Singh,
H., & Hsiao, C.-J. (2021). Building the evidence-base
to reduce electronic health record–related clinician
burden. Journal of the American Medical Informatics
Association, 28(5), 1057-1061.
Elkin, P. L., Brown, S. H., Husser, C. S., Bauer, B. A.,
Wahner-Roedler, D., Rosenbloom, S. T., & Speroff, T.
(2006). Evaluation of the content coverage of
SNOMED CT: ability of SNOMED clinical terms to
represent clinical problem lists. Mayo Clinic
Proceedings,
Francis, S., Van Landeghem, J., & Moens, M.-F. (2019).
Transfer learning for named entity recognition in
financial and biomedical documents. Information,
10(8), 248.