Privacy considerations currently limit cross-
institutional validation of our matching approach.
Future work should incorporate privacy-preserving
computation methods that enable collaborative model
validation without compromising patient
confidentiality.
The current implementation overlooks potential
feature interactions by treating each characteristic
independently. Development of composite features
that capture relationships between administrative,
clinical, and temporal patterns could enhance
discriminative power.
Real-time feature extraction and matching present
computational challenges not addressed in our
retrospective analysis. Research into efficient
algorithms and optimization techniques would
facilitate clinical deployment of our information-
theoretic approach.
Specialty-specific matching requirements and
varying documentation practices across clinical
domains warrant investigation. Adaptive frameworks
that account for department-specific feature stability
and information content could improve matching
accuracy in specialized care settings.
Future work could explore hybridizing our
entropy-based framework with deterministic
methods, as Ong et al. suggest, to address missing
data and validate across diverse healthcare systems.
5 CONCLUSIONS
The information-theoretic analysis successfully
established a framework for patient matching in
critical care settings, revealing three complementary
feature groups: demographics/administrative
( 𝐷
(
𝐹
)
=12247.56 bits), ICU care patterns
( 𝐷
(
𝐹
)
=266.40 bits), and clinical records
( 𝐷
(
𝐹
)
=12.10 bits). While the combined
discriminative power (12526.06 bits) substantially
exceeds the theoretical minimum threshold
( log
(
𝑁
)
≈16bits), this significant redundancy
presents both advantages and challenges.
The excess discriminative power provides
robustness against missing data and institutional
variability. However, it suggests potential
computational inefficiencies and possible overfitting
to institution-specific patterns. Future
implementations should focus on optimizing feature
selection to maintain matching accuracy while
reducing computational overhead.
The research demonstrates that effective patient
matching requires balancing:
Feature stability vs. information content
Computational efficiency vs. redundancy
Institutional generalizability vs. local pattern
optimization
This framework provides a foundation for
implementing reliable patient matching systems,
though further validation across diverse healthcare
environments and optimization of feature selection
methods is needed.
REFERENCES
Blecker, S., Katz, S. D., Horwitz, L. I., Kuperman, G., Park,
H., Gold, A., & Sontag, D. (2016). Comparison of
Approaches for Heart Failure Case Identification From
Electronic Health Record Data. JAMA Cardiology,
1(9), 1014. https://doi.org/10.1001/
jamacardio.2016.3236
Clark, S. J., Halter, M., Porter, A., Smith, H. C., Brand, M.,
Fothergill, R., Lindridge, J., McTigue, M., & Snooks,
H. (2019). Using deterministic record linkage to link
ambulance and emergency department data: Is it
possible without patient identifiers?: A case study from
the UK. International Journal of Population Data
Science, 4(1). https://doi.org/10.23889/ijpds.v4i1.1104
Duggal, R., Khatri, S. K., & Shukla, B. (2015). Improving
patient matching: Single patient view for Clinical
Decision Support using Big Data analytics. 2015 4th
International Conference on Reliability, Infocom
Technologies and Optimization (ICRITO) (Trends and
Future Directions), 1–6. https://doi.org/10.1109/
ICRITO.2015.7359269
Evans, R. S., Benuzillo, J., Horne, B. D., Lloyd, J. F.,
Bradshaw, A., Budge, D., Rasmusson, K. D., Roberts,
C., Buckway, J., Geer, N., Garrett, T., & Lappé, D. L.
(2016). Automated identification and predictive tools to
help identify high-risk heart failure patients: Pilot
evaluation. Journal of the American Medical
Informatics Association, 23(5), 872–878.
https://doi.org/10.1093/jamia/ocv197
Fernandes, L., & O’Connor, M. (2015). Accurate Patient
Identification—A Global Challenge. Perspectives in
Health Information Management, International, 1–7.
Gehrmann, S., Dernoncourt, F., Li, Y., Carlson, E. T., Wu,
J. T., Welt, J., Foote, J., Moseley, E. T., Grant, D. W.,
Tyler, P. D., & Celi, L. A. (2018). Comparing deep
learning and concept extraction based methods for
patient phenotyping from clinical narratives. PLOS
ONE, 13(2), e0192360. https://doi.org/10.1371/
journal.pone.0192360
Godlove, T., & Ball, A. W. (2015). Patient Matching within
a Health Information Exchange. https://pmc.ncbi.nlm.
nih.gov/articles/PMC4696093/
Goldberger, A. L., Amaral, L. A., Glass, L., Hausdorff, J.
M., Ivanov, P. C., Mark, R. G., Mietus, J. E., Moody,
G. B., Peng, C.-K., & Stanley, H. E. (2000).
PhysioBank, PhysioToolkit, and PhysioNet: