overcome it, long running times in LCA, and the
complexity of interpreting the results.
ACKNOWLEDGEMENTS
This research was funded by the Foundation for
Research in Rheumatology (FOREUM).
Further acknowledgements were removed for
review purposes.
DISCLAIMER
This paper presents independent research funded by
the Foundation for Research in Rheumatology
(FOREUM) that is currently ongoing. Views
expressed are those of the author(s) and not necessarily
those of all partners involved in the FOREUM study.
REFERENCES
Agrawal, R. and S. Prabakaran (2020). "Big data in digital
healthcare: lessons learnt and recommendations for
general practice." Heredity 124(4): 525-534.
Akaike, H. (1987). "Factor analysis and AIC."
Psychometrika 52(3): 317-332.
Binder, H. and M. Blettner (2015). "Big data in medical
science--a biostatistical view." Dtsch Arztebl Int
112(9): 137-142.
Boeschoten, L., D. Oberski and T. de Waal (2017).
"Estimating Classification Errors Under Edit
Restrictions in Composite Survey-Register Data Using
Multiple Imputation Latent Class Modelling (MILC)."
Journal of Official Statistics 33(4): 921-962.
Bozdogan, H. (1987). "Model selection and Akaike's
Information Criterion (AIC): The general theory and its
analytical extensions." Psychometrika 52(3): 345-370.
Calders, P. and A. Van Ginckel (2018). "Presence of
comorbidities and prognosis of clinical symptoms in
knee and/or hip osteoarthritis: A systematic review and
meta-analysis." Semin Arthritis Rheum 47(6): 805-813.
Caliński, T. and J. Harabasz (1974). "A dendrite method for
cluster analysis." Communications in Statistics 3(1): 1-27.
Cohen, B., D. K. Vawdrey, J. Liu, D. Caplan, E. Y. Furuya,
F. W. Mis and E. Larson (2015). "Challenges
Associated with Using Large Data Sets for Quality
Assessment and Research in Clinical Settings." Policy
Polit Nurs Pract 16(3-4): 117-124.
Ehrenstein, V., H. Kharrazi, H. Lehmann and C. O. Taylor
(2019). Chapter 4 Obtaining Data From Electronic
Health Records. Tools and Technologies for Registry
Interoperability, Registries for Evaluating Patient
Outcomes: A User’s Guide, 3rd Edition, Addendum 2
[Internet]. R. E. Gliklich, M. B. Leavy and N. A.
Dreyer, Rockville (MD): Agency for Healthcare
Research and Quality (US).
Grant, R. W., J. McCloskey, M. Hatfield, C. Uratsu, J. D.
Ralston, E. Bayliss and C. J. Kennedy (2020). "Use of
Latent Class Analysis and k-Means Clustering to
Identify Complex Patient Profiles." JAMA Netw Open
3(12): e2029068.
Hansen, N. S., L. Angquist, P. Lange and R. Jacobsen
(2020). "Comorbidity Clusters and Healthcare Use in
Individuals With COPD." Respir Care 65(8): 1120-1127.
Henry, D., A. B. Dymnicki, N. Mohatt, J. Allen and J. G.
Kelly (2015). "Clustering Methods with Qualitative
Data: a Mixed-Methods Approach for Prevention
Research with Small Samples." Prev Sci 16(7): 1007-1016.
Jung, T. and K. A. S. Wickrama (2008). "An Introduction
to Latent Class Growth Analysis and Growth Mixture
Modeling." Social and Personality Psychology
Compass 2(1): 302-317.
Khalid, S. and D. Prieto-Alhambra (2019). "Machine
Learning for Feature Selection and Cluster Analysis in
Drug Utilisation Research." Current Epidemiology
Reports 6(3): 364-372.
Liao, M., Y. Li, F. Kianifard, E. Obi and S. Arcona (2016).
"Cluster analysis and its application to healthcare
claims data: a study of end-stage renal disease patients
who initiated hemodialysis." BMC Nephrol 17: 25.
Pinedo-Villanueva, R., S. Khalid, V. Wylde, R.
Gooberman-Hill, A. Soni and A. Judge (2018).
"Identifying individuals with chronic pain after knee
replacement: a population-cohort, cluster-analysis of
Oxford knee scores in 128,145 patients from the
English National Health Service." BMC Musculoskelet
Disord 19(1): 354.
NJR Report (2020). "National Joint Registry 17th Annual
Report 2020." Available from https://reports.njrcentre.org.uk/downloads.
Rousseeuw, P. J. (1987). "Silhouettes: A graphical aid to
the interpretation and validation of cluster analysis."
Journal of Computational and Applied Mathematics 20:
53-65.
Schwarz, G. (1978). "Estimating the dimension of a
model." The Annals of Statistics 6(2): 461-464.
Sclove, S. L. (1987). "Application of model-selection
criteria to some problems in multivariate analysis."
Psychometrika 52(3): 333-343.
Swain, S., A. Sarmanova, C. Coupland, M. Doherty and W.
Zhang (2020). "Comorbidities in Osteoarthritis: A
Systematic Review and Meta-Analysis of
Observational Studies." Arthritis Care Res (Hoboken)
72(7): 991-1000.
Swain, S., A. Sarmanova, C. Mallen, C. F. Kuo, C.
Coupland, M. Doherty and W. Zhang (2020). "Trends
in incidence and prevalence of osteoarthritis in the
United Kingdom: findings from the Clinical Practice
Research Datalink (CPRD)." Osteoarthritis Cartilage
28(6): 792-801.