According to the experiments, CpG site selection
with partially or fully age-labeled data works well
with following IRT parameter estimations. Since the
IRT parameter estimation part does not use any age
information, the model is thought to estimate
progressive methylation status as a latent trait.
Meanwhile, CpG site selection with no age
information does not work well. Further analysis
shows that the selected CpG sites have small variation
in the difficulty parameter (b_i), and this causes
deviation in low value range of ฮธ. In future research,
we will improve age label-less selection by
considering not only IT-correlation but also the test
information of a set of CpG sites. Another unfinished
work is an evaluation of each item (CpG site) based
on item fit statistics. By removing the CpG site whose
response does not fit to logistic function, we may
improve the analysis results.
Differing from the conventional regression
method, IRT-based analysis gives rich information
such as discrimination and difficulty parameters for
each CpG site. These parameters are thought to
indicate โhow fastโ and โwhenโ CpG sites will be
methylated. We will also advance our research in
relation to these points of view.
REFERENCES
Barton, M. A. et al. (1981). An upper asymptote for the
three-parameter logistic item-response model.
Princeton, N.J.: Educational Testing Service.
Birnbaum, A. (1968). Some Latent Trait Models and their
Use in Inferring an Examineeโs Ability, Statistical
Theories of Mental Test Scores (pp. 397โ479).
Field, A.E. et al. (2018). DNA Methylation Clocks in
Aging: Categories, Causes, and Consequences.
Molecular Cell, volume 71, issue 6, pp.882-895.
Galkin, F., et al. (2021), DeepMAge: A Methylation Aging
Clock Developed with Deep Learning. Aging and
disease, volume 12, issue 5, pp. 1252-1262.
Hannum, G. et al. (2013). Genome-wide Methylation
Profiles Reveal Quantitative Views of Human Aging
Rates, volume 49, issue 2, pp. 359-367.
Henrysson, Sten (1963). Correction of item-total
correlations in item analysis. Psychometrika, volume
28, issue 2, pp.211โ218.
Higgins-Chen, A.T., et al. (2022) A computational solution
for bolstering reliability of epigenetic clocks:
implications for clinical trials and longitudinal tracking.
Nature Aging, volume 2, pp. 644โ661.
Horvath, S. (2013). DNA methylation age of human tissues
and cell types. Genome Biology, volume 14, p. 3156.
Hsieh, M. (2010). A Comparison of Bayesian MCMC and
marginal maximum likelihood methods in estimating
the item parameters for the 2PL IRT model,
International Journal of Innovative Management,
Information & Production, volume 1, Number 1, pp.81-
89.
Jones, P. A. (2012). Functions of DNA methylation:
islands, start sites, gene bodies and beyond, Nature
Reviews Genetics, volume 13, pp. 484โ492.
Lima Camillo, L.P. et al. (2022). A pan-tissue DNA-
methylation epigenetic clock based on deep learning.
npj Aging 8, article number 4.
Liu, L. et al. (2020). Underlying features of epigenetic
aging clocks in vivo and in vitro. Aging Cell, volume
19, issue 10.
Rash, G. (1960). Probabilistic models for some intelligence
and attainment tests, Danish Institute for Educational
Research.
Samejima, (1969). F. Estimation of latent ability using a
response pattern of graded scores, Psychometrika
Volume 34, pp. 1โ97.
Samejima, F. (1973). Homogeneous case of the continuous
response model. Psychometrika, volume 38, pp.203โ
219.
Shu, C. et al. (2020). Comparison of methylation capture
sequencing and Infinium MethylationEPIC array in
peripheral blood mononuclear cells. Epigenetics &
Chromatin, volume 13, article number 51.
Yeung, C. (2019). Deep-IRT: Make deep learning based
knowledge tracing explainable using item response
theory, Proc. of the 12th International Conference on
Educational Data Mining.
Xiong, Z. et al. (2022). EWAS Open Platform: integrated
data, knowledge and toolkit for epigenome-wide
association study, Nucleic Acids Research, volume 50,
Issue D1, pp. D1004โD1009.