developed yet, therefore baseline incident diabetes
was also included as a covariate in the model.
Incident diabetes was defined as a fasting plasma
glucose level ≥ 126 mg/dl, a 2-hr OGTT glucose of
≥ 200 mg/dl or current use of hypoglycaemic drug
therapy regardless the reason.
Continuous data was compared between the two
groups using the two sample independent t-test in
case of normality (verification with the
Kolmogorov-Smirnov test with Lilliefors correction)
and variance homogeneity (verification with the
Levene test). If continuous data was normally
distributed but variance was heterogeneous Welch’s
t-test was used and if data showed no normal
distribution, the Mann-Whitney-U test was used.
Unpaired categorical data was compared using
Fisher´s exact test. For calculations SPSS (IBM
Corp. Released 2013. IBM SPSS Statistics for
Windows, Version 22.0. Armonk, NY: IBM Corp.)
and the statistical computing software R Version
3.2.3 (R Foundation for Statistical Computing,
Vienna, Austria. URL http://www.R-project.org)
were used. The uncorrected type I error was set at
5% (two-sided), this means no adjustment for the
type I error was made.
3.2 First Model
The a-priori selected baseline covariates (except
gender) discriminated very well regarding the
indication of baseline insulin resistance, so for a first
model all baseline covariates were used to predict
indication of insulin resistance at follow-up.
The estimates based on this logistic regression
model (model Ia) are presented in the table below.
Table 3: Variables of the model Ia.
Baseline Covariates p-value Odds-Ratio 95% CI for OR
2hr-OGTT glucose level [mg/dl] 0.001
**
1.015 1.006-1.024
Adiponectin [µg/ml] 0.038
*
0.911 0.834-0.995
Age [years] 0.547 0.983 0.931-1.039
BMI [kg/m²] 0.077 1.169 0.983-1.39
Fat mass [kg] 0.407 0.972 0.91-1.039
F-Insulin [µU/ml] 0.001
**
1.706 1.24-2.348
Glucose [mg/dl] 0.011
*
1.058 1.013-1.106
HbA1c [%] 0.977 0.986 0.397-2.452
HDL cholesterol [mg/dl] 0.010
*
0.963 0.936-0.991
HOMA-Index 0.008
**
0.212 0.067-0.672
kITT-Index 0.893 1.017 0.794-1.303
Lean mass [kg] 0.980 1.000 0.951-1.051
Triglyceride [mmol/] 0.979 1.000 0.996-1.004
Waist-Hip-Ratio 0.394 9.645 0.052-1773.282
Gender (male vs. female) 0.037
*
3.886 1.086-13.897
Hypertension (no vs. yes) 0.395 0.776 0.432-1.393
Incident diabetes (no vs. yes) 0.708 1.403 0.238-8.265
Constant 0.004
**
0.000 1.006-1.024
This model yielded an overall percentage of
82.2% correct classifications (90.5% correct
classifications for no indication of insulin resistance
and 65.3% correct classifications for indication of
insulin resistance). A Nagelkerke’s R² of 0.550
indicated also a good model fit.
Based on the model an increase in one unit of
“2hr-OGTT glucose level [mg/dl]”, “F-Insulin
[µU/ml]” and “Glucose [mg/dl]” is associated with a
significant higher chance for insulin resistance. An
increase of one unit in “Adiponectin [µg/ml]” and
“HDL cholesterol [mg/dl]” is associated with a
significant lower chance for insulin resistance. There
was also a significant increased chance for women
suffering from insulin resistance at follow-up.
Interestingly low levels of HOMA-index
baseline were associated with a higher chance of
insulin resistance at follow-up (Odds Ratio = 0.212).
But a univariate analysis yielded the expected
association (Odds Ratio = 4.315, p < 0.001**). So
this unexpected multivariate Odds Ratio may be
caused through the multicollinearity of the model.
The confidence interval of the “Waist-Hip-
Ratio” was extremely large so a bootstrapping
approach based on 1000 bootstrap-samples was used
to validate the numerical integrity of model Ia. The
confidence interval for “Waist-Hip-Ratio” was even
more extreme so this variable was excluded from the
model. The results from this model (model Ib)
remained almost the same like model Ia.
The classification results of the model Ib are
presented in the table below.
Table 4: Classification result of model Ib.
Predicted
Observed
no IR IR
% correct
no IR
272 29 90.4
IR
51 97 65.5
Overall %
82.2
(Cut-Off value 0.5)
3.3 Second Model
The first model yielded a good overall percentage of
correct classification results but was relatively
complex because various laboratory parameters were
needed. So the next step was to reduce the
complexity of the model using a variable selection
approach (Bursac et al., 2008). A reduced model
with similar results but fewer needed laboratory
parameters would also be more cost efficient.
The first approach was using a backwards-
likelihood-ratio-approach (probability for entry 0.05,
probability for removal 0.10) which led to an overall
percentage of correct classification of 81.7% but this