Research on the Influence Factors that Possibly Lead to Diabetes
Pengzhou Xu
School of Mathematical Sciences, Inner Mongolia University, Hohhot, 010000, China
Keywords: Diabetes, Pathogenic Factors, Binary Logistic Model.
Abstract: Diabetes has become a serious public health problem worldwide. Previous studies have found that diabetes is
related to family genetics, age and high blood pressure, but there are other unknown factors worth
investigating. In this study, a Binary Logistic Model was used to process data from the American Behavioral
Risk Factor Surveillance System (BRFSS), published in 2015. The data included 30,691 men and women of
all income levels and age groups. The study concluded that while diabetes was not associated with Vegetable
Consumption, it had significant positive effects on Age, Gender, BMI, High Blood Pressure, High Cholesterol,
Smoking, Stroke and Difficulty Walking. There were significant negative effects on Exercise, Fruit
Consumption, Alcohol and Education. Among the factors closely related to diabetes, Stroke, Age, Difficulty
Walking, did not appear in previous studies. The research not only provides some new perspectives for
relevant medical personnel to study the pathogenesis of diabetes, but also helps diabetic patients to treat
diabetes in a timely manner. At the same time, it also plays a positive role in diabetes prevention.
1 INTRODUCTION
China's economic development has led to changes in
Chinese people's living habits, and the incidence of
diabetes has increased year by year (Wang et al.,
2021). In addition, from 2013 to 2020, the mortality
rate of urban diabetic patients in China has increased
significantly, and diabetes has become an important
public health problem in China (Zhu et al., 2020 & Li
et al., 2020). Therefore, understanding the causes of
diabetes is of great significance to control the
development and treatment of the disease and reduce
the mortality. The purpose of this paper is to study the
potential factors that lead to diabetes to help people
assess their own risk of diabetes, and to take a series
of protective and treatment measures
.
Diabetes is a metabolic disease, which is usually
caused by hereditary and long-term external
influences, causing organ lesions, resulting in lower
insulin secretion than normal levels (Bai, 2018 &
Robinson and Pickering 2024)). The occurrence
factors of diabetes are complex, and some scholars
have found that diabetes has a certain correlation with
age, BMI, overweight and hypertension (Zhang et al.,
2022). In addition, Du et al. found that smoking and
living conditions were related to diabetes (Du et al.,
2022). Based on these research results, this paper will
study whether 14 factors (Age, Gender, BMI, High
Blood Pressure, High Cholesterol, Smoking, Stroke,
Exercise, Fruit Consumption, Vegetable
Consumption, Alcohol, Difficulty Walking,
Education, Income) whether these factors are related
to diabetes. In a similar direction, Ye et al. used a
multi-factor logistic regression model and a mixed
graph model (Ye et al.,2024). Logistic regression
analysis model is a classic model with high efficiency
and simplicity. In the field of medical research, case-
control studies are needed to establish multiple paired
groups. The literature indicates that unhealthy
lifestyle (Smoking, Low Physical Pabor, Alcohol,
BMI) is closely related to diabetes, but the screening
population in the literature is mainly Chinese elderly,
and the possibility of participating in screening in
other age groups is not fully considered. Li et al.
established a structural equation model (SEM) (Li et
al.,2023), but the computational complexity of this
model was high, and a large amount of data needed to
be collected to ensure the accuracy of the calculation.
Moreover, the literature also did not fully consider the
possibility of other age groups participating in
screening. Duan et al. also adopted a Logistic
regression model, but only considered the
relationship between sleep time and diabetes, and did
not investigate more factors affecting the onset of
diabetes (Duan et al., 2019). Deng et al. used the PSO-
BP neural network model which combines Particle
Swarm Optimization (PSO) algorithm and
Backpropagation (BP) neural network, but this model