models, have recently been developed by researchers
to identify the similarities in the bridge deteriorations
with respect to various bridge parameters (Chetti &
Ali, 2020; Chetti & Ali, 2019; Fuchsberger & Ali,
2017; Chetti et al., 2021).
Correlation network models (CNM) have
demonstrated their effectiveness in various domains
such as social networks and finance. In the context of
social networks, CNM has been used to identify early
opinion leaders on platforms like Twitter during the
COVID-19 pandemic (Hatami et al., 2021).
Similarly, in finance, correlation networks, combined
with population analysis, have been employed to
analyze the impact of crises on different sectors
(Hatami et al., 2023). In recent years, several
researchers have leveraged CNM and population
analysis to highlight the advantages of using
population analysis for identifying enriched
parameters and estimating inspection frequencies of
bridges within specific clusters parameters (Chetti &
Ali, 2020; Chetti & Ali, 2019; Fuchsberger & Ali,
2017). Additionally, it has been emphasized that
smart big data pipelines are necessary to tackle the
challenges associated with civil infrastructure in the
United States (Gandhi et al., 2018). Existing literature
indicates that the combination of CNM and
population analysis serves as a robust, big data model
for visualizing clusters of bridges and their
deterioration rates parameters (Chetti & Ali, 2020;
Chetti & Ali, 2019; Fuchsberger & Ali, 2017; Chetti
et al., 2021).
CNM was introduced by Chetti et al., (Chetti et
al., 2021) for analyzing safety and performance
factors in civil infrastructures, specifically focused on
highway bridges from the United States. The study
utilized correlation network models within
population analysis to understand the impact of
various parameters on bridge safety and deterioration
rates. In their study, Chetti et al. proposed a
population analysis approach, which involves
assessing the performance of an individual element or
community/cluster in comparison to a group of peers
or communities. The methodology they proposed
includes identifying significantly enriched
parameters for different bridge communities, as
illustrated in Fig. 1. The process consists of three
main steps: dataset preparation, population analysis,
and validation. Within the population analysis, three
specific steps are involved, namely creating a
similarity/correlation network, identifying candidate
clusters (CCs), and applying enrichment analysis.
Using a Spearman ranking correlation coefficient
of at least 0.90, a correlation network was created for
the condition ratings data, with bridges as nodes and
condition rating relationships as edges. The threshold
of correlation coefficient .90 is taken to capture the
clusters with bridges that have very high similarity in
their deterioration behavior. The Markov clustering
(Dongen, 2000) method was used to discern CCs, and
an inflation value of 1.9 was chosen for its modularity
and average deck condition rating differences. Of the
initial 17 clusters, 10, with a median size of 10 or
above, were viewed as CCs, constituting 233 out of
268 bridges as shown in Fig.2. Enrichment analysis
revealed a significant overrepresentation of input
parameters in seven CCs, as shown in Table 1. These
clusters were further divided into above-average and
below-average groups, each associated with
geographical regions, materials used, and factors
indicating high traffic usage and maintenance
effectiveness. However, a common limitation of
CNM for civil infrastructures is its validity. This
current study extends the work done in (Chetti et al.,
2021)
with a validation step, where the validation is
done by comparing CNM produced results with a
linear regression model and through a robustness
analysis.
2 METHODOLOGY
The verification of outcomes from a CNM, using a
population analysis strategy, can be undertaken in
three distinct ways. Initially, it can be accomplished
by scrutinizing existing scholarly works to identify
recurring themes or corroborating evidence. The
second technique contrasts the findings generated by
the CNM against those obtained from a linear
regression model by looking for alignment between
the two. Finally, implementing a robustness analysis
provides a means to assess the consistency and
reliability of the results under various conditions or
presumptions. This article primarily concentrates on
the latter two techniques, as the results were validated
using the existing literature about the durability of
concrete decks using the study done by Chyad et. al.
(Chyad et al., 2018).
2.1 Validation Using Simple Linear
Regression Model
Simple linear regression model can also be used to
validate the deterioration patterns of the candidate
clusters. The dependent variable (Y) is the condition
rating, and the independent variable (Age) is the time
in years. So, the regression equation is:
Y = b0 + b1 * Age (1)