
cables, metro-connects, and Layer1 infrastructure had
a substantial impact. Additionally, although the num-
ber of DoS attacks was relatively low, they exhibited
a high impact.
Table 2: Variables used in impact and likelihood calculation
(non-wan percentages).
Type of outage cases incidents IS RV
WAN link issues 85% 25% 3.4 2.9
Equipment 53% 43% 1.2 0.64
Optical and fibers 46% 57% 0.8 0.37
Malicious attacks 1% 0.01% 100 1
Table 2 serves as a valuable reference point for the
operator, prevalence of short outages prompts a recon-
sideration of their significance in outage management
strategies. Simultaneously, the infrequency yet high
impact of malicious attacks underscores the need for
targeted security measures.
7 CONCLUSION
In this study, our main objective was to investigate
and analyze network outage risks and their impact
on a global Internet Service Provider (ISP). Through
analysis of passive and active outage measurement
data and examination of customer cases, we gained
valuable insights into the causes and consequences of
network outages.
Our investigation identified packet loss and out-
ages in leased Layer 2 WAN links as the primary con-
tributors to network incidents. While the definitive
causes of these outages were not ascertained, factors
such as physical fiber outages, equipment failures,
and maintenance/human error are likely contributors.
Equipment maintenance and failures emerged as sig-
nificant causes of outages, representing a substantial
portion of incidents.
The relatively low number of cases (2855) com-
pared to the total incidents (700,000) is a result of the
implementation of fast failover mechanisms and a de-
liberate focus on achieving “fail open” risk reduction
strategies.
Consistent with observations in (Govindan et al.,
2016), malicious attacks were nearly absent from the
data, with only a minimal number of customer com-
plaints attributed to such attacks. This suggests that
the existing security measures implemented by the
ISP have proven effective in mitigating this specific
risk.
Our impact evaluation highlighted the significant
consequences of Layer2 WAN outages and optical
failures. Although the number of Denial-of-Service
(DoS) attacks was relatively low, they exhibited a high
impact when they did affect the service.
The findings of this study have important implica-
tions for network operators and service providers. By
understanding the key causes of outages and their im-
pact, operators can prioritize their resources and ef-
forts to effectively mitigate risks and minimize dis-
ruptions. Additionally, the near absence of mali-
cious attacks emphasizes the importance of maintain-
ing robust security measures to prevent potential fu-
ture threats.
It is important to acknowledge the limitations of
this study. Our analysis focused on one specific ISP,
and the findings may not be generalizable to other net-
work operators. Furthermore, the underlying causes
of certain outages were not definitively identified,
warranting further investigation.
To further advance research in this area, future
studies could explore the specific mechanisms and
root causes of different types of outages, allowing for
more targeted risk mitigation strategies. Additionally,
examining the effectiveness of various security mea-
sures and their impact on reducing the likelihood and
impact of outages would provide valuable insights for
network operators.
In conclusion, this study has shed light on the risks
and impacts associated with network outages for a
global ISP. We show that the most important focus
area is the physical layer, in making sure that outages
of cables and equipment are handled. Outages caused
by malicious attacks have a high impact, but do not
significantly contribute to the number of outages.
By leveraging this knowledge, risk management
can be performed continuously at an operational
stage. Impact Score can be easily calculated, and the
number of cases can be reported. This way network
operators can ensure the continuity of their services,
minimize disruptions to customers, and maintain a se-
cure and reliable network infrastructure. Ultimately,
this research contributes to the broader understanding
of network outage risks and supports efforts to en-
hance network security and reliability in an increas-
ingly interconnected world.
ACKNOWLEDGEMENTS
Language improvements by (OpenAI, 2022).
REFERENCES
Aceto, G., Botta, A., Marchetta, P., Persico, V., and Pescap,
A. (2018). A comprehensive survey on internet out-
Outage Risks: It is not the Malicious Attacks that Take Down Your Service
81