mask use by WHO since the organization
recommended the use of a non-medical mask for all
the 10 activities selected for the indicator no matter
how safe or risky it is (WHO, 2020a). No protective
gear is needed for a risk level above 75% because
any activity at this range would be too risky since
the chances of meeting an infected person are high.
Therefore, people should not partake in a particular
activity with that kind of risk level. For the study,
since it is a traffic indicator like system, green,
yellow, and red would be used instead of a four-
level indicator. Essentially, the risk level of the
proposed example can still be retained which would
make the 2nd and 3rd levels become subcategories
of yellow. This would mean that the range of each
level, including the 1st and last one, would not
change and the precautionary measures for each
level would also be the same. Table 2 contains a
summary of the risk level classification used in the
study. To clarify, the original classification of risk
level proposed in the COSRE paper was only an
example and it is not verified using real exposure
data yet (Sun, 2020b). As mentioned in COSRE the
paper and up until now, real-world exposure data is
scarce due to the pandemic. These real-world
datasets are relatively sensitive and hard to retrieve
at present. Since there is access to Philippine case
data, this can be used to test the model in the
absence of actual exposure data.
3.5 Testing the Model
To check the validity and effectiveness of the chosen
factor, a correlation between the factor of the
indicator and COVID-19 data was done. To be
specific, the University of the West of England
(UWE) stated Pearson's correlation coefficient (r)
would be used to measure the strength of the
association between two variables (UWE, n.d.). The
correlation coefficient ranges from -1 to 1 and as r
goes towards 0, the relationship between the two
variables will be weaker. A perfect degree of
correlation has a value near ± 1 and as one variable
increases, the other variable also increases (if
positive) or decreases (if negative) (Statistics
Solutions, n.d.). Furthermore, a high degree
correlation has a coefficient value that lies between
± 0.50 and ± 1. A moderate degree of correlation has
a value that lies between ± 0.30 and ± 0.49.
Moreover, a low degree of correlation has a value
that lies below ± 0.29. The last degree of correlation
would be a coefficient value of 0 which does not
correlate. The data correlated to the computed
PoMSI per region are the 7-day moving average of
the daily growth rate of COVID-19 cases (7-DMA
of DGR) and the cumulative cases of COVID-19.
The formula for cumulative cases is just the sum of
all the cases for the specific region up to the specific
point in time indicated. Both types of data can be
derived from the dataset in the DOH data drop
(DOH, 2020a). The values for the 7-DMA of DGR
and cumulative cases are both the week after the
particular week chosen to compute PoMSI.
Moreover, since the 7-DMA of DGR is a single
value and only the cumulative cases of the 7th day
of the week were used which is also a single value,
PoMSI was computed using the 7-DMA of the
cumulative active cases of the week chosen. Python
(Google Colab) was used to extract data from the
DOH data drop CSV file and to compute the
necessary computations needed (active cases,
PoMSI, DGR, cumulative sum of cases, etc.). The
range of the data taken in the DOH dataset was from
April 1 to September 1. PoMSI was computed per
region based on April to August data from the DOH
data drop. The computations were done weekly and
August ended on the 6th day that was why the range
of the data used reached September 1. It is worth
noting that there are inconsistencies present in the
Data Drop like unstandardized region names,
nonuniform date formats, and missing recovery
dates. For the missing recovery dates, an
approximation of recovered cases was done. All
cases after 14 days that were not considered as dead
were tagged as recovered (DOH, 2020b). Rather
than using 100% occupancy, which is unlikely
during a pandemic, 50% occupancy was used to
better simulate physical distancing in an
establishment as seen in Table 1 and this type of
occupancy restriction is usually utilized during the
modified general community quarantine (MGCQ) in
which most businesses, that handles the activities
included in the indicator, can operate (Crismundo,
2020). The CSV file output of the Python code was
then imported to Google Sheets to do the correlation
attempts (vs. DGR and vs Cumulative Cases). To get
the Pearson's correlation coefficient (r), the Pearson
correlation formula was used in Google Sheets and
correlation was done per region and activity.
4 RESULTS AND DISCUSSION
As stated in the Methods section of the paper, the
correlation was done using the Pearson correlation
formula in Google Sheets. Based on the results of
the correlation process, the correlation coefficients
(r) of PoMSI (per region and activity) versus the