Modelling Defence Planning as a Sequential Decision Problem

Carolyn Chen

, Mark Rempel

and Kendall Wheaton

Centre for Operational Research and Analysis, Defence Research and Development Canada,

60 Moodie Dr., Ottawa, Canada

Keywords:

Defence Planning, Sequential Decision Problem, Mathematical Modelling, Military.

Abstract:

Defence planning in a nation’s defence organization is a complex process that requires considering hundreds

of projects and billions of taxpayer dollars. While a variety of methods, such as integer programming and

genetic algorithms, are used in practice to help decision makers select in which projects to invest, their appli-

cation tends to not account for the fact that these decisions are made sequentially and under uncertainty. In this

paper, we present the ﬁrst steps towards developing a sequential decision model of the Canadian Department

of National Defence’s multi-gate Project Approval Process that addressed both issues. Our contributions are

twofold. First, using the universal modelling framework for sequential decisions we present a mathematical

model that accounts for the sequential nature of project selection, arrival of new projects over time, and un-

certainty in future budgets. In addition, we extend this model to account for the uncertainty in how a project’s

cost changes over time when its selection is delayed. Second, we demonstrate how these models may be

used to compare the effectiveness of three project selection decision policies, namely a ranked list approach, a

knapsack approach, and a knapsack approach that reserves a contingency fund for future projects.

1 INTRODUCTION

Defence planning is a critical activity that aims to

help nations achieve both their short- and long-term

defence and security objectives. However, selecting

the right capabilities, and thus projects, in which to

invest is not always a straightforward process. This

is due to a variety of factors—ﬁnancial constraints,

regulatory constraints, multiple conﬂicting objectives,

interdependence between projects, cost uncertainty,

etc.—that often introduce a high-degree of complex-

ity to the selection process. In addition, the planning

process often must consider various project types si-

multaneously, including those focused on information

technology, equipment, infrastructure, as well as sup-

port contracts (Rempel and Young, 2017). As a result,

the project delivery timelines considered may vary

signiﬁcantly. For example, the Canadian Defence In-

vestment Plan 2018 included projects with both near-

and long-term delivery dates ranging from 2022 to

2038 (Government of Canada, 2019).

In order to accommodate the wide range of time-

lines, planning processes “must consider a temporal

dimension including immediate activities to possible

https://orcid.org/0000-0003-0209-1666

https://orcid.org/0000-0002-6248-1722

demands a few decades into the future”, and as a re-

sult occur in “an inherently uncertain and often un-

stable external environment where defence organisa-

tions are required to reorganise for, and respond to,

unpredicted turns of events” (Filinkov and Dortmans,

2014, p. 76). With this in mind, many defence plan-

ning problems may be aptly described as sequential

decision making problems under uncertainty. Fur-

thermore, when decision makers are seeking a port-

folio, the defence planning problem may be described

as a multi-period portfolio optimization problem (Salo

et al., 2024) and modelled as a knapsack problem (Lo-

catelli, 2023).

The classic approach to solve a sequential deci-

sion problem is to model it as a Markov Decision

Process (MDP) (Puterman, 2005) and use Dynamic

Programming (DP) (Bellman, 1957) to ﬁnd a decision

policy—“a rule (or function) that determines a deci-

sion given the information available” (Powell, 2011,

p. 221)—that makes decisions which result in the sys-

tem performing optimally with respect to a given cri-

terion or objective. However, in many real-world sit-

uations this is not feasible due to the curse of dimen-

sionality (Kuo and Sloan, 2005) and the curse of mod-

elling (Bertsekas and Tsitsiklis, 1996).

Given these limitations, alternative approaches

such as linear programming (and its variants—integer

156

Chen, C., Rempel, M. and Wheaton, K.

Modelling Defence Planning as a Sequential Decision Problem.

DOI: 10.5220/0013248200003893

In Proceedings of the 14th International Conference on Operations Research and Enterprise Systems (ICORES 2025), pages 156-164

ISBN: 978-989-758-732-0; ISSN: 2184-4372

programming, mixed-integer programming, etc.), ge-

netic algorithms, tabu search, and other novel heuris-

tics have been used to provide decision support. This

is evidenced by a recent review of 54 application-

based articles that described the use of such ap-

proaches for portfolio optimization in a defence con-

text (Harrison et al., 2020, see Table 1). Further anal-

ysis of these articles reveals that while many address

sources of uncertainty in their respective problems,

only ﬁve articles (Crawford et al., 2003; Tsaganea,

2005; Fisher et al., 2015; Shaﬁ et al., 2017; Moallemi

et al., 2018) account for both uncertainty and new

arriving projects which are selected for investment

over time—thus, modelling their respective problems

as sequential decision problems. Inspired by Fisher

et al. (2015), who studied a capital investment plan-

ning problem in the context of the Royal Canadian

Navy, this paper presents the ﬁrst steps towards de-

veloping a sequential decision model of the Canadian

Department of National Defence (DND)’s multi-gate

Project Approval Process (PAP).

This paper’s main contributions are twofold. First,

using the universal modelling framework for sequen-

tial decisions (Powell, 2022), two sequential deci-

sion models are presented: (i) a model that accounts

for the sequential nature of project selection, the ar-

rival of new projects over time, and uncertainty in fu-

ture budgets; and (ii) an extension of the ﬁrst model

which incorporates the uncertainty in how a project’s

cost changes over time when its selection is delayed.

Both the uncertainty in the future years’ budgets and

changes in a project’s costs were not represented in

Fisher et al. (2015), and thus this paper extends the

existing research in terms of modelling. Second, we

demonstrate how these models may be used to com-

pare the effectiveness of three project selection deci-

sion policies, namely a ranked list approach, a knap-

sack approach, and a knapsack approach that reserves

a contingency fund for new projects arriving in future

years.

The remainder of this paper is organized as fol-

lows. Section 2 presents relevant background infor-

mation. Section 3 describes the defence planning

scenario considered in this paper. The mathemati-

cal model that formulates the scenario as a sequen-

tial decision problem and its extension are given in

Section 4. The results from a series of computational

experiments that demonstrate how these models may

be used to provide decision support are given in Sec-

tion 5. Lastly, a conclusion is provided in Section 6.

2 BACKGROUND

Within the Canadian DND/Canadian Armed Forces

(CAF), hereafter referred to as DND/CAF, strategic

planning is performed by the Chief of Force Develop-

ment (CFD) on behalf of the Vice Chief of Defence

Staff (VCDS) (Government of Canada, 2024b). This

process requires the preparation of long-range plans

that identify future requirements for defence. Project

and force development staff across the DND/CAF

then prepare project proposals to address these fu-

ture requirements. The major capital project propos-

als (>$10 million CAD) are evaluated through the

Capital Investment Program Plan Review (CIPPR)

process for continuation through the PAP. This is de-

scribed in further detail in subsection 2.1.

The time required to bring a proposal through the

strategic planning process can vary signiﬁcantly due

to both internal and external reasons. The future

requirements upon which proposals are based will

evolve over time and a requirement may change sig-

niﬁcantly while a project proposal is being developed

or during the PAP. These uncertainties are discussed

in subsection 2.2.

2.1 Project Approval Process

The steps a major capital project must follow through-

out its life-cycle within the DND/CAF are depicted

in Figure 1.

A project starts with a proposal submitted to the

annual CIPPR process by its internal sponsoring or-

ganization. The sponsoring organization (henceforth

referred to as sponsors) can be military (e.g., Army,

Navy, Air Force) or a supporting organization (e.g.,

Materiel, Defence Research and Development, Infras-

tructure and Environment). In this intake process,

projects identify the capability gaps they address, pro-

vide cost estimates and tentative project timelines,

and the project is evaluated by subject matter ex-

perts to determine its overall value (i.e., beneﬁt) to

the DND/CAF. At the end of the CIPPR process, a

decision is made whether to allow the project to con-

tinue through to the ﬁrst phase of the PAP (Identiﬁ-

cation), to place the project on a waiting list, or to

remove the project from consideration. Decision sup-

port tools, such as the Visual Investment Optimiza-

tion and Revision (VIPOR) or Strategic Portfolio An-

alyzer with Re-conﬁgurable Components (SPARC),

are used to support this decision process (Rempel

and Young, 2017; Chen and Wheaton, 2024). As the

project continues through the PAP, the project is as-

sessed at the end of each phase and a decision is made

on the project’s continuation by a designated govern-

Modelling Defence Planning as a Sequential Decision Problem

157

Implementation

Close-out

DefinitionIdentification

CIPPR

Options

Analysis

Options

Analysis

Project Approval Process

Army

Navy

Air Force

etc.

Figure 1: Project pathway from inception within the sponsoring organization, to the intake process (CIPPR), and through the

ﬁve phases of the PAP.

ing board (Balkaran, 2021). This creates a multi-gate,

sequential decision process. The project is complete

once it ﬁnishes the Close-out phase.

2.2 Uncertainties in Defence Planning

It is important to understand the uncertainties associ-

ated with long-term planning in defence as they can

have a large impact on the delivery of new capabil-

ities. Uncertainties are caused by many factors—

the evolving nature of warfare as new technolo-

gies emerge and new tactics are devised (Roncolato,

2022), the plans and policies of Canada’s close Allies

and the North Atlantic Treaty Organization (NATO)

(North Atlantic Treaty Organization, 2022, 2024),

and the strategic threats to the peace, security and

sovereignty of the nation (Government of Canada,

2024a)—often causing projects to require more time

and cost than originally estimated. Inﬂation alone will

change a project’s cost if planning takes years longer.

Changes in the project scope can also happen, affect-

ing the cost. Finally, it is often the case that the cost

estimates for projects tend to be optimistic (U.S. Gov-

ernment Accountability Ofﬁce, 2020, Ch. 1).

Consider the example of the Maritime Helicopter

project. Planning for a new maritime helicopter be-

gan in the 1980’s, a contract was signed in 1992, and

the project was subsequently cancelled in 1993 after a

change in the federal government (Rossignol, 1998).

New planning in the 1990’s led to a contract being

signed in 2004. The delivery of the new helicopter

commenced in 2015 (Government of Canada, 2022).

This shows how political uncertainties caused a sig-

niﬁcant increase in a project’s timeline.

The Canadian Surface Combatant (CSC) project

is an example of cost uncertainty. In a recent

study (Ofﬁce of the Parliamentary Budget Ofﬁcer,

2021), the estimated cost for the CSC was reported

to be $26.2 billion in 2008, almost $62 billion in

2017, and to $69.8 billion in 2019. After awarding

the contract in 2019, this 2021 study estimated the

cost at $77.3 billion, a 295% increase from the initial

amount.

3 PROBLEM DEFINITION

The defence planning scenario in this paper is based

on Fisher et al. (2015) and the CIPPR process de-

scribed in subsection 2.1. While the PAP includes

multiple decisions through which a project must pro-

ceed, by focusing on the CIPPR process, this paper

limits the defence planning scenario to a single stage

process in which projects are either approved or not

approved.

For this scenario, given Canada’s defence policy

(Government of Canada, 2024a) and recent defence

capability plans (Government of Canada, 2019), sup-

pose that at the start of a given ﬁscal year a set of

candidate projects are put forth by sponsors. Each

project is seeking multi-year funding from the avail-

able defence budget. To help decision makers select

which candidate projects to fund, information on each

project is collected from its sponsor: its value to de-

fence, and its purchase cost which is distributed over a

number of ﬁscal years. In addition, the available bud-

get for 20 years is provided by the Assistant Deputy

Minister (Finance) organization.

With this information, a decision policy (see Sec-

tion 4) is used by decision makers to select those can-

didate projects that will be funded, and those that will

not. Following this decision, the funding associated

with the selected projects is removed from the avail-

able budget in both the current and respective future

ﬁscal years, and projects that were not selected are

added to the set of candidate projects for the next

ﬁscal year—albeit with their value reduced due to

their implementation being delayed. This reduction in

value may be caused by a variety of reasons, including

improvement of adversarial capabilities rendering the

project less effective, a shift in defence policy render-

ing the project less important, or investment in similar

capabilities by other sponsors or Allies reducing the

project’s value by making it redundant.

At the start of the next ﬁscal year, sponsors pro-

pose new candidate projects, each with their own

value and cost, which are added to the set of unse-

lected projects from the previous year. In addition,

changes to the remaining uncommitted budget are

provided by the Finance organization, which may be

due to a variety of factors, such as Government-wide

cost cutting measures or the addition of new funds due

ICORES 2025 - 14th International Conference on Operations Research and Enterprise Systems

158

to changes in the defence policy. Decision makers

must again apply a decision policy, like in the pre-

vious ﬁscal year, to select which candidate projects

to allocate funding. The cycle then continues as de-

scribed above.

Given the uncertain number of candidate projects

put forth each ﬁscal year, lack of knowledge of their

value and cost beforehand, and that investment de-

cisions are made throughout the planning horizon,

this problem is aptly described as a sequential de-

cision making problem under uncertainty. The fol-

lowing section formulates two models using the uni-

versal modelling framework for sequential decisions

(Powell, 2022). Section 5 presents a hypothetical case

study which demonstrates how the models may be

used to provide decision support.

4 SEQUENTIAL DECISION

PROBLEM FORMULATION

This section formulates the defence planning sce-

nario introduced in the previous section as a se-

quential decision problem. To do so, the universal

modelling framework for sequential decisions is em-

ployed. This framework consists of three components

(Powell, 2022, pp. 10-14):

• a sequential decision model that describes the

state variable, decision variables, exogenous in-

formation that arrives after a decision is made, a

transition function that deﬁnes the dynamics of

how the system evolves from one state to the next,

and an objective function;

• the stochastic modelling that describes the uncer-

tain information contained within the problem’s

initial state and exogenous information; and

• the decision policies to be explored.

The remainder of this section presents these three

components applied to the problem described in Sec-

tion 3.

4.1 Sequential Decision Model

Throughout the defence planning horizon, typically

20 years, there are a set of decision epochs T

which decisions are made regarding projects, and a

set of budget epochs T

in which budget constraints

are enforced, where T

⊂ T

. For this problem, an

epoch occurs every year throughout the planning pe-

riod. The size of the set T

is determined by the

length of the funding requirements of the candidate

projects and generally exceeds the number of deci-

sion epochs. This aims to address end effects associ-

ated with defence planning in terms of ﬁnancial com-

mitments, thus eliminating “outrageous behavior at

the end of the planning horizon” (Brown et al., 2004,

p. 422).

At each decision epoch t ∈ T

there exists a set of

candidate projects P

. For each project i ∈ P

, there is

an associated information vector p

t,i

that contains two

elements: the value of the project v

t,i

; and the yearly

cost of the project c

t,i

= (c

t,i,t

′

)

′

≥t,t

′

∈T

The state variable is then deﬁned as

= (P

, B

), (1)

where P

= (p

t,i

)

i∈P

with element p

t,i

being deﬁned

as above, and B

= (b

t,t

′

)

′

≥t,t

′

∈T

is a vector of bud-

gets.

At each decision epoch t, the decision regarding

which candidate projects are selected is deﬁned as

= (x

t,i

)

i∈P

: x

t,i

= 1 if project i is selected, and

0 otherwise. The decision vector at epoch t is con-

strained by the available budgets,

∑

i∈P

t,i

t,i,t

′

≤ B

′

, t

′

≥ t, t

′

∈ T

, (2)

collectively labelled as X (S

The state transition function is deﬁned as S

t+1

, x

t+1

), where W

t+1

is exogenous informa-

tion that arrives after the decision x

is made. This

includes new candidate projects

t+1

, where for each

project i ∈

t+1

there is an associated vector ˆp

t+1,i

deﬁned as ( ˆv

t+1,i

, ˆc

t+1,i

) with the components being

deﬁned as those in p

t,i

. Thus, W

t+1

is given as

t+1

= (

t+1

), (3)

where

t+1

= ( ˆp

t+1,i

)

i∈

t+1

is a vector of new project

information vectors as described above, and

t+1

(

t+1,t

′

)

′

≥t+1,t

′

∈T

is a vector of changes in the re-

maining uncommitted budgets in upcoming years. It

should be noted that the exogenous information de-

pends on the decision at time t; that is, W

t+1

de-

pends on the post-decision state S

= S

, x

) (Pow-

ell, 2022, p. 580).

Given x

and W

t+1

, the transition function then:

reduces the value of projects not selected in year t by

10%; combines the set of unselected projects with the

newly arrived projects to determine the set of candi-

date projects in year t + 1; and updates the budgets in

year t +1 onward.

When a decision x

is made, a contribution (or re-

ward) is received and is deﬁned as

C(S

, x

) =

∑

i∈P

t,i

. (4)

Modelling Defence Planning as a Sequential Decision Problem

159

Given this contribution function, the objective is

then to maximize the expected cumulative value of

the selected projects. The objective is given as

max

π∈Π

∑

t∈T

C(S

, X

))|S

, (5)

where: π is a label that carries information about a

decision policy; Π is the set of all decision policies

considered; X

) represents the implementation of

a decision policy that returns the decision x

for each

state S

∈ S and is bounded by the constraints X

);

is the initial state that includes the initial budgets

and projects P

; and E is an expectation operator

that is over all uncertainties within the problem.

This model, labelled as Model-I, describes the

problem discussed in Section 3. However, this for-

mulation can be extended to include a range of addi-

tional uncertainties, such as: injection of additional

funding beyond yearly ﬂuctuations due to political

uncertainty, new defence policies, etc.; cancellation

of projects from the set of previously selected projects

due to advancement of adversarial capabilities, geo-

political changes, etc.; and updates to the cost of pre-

viously unselected projects due to inﬂation, changes

in project scope, etc. These additions require changes

to the state transition function and exogenous infor-

mation, and in the case of the cancellation of projects

the state variable as well.

In this paper we focus on extending Model-I by

updating the costs of projects not selected at epoch t

and using these updates in epoch t + 1. This model,

labelled as Model-II, requires that W

t+1

to be modi-

ﬁed such that

t+1

= (

t+1

), (6)

where

t+1

= ( ˆc

t+1,i,t

′

)

′

≥t+1,t

′

∈T

is a vector of up-

dated yearly costs for the set {i|i ∈ P

, x

t,i

= 0}. In

addition, the transition function must be modiﬁed to

update the costs of projects not previously selected.

4.2 Stochastic Modelling

As described in the previous section, in Model-I the

exogenous information that arrives after a decision is

made includes two stochastic components: the arrival

of new projects, and changes in the future years’ bud-

gets.

In this study,

t+1

is based on a data set of

198 projects collected in the 2022 CIPPR process and

includes the project value, total cost, and duration.

Three variations of each project were added to the

original data set (total of 792 sample projects), each

with their own value, total cost, and duration such that

each was scaled using a random value from a uni-

form distribution ranging between ±20%. This set

was then randomly sampled with replacement to cre-

ate the new projects arriving in both the initial state S

and exogenous information W

t+1

The number of new projects arriving in a given

year was modelled using a binomial distribution, with

a maximum of 20 projects and each having a 50%

probability of arriving. Distribution parameters are

notional. While project data was available for years

2018-2022, inconsistencies in annual data collection

methods made it difﬁcult to determine the number of

new projects arriving. Each project’s total cost was

then split over the duration of the project using a tri-

angular distribution. The peak of the triangular distri-

bution occurs at half of the duration of the project. In

the case of odd numbered durations, the peak occurs

at half of the duration rounded down to the closest

year.

Lastly,

t+1

is notionally determined. After

projects are selected using a policy X

) and their

costs removed from the current and future years’ bud-

gets, each remaining future year’s budget is scaled by

a random value from a uniform distribution ranging

between ±10%.

Regarding Model-II, the costs of unselected

projects (x

t,i

= 0) that are propagated to the next

epoch t +1 have their costs adjusted prior to doing so.

Each project’s annual costs are scaled by a single ran-

dom value which is sampled from a uniform distribu-

tion ranging from zero to 3.7%. The upper bound was

calculated as a yearly inﬂation percentage to achieve

a 20% increase in total cost over a ﬁve year period.

The random value is then multiplied by: -1 (cost de-

crease) or +1 (cost increase), with a 10% probability

of being -1, and a 90% probability of being +1, there-

fore creating a higher likelihood of a project’s costs

increasing over time.

4.3 Decision Policies

The purpose of the objective function in Equation 5

is to ﬁnd the decision policy that maximizes the ex-

pected value of the projects selected. In this paper we

limit the policies considered to three myopic project

selection policies.

The ﬁrst policy, labelled as the Ranked list pol-

icy and given as X

), is a Policy Function Ap-

proximation (PFA) that “directly [returns] an action

given a state, without resorting to any form of imbed-

ded optimization, and without using any forecast of

future information” (Powell, 2011, p. 221). This pol-

icy ranks candidate projects by value, and selects the

top-ranked projects that ﬁt within the available bud-

ICORES 2025 - 14th International Conference on Operations Research and Enterprise Systems

160

get. The second policy, labelled as the Knapsack pol-

icy and given as X

), is a Cost Function Approxi-

mation (CFA) that aims to maximize the value of se-

lected projects in a given year and “[does] not make

any effort at approximating the impact of a decision

now on the future” (Powell, 2022, p. 535). The third

policy, labelled as the Contingency fund policy and

given as X

C f

, θ), is similar to the Knapsack pol-

icy with the exception that it is parameterized by a

single scalar θ (ranging between zero and one) that

limits the amount of budget that can be used in future

years when making a decision at epoch t. For exam-

ple, θ = 0.4 represents a 40% contingency fund. This

policy requires an adjustment to the constraints X (S

)

listed in Equation 2, speciﬁcally

∑

i∈P

t,i

t,i,t

≤ B

, (7)

∑

i∈P

t,i

t,i,t

′

≤ (1 −θ)B

′

, ∀t

′

≥ t + 1, t

′

∈ T

, (8)

where Equation 7 enables the full budget to be used in

epoch t and Equation 8 ensures that only a portion of

the future budgets are used. Note that the Knapsack

policy is a special case of the Contingency fund pol-

icy when θ = 0. As a Knapsack policy is commonly

known, the two policies are included separately to dis-

tinguish the results.

5 RESULTS

In this section, we demonstrate how the mathematical

models presented in the previous section may be used

to provide decision support. While many questions

may be posed by both planners and decision makers

during the defence planning process, in this section

we consider the following:

• Given Model-I, how do the decision policies de-

scribed in subsection 4.3 perform relative to the

Ranked list policy as a function of the planned an-

nual budget?

• How does the inclusion of project cost uncertainty

(for projects that are not selected and are propa-

gated to the next decision epoch) in Model-II im-

pact the results when compared to Model-I?

To answer these questions, the two models

and the decision policies were implemented in

Python v3.12.6. Given either Model-I or Model-II,

an initial state S

, and decision policy, 50 trials were

executed and the expected cumulative portfolio value

(see Equation 5) and associated 95% Conﬁdence In-

terval (CI) were computed. All trials were exe-

cuted on an AMD Ryzen 5 4500U CPU with Radeon

Graphics running at 2.38 GHz with 16 GB RAM and

using the Windows 10 64-bit operating system.

The aim of the ﬁrst question is to provide deci-

sion makers with insights on: which of the policies

studied maximizes the expected cumulative portfolio

value given an initial yearly budget for a 20-year pe-

riod (B

); and the robustness the recommended policy

to changes in the initial budgets. The results of the

trials are depicted in Figure 2a and the 95% CIs are

listed in Table 1.

The results demonstrate that all policies outper-

form the benchmark Ranked list policy, and the Con-

tingency fund (θ = 0.6) policy performs the best

across all initial budgets studied. While the Contin-

gency fund policy with θ = 0.8 performed similar to

same policy with θ = 0.6, the latter consistently out-

performed the former as listed in Table 2. However,

it is worth noting that as the initial yearly budget in-

creases, the difference between the two policies be-

comes negligible.

The project selection results for each policy were

also examined to investigate why the Contingency

fund policies with higher reserves (θ = 0.6,θ = 0.8)

outperformed the other policies. It was noted that

these policies selected fewer high cost projects since

the available future budgets were limited by the re-

served funding. The total costs in the sample project

data ranged from 10s of millions to 10s of billions

of dollars. Therefore, the selection of a single high

cost project signiﬁcantly reduces the number of lower

cost projects that can be selected in future years. Ad-

ditionally, the project values range less signiﬁcantly

when compared to the total costs and are only mod-

erately correlated to the logarithm of the total cost.

As such, the selection of a high cost project does not

necessarily result in a signiﬁcant increase in the cu-

mulative portfolio value. While all tested policies are

myopic and do not consider future effects, the Contin-

gency fund policy begins to mimic forward-looking

policies by forcing the reservation of funding for fu-

ture projects.

It may therefore be concluded that, for the sce-

nario and policies studied, using a knapsack-based

policy that reserves 60% of the available budget in

future years will provide the most value to defence.

While our scenario differs from that studied by Fisher

et al. (2015)—different project values, different cost

proﬁles, uncertainty in future budgets—our results

generally agree with their conclusion that holding

back budget results in a higher cumulative portfolio

value.

The aim of the second question is to evaluate the

impact that incorporating project cost uncertainty, of

delayed projects (Model-II), has on the policy recom-

Modelling Defence Planning as a Sequential Decision Problem

161

Ranked list

Contingnecy (0.6)

(a) Model-I.

Ranked list

Contingnecy (0.6)

(b) Model-II.

Figure 2: Expected cumulative portfolio value and 95% CI as a function of initial yearly budget for a 20-year period (B

) for

each decision policy described in subsection 4.3.

Table 1: Summary of 95% CIs for the percentage difference between each policy and the benchmark Ranked list policy when

using Model-I. Key: Rl = Ranked list, K = Knapsack, Cf (θ) = Contingency fund with a speciﬁc value of θ. All values are

rounded to two signiﬁcant ﬁgures.

Policy Initial yearly budget for a 20-year period B

(Billions)

0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

Rl - - - - - - - - - -

K [14, 30] [18, 31] [16, 28] [20, 30] [16, 28] [15, 25] [14, 25] [12, 21] [11, 19] [12, 23]

Cf (0.2) [27, 44] [35, 53] [31, 47] [35, 49] [32, 49] [29, 43] [28, 43] [27, 40] [25, 41] [27, 43]

Cf (0.4) [33, 51] [40, 60] [38, 53] [41, 56] [41, 59] [37, 55] [36, 52] [35, 50] [33, 50] [36, 54]

Cf (0.6) [38, 56] [45, 66] [44, 62] [48, 65] [46, 66] [42, 61] [41, 59] [40, 57] [39, 57] [41, 62]

Cf (0.8) [26, 44] [37, 58] [38, 55] [43, 59] [42, 62] [40, 58] [38, 56] [38, 54] [36, 54] [39, 59]

Table 2: Summary of 95% CIs for the percentage difference between the Contingency fund (θ = 0.6) policy and the Contin-

gency fund (θ = 0.8) policy when using Model-I. All values are rounded to one signiﬁcant ﬁgure.

Initial yearly budget for a 20-year period B

(Billions)

0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

[8, 10] [5, 7] [3, 5] [3, 5] [2, 4] [2, 4] [2, 3] [1, 3] [1, 3] [1, 3]

mendation. The results for the trials are depicted in

Figure 2b. Although a detailed analysis is not pre-

sented here, the results (similar to those in Figure 2a)

suggest that the Contingency fund (θ = 0.6) policy

outperforms the benchmark Ranked list policy across

all initial yearly budgets studied and performs better

than when other values of θ are used. The Model-II

results provide further evidence that for the scenario

and policies studied, the Contingency fund (θ = 0.6)

policy is robust with respect to different initial yearly

budgets, changes in future budgets, as well as changes

in costs of those projects there are delayed.

6 CONCLUSION

This paper presented initial work on the develop-

ment of a sequential decision model of the DND/CAF

multi-gate PAP. The objective was to investigate the

impact of different decision policies in a sequential

decision model that explicitly includes uncertainties

in budgets and the project information in each new

epoch.

The results from Model-I support the conclusion

that a Contingency fund policy that reserves 60% of

the available budget for future years performs the

best. A policy that reserves 80% of the available bud-

ICORES 2025 - 14th International Conference on Operations Research and Enterprise Systems

162

get performs almost as well when the initial yearly

budgets are large. Regardless, the Contingency fund

policies outperform both the benchmark Ranked list

policy and Knapsack policy. Model-II demonstrates

that the same decision policy, the Contingency fund

policy that reserves 60%, remains robust to cost un-

certainty in delayed projects and continues to outper-

form the other tested policies.

This initial model demonstrates the potential of a

sequential decision model to determine better deci-

sion policies for defence planning. Three areas of fu-

ture development have been identiﬁed. The ﬁrst area

covers improvements to the sequential model to bet-

ter reﬂect the real-world dynamics of the DND/CAF

PAP. The initial model only covered the ﬁrst step

of the PAP, therefore the model can be expanded

to cover the multiple gates of the PAP. Addition-

ally, the awarding of the value can be delayed to

when the project is delivered, rather than when the

project is selected. The modelled uncertainties can

also be expanded to capture changes in project value

and scheduling over time, as well as uncertainties that

may force the selection or cancellation of a project.

The second area of development covers the deci-

sion policies. The set of policies can be expanded

to include future-looking policies, and reinforcement

learning can be explored as a method for determining

an optimal decision policy. The last area of develop-

ment aims to improve the project dataset. The current

dataset is limited to a single year of projects. This

dataset can be expanded to include additional years

of historical data. Tools, such as natural language

processing, can be explored for extracting data from

historical project documentation. Data augmentation

techniques can also be explored for supplementing the

real project dataset.

The results presented here are encouraging, and

the completion of this investigation should pro-

vide more information to guide the management of

projects under uncertainty in defence planning.

ACKNOWLEDGMENTS

The authors would like to acknowledge the con-

tributions of Ammar Lakdawala, who implemented

an initial version of the sequential decision mod-

els in Python and conducted extensive testing in

spring 2024.

ACRONYMS

CAF Canadian Armed Forces

CI Conﬁdence Interval

CIPPR Capital Investment Program Plan Review

CFA Cost Function Approximation

CFD Chief of Force Development

DND Department of National Defence

DP Dynamic Programming

MDP Markov Decision Process

NATO North Atlantic Treaty Organization

PAP Project Approval Process

PFA Policy Function Approximation

SPARC Strategic Portfolio Analyzer with Re-

conﬁgurable Components

VCDS Vice Chief of Defence Staff

VIPOR Visual Investment Optimization and Revi-

sion

REFERENCES

Balkaran, R. R. (2021). Process alignment: An assess-

ment of dnd project approval process in the 2019

project approval directive. Master’s thesis, Canadian

Forces College, Toronto, Ontario. Retrieved from:

https://www.cfc.forces.gc.ca/259/ 290/23/286/Balka-

ran.pdf.

Bellman, R. (1957). Dynamic Programming. Princeton

University Press.

Bertsekas, D. and Tsitsiklis, J. (1996). Neuro-Dynamic

Programming. Athena Scientiﬁc, Belmont, Mas-

sachusetts, First edition.

Brown, G. G., Dell, R. F., and Newman, A. M. (2004).

Optimizing military capital planning. Interfaces,

34(6):415–425.

Chen, C. and Wheaton, K. (2024). Investigating the Im-

pact of Project Dependencies on Capital Investment

Decisions in Defence. In 17th NATO Operations Re-

search and Analysis (OR&A) Meeting Proceedings.

NATO Science and Technology Organization. STO-

MP-SAS-OCS-ORA-2023.

Crawford, J., Do, J., Leduc, C., Malik, A., Gormley, K.,

Luebke, E., and Scherer, W. (2003). SWORD: the lat-

est weapon in the project selection arsenal. In IEEE

Systems and Information Engineering Design Sympo-

sium, 2003, pages 95–100.

Filinkov, A. and Dortmans, P. J. (2014). An enterprise port-

folio approach for defence capability planning. De-

fense & Security Analysis, 30(1):76–82.

Modelling Defence Planning as a Sequential Decision Problem

163

Fisher, B., Brimberg, J., and Hurley, W. (2015). An ap-

proximate dynamic programming heuristic to support

non-strategic project selection for the Royal Canadian

Navy. The Journal of Defense Modeling and Simula-

tion, 12(2):83–90.

Government of Canada (2019). Defence Investment

Plan 2018. Retrieved 22 October, 2024, from

https://www.canada.ca/en/ department-national-

defence/corporate/reports-publications/defence-

investment-plan-2018.html.

Government of Canada (2022). Status re-

port on transformational and major crown

projects. Retrieved 23 October, 2024 from

https://www.canada.ca/en/department-national-

defence/corporate/reports-publications/departmental-

plans/departmental-plan-2022-23/supplementary-

information/report-crown-project1.html#toc14.

Government of Canada (2024a). Our North, Strong

and Free: A Renewed Vision for Canada’s

Defence. Retrieved 15 October, 2024, from

https://www.canada.ca/en/ department-national-

defence/corporate/reports-publications/north-strong-

free-2024.html.

Government of Canada (2024b). Vice Chief of the Defence

Staff (VCDS). Retrieved 22 October, 2024, from

https://www.canada.ca/en/ department-national-

defence/corporate/organizational-structure/vice-

chief-defence-staff.html.

Harrison, K. R., Elsayed, S., Garanovich, I., Weir, T., Galis-

ter, M., Boswell, S., Taylor, R., and Sarker, R. (2020).

Portfolio optimization for defence applications. IEEE

Access, 8:60152–60178.

Kuo, F. and Sloan, I. (2005). Lifting the curse of

dimensionality. Notices of the American Mathemat-

ical Society, 52(11):1320–1328. Retrieved from:

https://www.ams.org/journals/notices/200511/ fea-

sloan.pdf?adat=December%202005&trk=200511fea-

sloan&cat=feature&galt=feature.

Locatelli, A. (2023). Optimization methods for knapsack

and tool switching problems. 4OR, 21:715–716.

Moallemi, E. A., Elsawah, S., Turan, H. H., and Ryan, M. J.

(2018). Multi-objective decision making in multi-

period acquisition planning under deep uncertainty.

In 2018 Winter Simulation Conference (WSC), pages

1334–1345.

North Atlantic Treaty Organization (2022). NATO

2022 Strategic Concept. Retrieved 23 Octo-

ber, 2024, from https://www.act.nato.int/wp-content/

uploads/2023/05/290622-strategic-concept.pdf.

North Atlantic Treaty Organization (2024). Washing-

ton Summit Declaration. Retrieved 23 October,

2024, from https://www.nato.int/cps/en/natohq/ ofﬁ-

cial texts 227678.htm.

Ofﬁce of the Parliamentary Budget Ofﬁcer (2021). The cost

of Canada’s surface combatants: 2021 update and

options analysis. Retrieved 22 October, 2024, from

https://www.pbo-dpb.ca/en/publications/RP-2021-

040-C–cost-canada-surface-combatants-2021-update-

options-analysis–cout-navires-combat-canadiens-

mise-jour-2021-analyse-options.

Powell, W. (2011). Approximate dynamic programming:

Solving the curses of dimensionality. Wiley, Hoboken,

New Jersey, Second edition.

Powell, W. (2022). Reinforcement learning and stochastic

optimization: A uniﬁed framework for sequential de-

cisions. Wiley, Hoboken, New Jersey.

Puterman, M. (2005). Markov Decision Processes: Discrete

stochastic dynamic programming. Wiley, Hoboken,

New Jersey.

Rempel, M. and Young, C. (2017). VIPOR: A visual analyt-

ics decision support tool for capital investment plan-

ning. Scientiﬁc Report DRDC-RDDC-2017-R129,

Defence Research and Development Canada.

Roncolato, G. (2022). The character of war is

constantly changing: Organizations and peo-

ple who can rapidly and effectively adapt are

more likely to prevail. U.S. Naval Institute

Proceedings, 148/5/1.431. Retrieved from:

https://www.usni.org/magazines/proceedings/2022/may

/character-war-constantly-changing.

Rossignol, M. (1998). Replacement of shipborne and

rescue helicopters. Retrieved 23 October, 2024,

from https://www.publications.gc.ca/Collection-

R/LoPBdP/CIR/943-e.htm.

Salo, A., Doumpos, M., Liesi

o, J., and Zopounidis, C.

(2024). Fifty years of portfolio optimization. Euro-

pean Journal of Operational Research, 318(1):1–18.

Shaﬁ, K., Elsayed, S., Sarker, R., and Ryan, M. (2017).

Scenario-based multi-period program optimization for

capability-based planning using evolutionary algo-

rithms. Applied Soft Computing, 56:717–729.

Tsaganea, D. (2005). Appropriation of funds for anti-

ballistic missile defence: a dynamic model. Kyber-

netes, 34(6):824–833.

U.S. Government Accountability Ofﬁce (2020). Cost esti-

mating and assessment guide: Best practices for de-

veloping and managing program costs. Technical Re-

port GAO-20-195G, U.S. Government Accountability

Ofﬁce.

ICORES 2025 - 14th International Conference on Operations Research and Enterprise Systems

164