operation, it would still be a significant planning
burden.
Given current technology, an alternative area
where AI can be applied more immediately without
the extensibility issue is factory-based manufactured
construction. In this context, the lifespan of an AI
agent should be relatively long and endure until any
reconfiguration of the factory system is required or a
change occurs in its operating environment. Flood
and Flood (2022) undertook a proof-of-concept study
that showed that an RL trained deep artificial neural
network (DANN) can significantly outperform a
hand-crafted rule-of-thumb approach to making
decisions in the control a construction factory. The
focus of their study was factory-based production of
precast reinforced concrete (PRC) components,
where the arrival of batches followed a Poisson
process, the number of components in a batch was
determined stochastically, and all components were
custom designed and therefore varied in their
processing times.
Researchers such as Benjaoran and Dawood
(2005), Chan and Hu (2002), and Leu and Hwang
(2001), have examined ways to optimize precast
reinforced concrete (PRC) component production
using genetic algorithms (GAs). The approach proved
to be effective although heuristic search techniques
like GAs can be computationally demanding, making
them unsuitable for scenarios where decisions need to
be made promptly.
Once trained, RL solutions based on a learned
model like the one developed by Shitole et al. (2019)
can produce prompt solutions to a decision problem.
Several researchers, such as Waschneck et al. (2018),
Zhou et al. (2020), and Xia et al. (2021), have utilized
this method for the control of factory operations and
have observed encouraging results when compared to
conventional approaches like rules-of-thumb.
However, these applications have been beyond the
scope of construction manufacturing, and therefore,
fail to address numerous challenges within this
industry.
This study represents a significant advancement
beyond the proof-of-concept work reported by Flood
and Flood (2022). It conducts a comprehensive
analysis of the impact of the DANN's structure, input
variable selection, and RL algorithm variables on the
system's performance, with the ultimate goal of
optimization. In addition, the RL policy is applied to
a genuine factory scenario, demonstrating its practical
application in a real-world context.
2 PROCESS CONTROL
2.1 Decision Agents
Both controllable and uncontrollable events shape the
trajectory of a construction manufacturing system in
the future and therefore its performance. The
controllable events can be leveraged to direct this
trajectory in a favourable direction for the
manufacturer, maximizing productivity and/or profit.
This is accomplished by making optimal decisions at
critical junctures, such as prioritizing tasks in a queue,
determining when to maintenance equipment, and
allocating machines to processes.
Figure 1 demonstrates how one or more agents
make decisions dynamically throughout the system's
lifetime by monitoring relevant variables that define
the system's current state (s
t
), and utilizes this
information to determine appropriate actions at the
next state (s
t+1
). While the agent's actions are
generally focused on the immediate future to make
use of the most relevant and accurate information,
they may also extend to events further in the future
for decisions with long lead times. The decisions
made by the agents will affect the performance of the
system over time.
Figure 1: Process control by a Decision Agent.
Decision agents can be categorized as search-
based or experience-based entities (Flood & Flood,
2022). Search-based agents, including blind and
heuristic methods, adopt a systematic approach to
explore the solution space in search of the optimal
action. They create a solution that is tailored to the
specific problem instance at hand, which can
potentially lead to better optimized solutions than
experience-based agents. Moreover, search-based
agents are highly adaptable, allowing them to be
easily modified to new versions of the problem.
However, they may not be suitable for situations that