Implementation of 12 Transition Controls for Rotary Double Inverted

Pendulum Using Direct Collocation

Doyoon Ju

, Taegun Lee

and Young Sam Lee

Department of Electrical and Computer Engineering, Inha University, Incheon, Korea

{seiko.kr, dlxorjs815}@gmail.com, lys@inha.ac.kr

Keywords:

Rotary Double Inverted Pendulum, Transition Control, Direct Collocation, Optimal Control.

Abstract:

The rotary double inverted pendulum system has one stable and three unstable equilibrium points due to

its kinematic properties. This paper extends the traditional swing-up control problem by deﬁning a novel

transition control problem among these points. We formulate the system’s dynamic equations and boundary

conditions for different equilibrium points to minimize energy consumption during transitions, resulting in a

two-point boundary value optimal control problem. This problem is solved ofﬂine to calculate the feedforward

trajectory for feedforward control. To convert the continuous optimal control problem with constraints into a

nonlinear optimization problem, we employ the direct collocation method. A time-varying Linear Quadratic

controller is used as the feedback controller to accurately track the generated feedforward path during real-time

control, compensating for uncertainties. Previous studies on rotary double inverted pendulums have focused on

the swing-up problem, with no research addressing transition control between the four equilibrium points. This

paper deﬁnes the transition control problem for the rotary double inverted pendulum and proposes a control

strategy. The method’s effectiveness and practicality were validated through the design and implementation

of 12 transition trajectories in experimental settings, successfully demonstrating its feasibility and utility.

1 INTRODUCTION

The inverted pendulum system, with its unstable dy-

namic characteristics, incorporates both nonlinear and

non-minimum phase properties, making it a widely

used educational tool for teaching control theory.

Additionally, it serves as a popular testbed for re-

searchers to validate new control techniques. Ma-

jor studies on the inverted pendulum system include

swing-up control, which transitions the pendulum

from an initial state where it points downward to an

upright state(

Astr

om and Furuta, 2000; Meta et al.,

2014), and balance control, which aims to maintain

stability in the upright state after the swing-up(Oh and

Lee, 2018). Among these, swing-up control is consid-

ered a more challenging research topic compared to

balance control because it requires designing a con-

troller that accounts for the system’s nonlinearity, in-

stability, and input/output constraints.

In the case of the rotary inverted pendulum sys-

tem, unlike the linear inverted pendulum where the

https://orcid.org/0000-0001-7011-6779

https://orcid.org/0009-0007-3107-2735

https://orcid.org/0000-0003-0665-1464

pendulum is ﬁxed to rotate in a single plane, the

powered arm of the rotary inverted pendulum also

rotates. This characteristic allows the pendulum to

move within a three-dimensional space, introducing

additional challenges in performing swing-up control.

To address the swing-up problem of such a rotary in-

verted pendulum system, various methods have been

applied, including self-tuning techniques(Ratiroch-

Anant et al., 2004), PID controllers(Rahairi et al.,

2011), and methods utilizing sliding observers(Thein

and Misawa, 1995). Recently, research has also

been conducted to solve this problem using artiﬁ-

cial intelligence-based controllers(Brown and Strube,

2020; Baek et al., 2024).

However, in the case of the rotary double inverted

pendulum, most studies have implemented swing-

up control using only simulation environments(Liang

et al., 2023; Tran et al., 2024; Singh and Swarup,

2021; Zied et al., 2020). Rarely, when physical

systems were used, research was primarily limited

to balance control after manually performing the

swing-up(Ibrahim et al., 2019; Sondarangallage and

Manukid, 2019). A common limitation in these stud-

ies is the absence of a physical rotary double inverted

pendulum system capable of performing swing-up

Ju, D., Lee, T. and Lee, Y.

Implementation of 12 Transition Controls for Rotary Double Inverted Pendulum Using Direct Collocation.

DOI: 10.5220/0012994200003822

Paper published under CC license (CC BY-NC-ND 4.0)

In Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics (ICINCO 2024) - Volume 1, pages 92-100

ISBN: 978-989-758-717-7; ISSN: 2184-2809

control. Even if such a system exists, there is a lack of

documented successful implementations of swing-up

control in academic literature.

Building on the research laboratory’s extensive

experience in developing various inverted pendulum

systems over many years, the authors aim to solve

the problem by constructing a physical system them-

selves, instead of purchasing a commercially avail-

able rotary inverted pendulum. The most critical ob-

jective in this process is to ensure that the arm of the

rotary inverted pendulum can rotate inﬁnitely, a char-

acteristic feature of the system. To address this, previ-

ous research successfully utilized a slip ring structure

to overcome the rotational displacement constraints

and perform swing-up control(Oh and Lee, 2018).

Building on this experience, the authors’ research lab-

oratory seeks to directly construct a rotary double in-

verted pendulum and implement swing-up control us-

ing the physical system.

In the case of a single-link inverted pendulum sys-

tem, there is only one type of swing-up problem. This

involves moving from the stable equilibrium point

where the pendulum hangs downward to the unstable

equilibrium point in the upright position. However,

when the pendulum has two links, there are two addi-

tional unstable equilibrium points determined by the

angles of each link. These are in addition to the sta-

ble equilibrium points where both links point down-

ward and the unstable equilibrium points where both

links are upright. This introduces not only the simple

swing-up problem but also a newly deﬁned ‘transition

control’ problem between these equilibrium points.

The transitions between the four equilibrium points

created by the two links result in 11 different types of

transitions, excluding the traditional swing-up. This

paper experimentally addresses all 12 transition con-

trols, including swing-up control.

The transition control among the four equilib-

rium points of the rotary double inverted pendulum

used in the experiments can be designed by refer-

encing the swing-up control strategy of the linear in-

verted pendulum. In 2007, Graichen effectively ad-

dressed the swing-up control problem by introducing

a two-degree-of-freedom (2-DOF) control structure

that combines feedforward and feedback control, tak-

ing into account the rail length constraints of the lin-

ear double inverted pendulum(Graichen et al., 2007).

The basic concept of the 2-DOF control structure used

by Graichen involves using the dynamic equations of

the multi-link inverted pendulum to calculate the state

and control input trajectories ofﬂine that lead the pen-

dulum to an upright position. The calculated feed-

forward trajectory is then applied to guide the pendu-

lum to the upright state. During the operation of the

system, the difference between the actual trajectory

and the calculated feedforward trajectory is corrected

through feedback control. This ensures successful

swing-up by closely following the feedforward tra-

jectory. Based on this study, the present paper uses

the direct collocation method(Kelly, 2017) to numer-

ically solve the nonlinear optimal control problem to

ﬁnd the feedforward trajectory for the rotary double

inverted pendulum. The feedback controller design

follows a similar approach to Graichen’s method and

employs Linear Quadratic(LQ) control, an optimal

control technique for time-varying systems. Unlike

Graichen’s method, which assumes a speciﬁc form for

the trajectory, the direct collocation method imposes

no constraints on the trajectory shape, increasing the

likelihood of ﬁnding a numerical solution for the tra-

jectory.

This paper aims to implement all 12 types of tran-

sition control using the physical rotary double in-

verted pendulum system. In Chapter 2, we ﬁrst derive

the dynamic equations of the system using the Euler-

Lagrange equations based on the structural character-

istics of the rotary double inverted pendulum used in

the experiments. Chapter 3 deﬁnes each equilibrium

point and transition control problem and proposes a

method to obtain the feedforward trajectory using the

direct collocation method, which numerically solves

the nonlinear optimal problem. In Chapter 4, we per-

form the 12 transition control experiments using a 2-

DOF controller designed with time-varying LQ con-

trol. Finally, in Chapter 5, we analyze the results to

verify the effectiveness of the proposed method.

2 MATHEMATICAL MODEL OF

THE ROTARY DOUBLE

INVERTED PENDULUM

The rotary double inverted pendulum used in this pa-

per is shown in Figure 1, and Figure 2 illustrates the

mechanical concept of the system.

The unit system used in this paper is the Interna-

tional System of Units (SI), and all variables and pa-

rameters are deﬁned accordingly. Here, θ represents

the rotational displacement from the initial position of

the arm, and u denotes the angular acceleration of the

arm. R

is the distance between the arm and the ﬁrst

pendulum, while r

and r

represent the x-axis direc-

tional distances from the arm to the center of mass of

the ﬁrst and second pendulums, respectively. M

and

are the masses of the ﬁrst and second pendulums,

and l

are the lengths from the rotational axis to the

center of mass of each pendulum, and L

is the length

Implementation of 12 Transition Controls for Rotary Double Inverted Pendulum Using Direct Collocation

Figure 1: Rotary double inverted pendulum system con-

structed by the laboratory.

Arm

First link

Second link

Figure 2: The conceptual diagram of a rotary double in-

verted pendulum.

from the rotational axis of the ﬁrst pendulum to the ro-

tational axis of the second pendulum. α is deﬁned as

the rotational displacement of the ﬁrst pendulum with

respect to the normal of the ground, and β is deﬁned

as the relative rotational displacement of the second

pendulum with respect to the ﬁrst pendulum. Addi-

tionally, c

and c

denote the friction coefﬁcients at

the rotational axes of the ﬁrst and second pendulums,

respectively. Figure 3 illustrates the rotations of the

ﬁrst and second pendulums, and expressions like I

xx1

represent the moments of inertia of the pendulums(Oh

and Lee, 2018). The dynamic model of the rotary

double inverted pendulum can be derived using the

Euler-Lagrange equation and is expressed as follows.





θ +











= 0. (1)

Each component of Equation (1) is deﬁned as follows.

Figure 3: Rotation and inertia I of the ﬁrst and second links.

= h

cos(α) + h

cos(α + β),

= h

cos(α + β),

= h

+ h

+ 2h

cos(β),

= h

+ h

cos(β),

= h

+ h

cos(β),

= h

= −h

sinβ(2

β +

) − h

sinα

− h

sin(α + β) + c

−

{

sin(2α) + h

sin(2α + β)

sin(2α + 2β)},

= h

sinβ(

) − h

sin(α + β) + c

−

{

sin(2α + 2β)

(sin(2α + 2β) − sin β)}.

to h

are deﬁned as follows, and g represents the

gravitational acceleration of 9.81 [m/s

= M

+ M

+ r

) − I

xz1

= M

+ r

) − I

xz2

= I

xx1

+ M

= M

= g(M

+ M

= I

xx2

+ M

= M

+ M

+ I

yy1

− I

zz1

= M

+ I

yy2

− I

zz2

Thus, Equation (1) can be rearranged as follows.





= −





−1





θ +





ICINCO 2024 - 21st International Conference on Informatics in Control, Automation and Robotics

By solving this,

α =

(−m

+ m

)

θ + (−m

+ m

)

β =

− m

)

θ + (m

− m

)

Φ = m

− m

In this context, the state vector is deﬁned as x

= θ,

= α, x

= β, x

θ, x

α, x

β, x

θ(τ)dτ,

and the angular acceleration

θ is represented as u.

Consequently, the model equation of the double in-

verted pendulum can ultimately be expressed as the

following nonlinear state equation. The last element

of the state variable,

θ(τ)dτ, is an additional term

introduced to eliminate the steady-state error in the

position of the arm.







˙x







| {z }

˙x







(−m

)

θ+(−m

)

−m

)

θ+(m

−m

)







| {z }

f (x,u)

(2)

The dynamic model derived in this manner is subse-

quently used as the feedforward trajectory generation

model when applying the direct collocation method.

3 DESIGN AND

IMPLEMENTATION OF

TRANSITION CONTROL

3.1 Transition Control Design

The four possible equilibrium points of the rotary

double inverted pendulum, determined by the states

of the ﬁrst and second pendulums, are depicted in Fig-

ure 4. In this paper, the numbering of equilibrium

points is represented using a binary notation, where

the states Up and Down are designated as 1 and 0, re-

spectively. If the numbering is assigned starting from

the ﬁrst pendulum, the equilibrium point where both

the ﬁrst and second pendulums are in the Down-Down

state corresponds to the binary number 00 and is de-

noted as EP0. Similarly, the equilibrium point corre-

sponding to the Up-Down state is represented by the

binary number 10 and is denoted as EP2. In this pa-

per, the equilibrium points of the inverted pendulum

will be denoted as EP0 (Down-Down), EP1 (Down-

Up), EP2 (Up-Down), and EP3 (Up-Up) according to

this method.

EP 0 EP 1 EP 2 EP 3

Figure 4: Four equilibrium points of a rotary double in-

verted pendulum.

The implemented transition control follows three

steps:

1. Linear control at the current equilibrium point.

2. Transition control from the current equilibrium

point to the next equilibrium point.

3. Linear control at the next equilibrium point.

Each transition process is structurally similar to

the swing-up problem, which involves moving the

pendulum from its initial state (EP0) to the upright

state (EP3). First, linear control is performed to en-

sure the stability of the current state before transition-

ing between the four equilibrium points. Next, tran-

sition control is executed to move from the current

equilibrium point to the next equilibrium point. Upon

reaching the new equilibrium point, additional linear

control is required to maintain stability at that point.

Thus, each transition process must satisfy both ma-

jor elements: linear control at the equilibrium points

and transition control between the equilibrium points.

Since there are a total of 12 different transition trajec-

tories for the double inverted pendulum, experiments

must be designed and conducted for each trajectory to

meet the required conditions.

The 12 transition trajectories presented in this pa-

per are conﬁgured according to the sequence diagram

in Figure 5.

3.2 Implementation of Transition

Control

To calculate the optimal state x

∗

and control input u

∗

required for generating transition control trajectories,

Implementation of 12 Transition Controls for Rotary Double Inverted Pendulum Using Direct Collocation

lG

wGW

OkTP

lG

wGX

OkTP

lG

wGZ

O|TP

lG

wGY

O|TP

[

]

Figure 5: 12-step transition diagram for the double inverted

pendulum.

it is essential to accurately estimate the parameters of

the system depicted in Figures 2 and 3. Parameters

such as M

, M

, I

xx1

, I

xx2

, l

, c

, and c

can be es-

timated using the parameter estimation techniques for

the linear double inverted pendulum(Ju et al., 2022).

The estimation of the inertia tensor can be performed

by referring to the research conducted by Oh(Oh and

Lee, 2018). The parameters of the rotary double in-

verted pendulum obtained through these estimation

processes are summarized in Table 1.

Table 1: Model parameters of the rotary double inverted

pendulum used in the experiments.

Parameter Value

0.187 [kg]

0.132 [kg]

xx1

1.0415e-03 [kgm

]

xx2

8.8210e-04 [kgm

]

yy1

4.3569e-03 [kgm

]

yy2

4.9793e-03 [kgm

]

zz1

3.3179e-03 [kgm

]

zz2

4.8178e-03 [kgm

]

xz1

3.7770e-04 [kgm

]

xz2

1.9823e-04 [kgm

]

0.072 [m]

0.133 [m]

2.4100e-06

1.0900e-06

0.1645 [m]

The transition control between the equilibrium

points of the pendulum is performed using a 2-DOF

control technique that combines nonlinear feedfor-

ward control and feedback control as proposed in

(Graichen et al., 2007). Figure 6 illustrates the 2-

DOF control structure. In the control process, the pre-

computed ideal angular acceleration trajectory u

∗

(t)

is combined with the correction input ∆u(t), which

is calculated based on the error ∆x(t) = x

∗

(t) − x(t)

between the predicted state variables of the inverted

pendulum system and the actual output values. This

generates the actual angular acceleration input u(t) =

∗

(t)+∆u(t). In this paper, the feedforward trajectory

is generated by considering the dynamic constraints

and setting up a nonlinear optimal control problem

that numerically minimizes the desired cost function

while satisfying various constraints. The direct collo-

cation method is used to numerically solve this prob-

lem(Kelly, 2017). This method effectively estimates

the control inputs and state variable paths of contin-

uous dynamic systems and is a proven technique for

obtaining the optimal trajectory to achieve control ob-

jectives.

Signal

generator

Feedback

control

Double

Inverted

Pendulum

Feedforward control

Figure 6: 2-DOF control framework for the double inverted

pendulum.

The direct collocation method is an iterative nu-

merical solution implemented using a nonlinear opti-

mization solver. This method is characterized by the

inﬂuence of the initial trajectory provided by the de-

signer on the solution time of the numerical method

and the resulting trajectory’s shape. By considering

these characteristics and selecting an appropriate ini-

tial trajectory, the designer can effectively apply the

direct collocation method to derive a suitable trajec-

tory for speciﬁc transition control problems. In this

paper, the initial trajectory is designed using the sim-

plest form of a straight-line trajectory connecting the

start and end boundary conditions. Additionally, a

cost function is designed to satisfy all set constraints

and boundary conditions, forming a general nonlinear

optimal control problem. Equation (3) below repre-

sents the optimal control problem set to calculate the

optimal trajectory.

Minimize

u(t) J(x(t), u(t))

subject to input/output constraint,

dynamic equations,

boundary conditions.

(3)

In the above equation, J(x(t),u(t)) represents the

cost function and is deﬁned as an optimal control

problem that satisﬁes the constraints set using the

model parameters of the actual system as listed in Ta-

ble 1. The cost function can be used either to mini-

mize time or to optimize the cost over a given period.

The constraints and limitations applied to this cost

ICINCO 2024 - 21st International Conference on Informatics in Control, Automation and Robotics

function include the dynamic equations of the rotary

double inverted pendulum (2) and the limits speciﬁed

by equation (4), which restrict the maximum distance

the arm can move during the control process, as well

as the maximum angular velocity and input angular

acceleration of the arm within the operational range

of the actuator. These conditions play a crucial role in

ensuring the practical operability and stability of the

control system.

∗

≤ θ

limit

∗

≤

limit

∗

≤ u

limit

(4)

The set limit values are all deﬁned as positive, with

limit

> 0,

θlimit > 0, and ulimit > 0 specifying the

maximum allowable values for the arm’s output dis-

placement, output angular velocity, and input angular

acceleration, respectively. These limit values reﬂect

the actual operational constraints of the rotary double

inverted pendulum used in the experiments, and these

constraints are explicitly stated in equation (5).

limit

= 1.5[rad],

limit

= 7.0[rad/s],

limit

= 50[rad/s

(5)

To satisfy various constraints during the 12 differ-

ent transition control processes, additional constraints

can be applied to the pendulum’s rotation angles α

and β as well as their angular velocities

α and

β.

Furthermore, at the start (t = 0) and end (t = T ) of

the transition control, the boundary conditions cor-

responding to the speciﬁc equilibrium points for the

transition must be satisﬁed. For example, if the cur-

rent equilibrium point is EP0 and the target equilib-

rium point for the transition is EP3, this situation can

be expressed as shown in equation (6).

∗

(0) =



0,−π,0,0, 0, 0,0



∗

(T ) =



0,0,0,0, 0, 0,0



∗

(0) = 0,u

∗

(T ) = 0.

(6)

By utilizing the direct collocation method, the transi-

tion trajectories implemented can be optimized to sat-

isfy the set constraints and boundary conditions dur-

ing the transition process, as illustrated in Figure 7.

4 EXPERIMENTAL VALIDATION

AND RESULTS

The design of the feedback controller for trajec-

tory tracking is similar to the method proposed by

Graichen(Graichen et al., 2007), mentioned in the in-

troduction. However, a key difference in this study

is the application of the optimal control technique,

Figure 7: Feedforward transition trajectory from EP0 to

EP3.

linear-quadratic (LQ) control, for time-varying sys-

tems throughout the entire process. In Graichen’s

study, it was determined that high compensation val-

ues in intervals where the gain coefﬁcient increases

rapidly could lead to a loss of controllability, prompt-

ing the decision to suspend feedback in certain in-

tervals. In contrast, this study maintains a consis-

tent control strategy by utilizing the calculated time-

varying LQ control gain values across all intervals

during the implementation of transition control. The

design of this time-varying LQ controller is based on

the linearized dynamic system centered around the

swing-up trajectory of the double inverted pendulum

system. The state equations used in this process,

which vary with time, used in this process are mod-

eled in a form where the values of A and B are time-

varying, as shown in equation (7).

A(t) =

∂ f

∂x



∗

(t),u

∗

(t)

, B(t) =

∂ f

∂u



∗

(t),u

∗

(t)

. (7)

In this context, x

∗

(t) and u

∗

(t) represent the feed-

forward trajectories for the state and input obtained

using the direct collocation method. The difference

between the obtained feedforward trajectory and the

actual state variable values is calculated as ∆x(t) =

∗

(t) − x(t), and the correction input ∆u(t) generated

to compensate for this error is deﬁned by equation (8).

∆u(t) = −K(t)∆x(t). (8)

In this case, the time-varying state equation can be

expressed as equation (9).

∆ ˙x(t) = A(t)∆x(t) + B(t)∆u(t). (9)

The cost function used is given by equation (10).

Implementation of 12 Transition Controls for Rotary Double Inverted Pendulum Using Direct Collocation

J = ∆x

(T )H

∆x(T ) +

∆x(t)

+∆u(t)

∆u(t)dt.

(10)

Here, the subscript Tr is used to indicate the design

variables of the transition process. In equation (10),

the variables with this subscript must satisfy H

≥ 0,

≥ 0, and R

> 0. The matrix H

represents the

weight on the terminal state, the matrix Q

repre-

sents the weight on the system state, and the matrix

represents the weight on the control input. These

weights are set to the values of equation (11) through

an experimental process.

= diag(1,300,500,1, 1, 1,1),

= 1.

(11)

Additionally, to calculate the time-varying gain K(t),

the differential Riccati equation presented in equation

(12) must be solved.

P(t) = −A(t)

P(t) − P(t)A(t)

+PB(t)R

−1

(t)P(t) − Q

P(T ) = H

(12)

Ultimately, the time-varying gain K(t) can be calcu-

lated using equation (13). The LQ control gain K(t)

applied in the earlier Figure 7 can be seen in Figure 8.

K(t) = R

−1

(t)P(t). (13)

0 0.5 1 1.5 2 2.5

time(sec)

-1500

-1000

-500

500

1000

1500

2000

time-varying feedback gain

Figure 8: Time-varying LQ gain of feedforward transition

trajectory from EP0 to EP3.

To conduct the experiments, the swing-up trajec-

tory obtained using the direct collocation method pre-

sented in Chapter 3 is combined with the LQ control-

based feedback controller to implement the 12 differ-

ent transition controls. The trajectory paths are con-

ﬁgured according to the ﬂowchart in Figure 5, and

the transition paths between each equilibrium point

are designed to move only once. The duration of one

cycle of state transition and linear control at the equi-

librium point is set to 5 seconds, with the time taken

for state transitions varying from a minimum of 1.8

seconds to a maximum of 3 seconds, as shown in

Table 2. After each transition, linear control is per-

formed at the corresponding equilibrium point until

the next transition control begins, to stabilize the sys-

tem. Figure 9 shows a captured image of a YouTube

video that presents the experimental results of 12 tran-

sition controls. The actual YouTube video can be ac-

cessed at https://youtu.be/J8vRJtQ-t3I (Video title: 12

transition controls of a rotary double inverted pendu-

lum (with double reduction timing pulleys), Channel

name: Embedded Control Lab.), where the experi-

mental results can be viewed in detail.

The ﬁrst state transition begins at the 5-second

point, and Figure 10 shows the continuous results of

all transition controls. Each line represents the sim-

ulation model trajectory depicted by solid lines and

the actual experimental trajectory shown by dashed

lines. Due to the compensation effect of the closed-

loop control, the arm’s rotation angle θ and angular

velocity

θ exhibit signiﬁcant differences immediately

after the state transition, but the result graphs for α

and β, which are the most critical variables in transi-

tion control, demonstrate that the predicted paths are

followed with high accuracy. Following this, linear

control at the equilibrium point gradually adjusts the

θ value to approach zero, helping to limit the random

movement of the double pendulum and enabling all

12 transition controls to be successfully executed.

Figure 9: The Youtube video for 12 transition controls of

the RDIP.

5 CONCLUSION AND FUTURE

WORK

In this paper, we established an experimental envi-

ronment using a physically constructed rotary dou-

ble inverted pendulum system to address 12 transition

control problems, including swing-up control. We

introduced a method for generating feedforward tra-

jectories using the direct collocation technique and

ICINCO 2024 - 21st International Conference on Informatics in Control, Automation and Robotics

Table 2: Transition times used in the experiments.

Starting EP Destination EP T [sec]

EP0 EP1 2.000

EP2 3.000

EP3 2.500

EP1 EP0 1.800

EP2 2.500

EP3 2.500

EP2 EP0 2.500

EP1 2.000

EP3 2.921

EP3 EP0 2.500

EP1 2.000

EP2 2.000

0 10 20 30 40 50 60 70

-2

0 10 20 30 40 50 60 70

-5

0 10 20 30 40 50 60 70

-10

-5

simulation

true data

0 10 20 30 40 50 60 70

Time

-10

-5

Figure 10: 12-step transition trajectories of the rotary dou-

ble inverted pendulum: simulated trajectory (solid line) and

actual trajectory (dot-dashed line).

implemented these trajectories with a 2-DOF con-

troller incorporating time-varying LQ control. This

approach effectively addressed the complex transition

control problem by combining feedforward trajec-

tory generation and feedback control. The efﬁciency

and feasibility of the proposed method were demon-

strated through continuous control of the 12 transi-

tion paths among the four equilibrium points of the

double inverted pendulum. The main contributions

of this study include the novel deﬁnition and experi-

mental implementation of the transition control prob-

lem between equilibrium points, extending the exist-

ing swing-up problem, and the practical applicability

of the proposed control strategy validated through ex-

perimental results.

Future research can consider several directions

to further develop the transition control method pre-

sented in this paper. First, the application of a Distur-

bance Observer (DOB) technique could enhance the

robustness of the system against disturbances. Sec-

ond, incorporating artiﬁcial intelligence-based con-

trol techniques such as Reinforcement Learning could

lead to the development of more intelligent and adap-

tive control systems. Third, introducing advanced

control strategies, such as nonlinear control tech-

niques or Model Predictive Control (MPC), could be

a promising direction to improve the performance of

the controller.

As additional research, it is necessary to evaluate

the performance of the transition control under vari-

ous experimental environments and conditions to ver-

ify the generality of the proposed method. In the fu-

ture, this transition control strategy will be applied to

multi-dimensional and multi-body systems to explore

the controllability of more complex dynamic systems.

Through this, it is expected that the method can be de-

veloped into a universal control technique applicable

not only to the rotary double inverted pendulum sys-

tem but also to other complex dynamic systems.

ACKNOWLEDGEMENTS

This work was supported by the National Research

Foundation of Korea(NRF) grant funded by the Korea

government(MSIT)(RS-2024-00347193).

REFERENCES

Baek, J., Lee, C., Lee, Y. S., Jeon, S., and Han, S. (2024).

Reinforcement learning to achieve real-time control of

triple inverted pendulum. Engineering Applications of

Artiﬁcial Intelligence, 128:107518.

Brown, D. and Strube, M. (2020). Design of a neural con-

troller using reinforcement learning to control a rota-

tional inverted pendulum. In 2020 21st International

Conference on Research and Education in Mechatron-

ics (REM), pages 1–5.

Astr

om, K. J. and Furuta, K. (2000). Swinging up a pendu-

lum by energy control. Automatica, 36(2):287–295.

Graichen, K., Treuer, M., and Zeitz, M. (2007). Swing-up

of the double pendulum on a cart by feedforward and

feedback control with experimental validation. Auto-

matica, 43:63–71.

Ibrahim, M. M., Ubaid, M. A., Rachid, M., and Maamar,

B. (2019). Stabilization of a double inverted rotary

pendulum through fractional order integral control

scheme. International Journal of Advanced Robotic

Systems, 16(4).

Ju, D., Choi, C., Jeong, J., and Lee, Y. S. (2022). Design

and parameter estimation of a double inverted pendu-

lum for model-based swing-up control. Journal of In-

Implementation of 12 Transition Controls for Rotary Double Inverted Pendulum Using Direct Collocation

stitute of Control, Robotics and Systems (in Korean),

28(9):793–803.

Kelly, M. (2017). An introduction to trajectory optimiza-

tion: How to do your own direct collcation. SIAM

Review, 59(4):849–904.

Liang, F., Xin, X., and Li, Y. (2023). Swing-up and balance

control of rotary double inverted pendulum. In Pro-

ceedings of the 2023 3rd International Conference on

Robotics and Control Engineering, page 65–70.

Meta, T., Gyeong, G. Y., Park, J. H., and Lee, Y. S. (2014).

Swingup control of an inverted pendulum subject to

input/output constraints. Journal of Institute of Con-

trol, Robotics and Systems (in Korean), 20:835–841.

Oh, Y. and Lee, Y. S. (2018). Robust Swing-up Control of

a Rotary Inverted Pendulum Subject to Input/Output

Constraints. Journal of Institute of Control, Robotics

and Systems (in Korean), 24(5):423–430.

Rahairi, M., H.Selamat, Zamzuri, H., and Ahmad, F.

(2011). Pid controller optimization for a rotational

inverted pendulum using genetic algorithm. In 2011

Fourth International Conference on Modeling, Simu-

lation and Applied Optimization, pages 1–6.

Ratiroch-Anant, P., Anabuki, M., and Hirata, H. (2004).

Self-tuning control for rotational inverted pendulum

by: eigenvalue approach. In 2004 IEEE Region 10

Conference TENCON 2004., volume D, pages 542–

545.

Singh, S. and Swarup, A. (2021). Control of rotary dou-

ble inverted pendulum using sliding mode controller.

In 2021 International Conference on Intelligent Tech-

nologies (CONIT), pages 1–6.

Sondarangallage, D. A. and Manukid, P. (2019). Control of

rotary double inverted pendulum system using mixed

sensitivity h∞ controller. International Journal of Ad-

vanced Robotic Systems, 16(2).

Thein, M.-W. and Misawa, E. (1995). Comparison of the

sliding observer to several state estimators using a ro-

tational inverted pendulum. In Proceedings of 1995

34th IEEE Conference on Decision and Control, vol-

ume 4, pages 3385–3390.

Tran, N., Nguyen, V., Le, C., Lai, A., Nguyen, T., Huynh,

M., Phan, V., Tong, G., Nguyen, L., and Ngo, T.

(2024). Lqr control for experimental double rotary in-

verted pendulum. Journal of Fuzzy Systems and Con-

trol, 2(2):104–108.

Zied, B. H., Mohammad, J. F., and Zafer, B. (2020). De-

velopment of a Fuzzy-LQR and Fuzzy-LQG stabil-

ity control for a double link rotary inverted pendu-

lum. Journal of the Franklin Institute, 357(15):10529–

10556.

ICINCO 2024 - 21st International Conference on Informatics in Control, Automation and Robotics

100