Multi-Agent Trajectory Prediction for Urban Environments with UAV

Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

Mohammad Reza Mohebbi

1,2 a

, Elahe Kafash

and Mario D

oller

1 b

Josef Ressel Center Vision2Move, University of Applied Sciences Kufstein Tirol, Kufstein, Austria

Department of Computer Science, University of Passau, Passau, Germany

Department of Computer Engineering, Imam Reza (AS) International University, Mashhad, Iran

Keywords:

Kolmogorov-Arnold Networks, Particle Swarm Optimization, Multi-Agent, Trajectory Forecasting,

Intelligent Transportation System, Unmanned Aerial Vehicle, Feature Extraction.

Abstract:

Accurate trajectory prediction for moving agents such as pedestrians and vehicles is essential for autonomous

driving, intelligent navigation, and abnormal behavior detection. Real-time prediction of future movements

enhances the development of autonomous vehicles and the efﬁciency of trafﬁc management systems. In this

study, a novel trajectory prediction approach based on Temporal Kolmogorov-Arnold Networks (TKAN) is

introduced, using the TUMDOT-MUC dataset collected by Unmanned Aerial Vehicles (UAVs) in Munich,

Germany, to model large-scale urban scenarios. To improve prediction accuracy, additional features were

extracted from the primary dataset and incorporated into the TKAN architecture, demonstrating a marked

performance improvement over general machine learning models. The accuracy of predictions is further re-

ﬁned by tuning hyperparameters of TKAN through Particle Swarm Optimization (PSO). The proposed model

provides a robust and reliable solution for the trajectory prediction of multi-agents in challenging urban traf-

ﬁc conditions. This research advances intelligent and effective transportation systems by proposing scalable

methods for improved trafﬁc management and safety in densely populated urban areas, ultimately contributing

to smarter and more efﬁcient transportation networks.

1 INTRODUCTION

Trafﬁc congestion can lead to longer travel times,

higher fuel consumption, and air pollution, negatively

impacting public health and quality of life for urban

residents. In addition, trafﬁc congestion would cause

economic losses, disruption of commercial activities,

and low productivity levels. These challenges impose

costly social and economic pressures on city trans-

port systems, and cannot afford to delay the search

for efﬁcient solutions to control and mitigate prob-

lems (C. Arti and Kumar, 2022). Urbanization places

more pressure on the transportation infrastructure as

urban populations continue to grow. The early signs

of congestion in most cities around the world begin

with speciﬁc sections of roads where the number of

vehicles traveling through the road is greater than the

capacity of the road section, which slows trafﬁc speed

https://orcid.org/0009-0004-9455-2558

https://orcid.org/0000-0002-9716-564X

and increases travel time. More vehicles mean more

fuel consumption and air pollution conditions that ad-

versely affect the health of citizens (R. SenthilPrabha

and Harish, 2023).

In recent years, the ability to predict vehicle tra-

jectories has gained attention as a promising ap-

proach for trafﬁc optimization in Intelligent Trans-

portation Systems (ITS). Accurate trajectory predic-

tion can also help alleviate congestion and optimize

trafﬁc ﬂow. The analysis of the data of moving ve-

hicles enables the system to judge the ﬂow of trafﬁc

and provides valuable information to decide in real-

time (M. R. Mohebbi and Yamnenko, 2024). Predict-

ing trafﬁc ﬂow can route drivers through detours, re-

ducing air pollution and trafﬁc incidents. Such pre-

dictive senses are also of immense importance for

urban planners, planning better transportation infras-

tructure, improving public services, and encourag-

ing the development of smart and sustainable cities

(X. Kong and Zhang, 2016). In addition, anticipat-

586

Mohebbi, M. R., Kafash, E. and Döller, M.

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with Particle Swarm Optimization.

DOI: 10.5220/0013243100003890

In Proceedings of the 17th International Conference on Agents and Artiﬁcial Intelligence (ICAART 2025) - Volume 2, pages 586-597

ISBN: 978-989-758-737-5; ISSN: 2184-433X

ing the next move of dynamic objects is a key to

autonomous mobility, with applications such as au-

tonomous vehicles safely interacting in shared in-

frastructure considering the trajectories of pedestrians

and other vehicles (Deo and Trivedi, 2017).

Traditional Machine Learning (ML) models, in-

cluding Support Vector Machines (SVM), Multilayer

Perceptrons (MLP), Random Forest (RF), and early

recurrent models, including Long Short-Term Mem-

ory (LSTM) networks, have been tried to improve the

accuracy of trafﬁc prediction. However, most of these

models fail in processing nonlinear, complex trafﬁc

data, especially in dynamically changing urban envi-

ronments (B. Yang and Tian, 2019). The serious lim-

itation of these models is also related to the fact that

treating temporal sequences this way, focusing on lin-

earity, might prevent them from capturing complex re-

lationships in urban trafﬁc. Deep Learning (DL) mod-

els, especially more advanced Recurrent Neural Net-

works (RNNs), presented the potential to help with

some of those issues by providing proper modeling

of complex temporal sequences. More sophisticated

models for real-time forecasting by fusing data from

multiple sources are needed in the case of urban trafﬁc

ﬂow (Y. Lv and Wang, 2014).

This is addressed with the adoption and imple-

mentation of a new architecture known as Tempo-

ral Kolmogorov-Arnold Networks (TKAN), specially

modeled to handle nonlinear complex dependencies

in temporal data. The theoretical rationale for the im-

plementation of TKANs lies in the theories by Kol-

mogorov and Arnold, uniquely and specially ﬁtted

for modeling and identiﬁcation of complex patterns

in time-series data. Thus, it has an advanced archi-

tecture to handle large-scale and dynamic data more

efﬁciently. TKAN is thus assumed to be proﬁciently

effective in predicting real-time trajectory in an ur-

ban environment. Unlike other traditional methods,

TKAN considers historical pattern attributes for the

generation of more accurate future behavior predic-

tions. Therefore, it considers robust potential for its

use within any intelligent trafﬁc management system

(Genet and Inzirillo, 2024).

Meanwhile, parallel methodological advances in

data collection allow for more accurate and profound

inputs of urban trafﬁc ﬂow. Among others, drones

or Unmanned Aerial Vehicles (UAVs) have been sug-

gested as an efﬁcient source of data collection on traf-

ﬁc ﬂow and other movements within cities (M. R. Mo-

hebbi and Tavasoli, 2024). UAVs guarantee broad,

real-time, and multiangle views often unreachable for

ground sources and notably enhance the quality of

data collection even for the most densely populated

or hard-to-reach areas. Applications involving trajec-

tory prediction beneﬁt from UAVs, which improve ac-

curacy by providing expansive and detailed datasets

that often are beyond the reach of ground sensors

(A. Kutsch and Bogenberger, 2024). Therefore, en-

hancing both the quality and quantity of data im-

proves efforts in trafﬁc management and urban plan-

ning through the use of UAV technology, which, in

turn, provides better insights for researchers and pol-

icymakers into the complicated trafﬁc patterns and

movement behaviors within cities.

This study addresses the growing complexity of

urban trafﬁc and the limitations of standard predictive

models by introducing an advanced framework that

integrates the Temporal Kolmogorov-Arnold Net-

work with UAV-based data acquisition. The proposed

scheme leverages the capability of TKAN for pattern

learning of nonlinear variance in time series and high-

resolution real-time input provided through UAVs for

setting up a more reliable methodology of trafﬁc man-

agement in dynamic and high-density urban infras-

tructure. They address the demand for reliable, efﬁ-

cient, and scalable predictive models with the aim of

enabling transportation systems that are safer, more

intelligent, and that can learn and adapt to the chal-

lenges related to modern urban mobility. To advance

the predictive accuracy and scalability of urban traf-

ﬁc models, the following key contributions are intro-

duced:

• Noise-Reduction Techniques. Several tech-

niques, such as a moving average ﬁlter, have been

used to normalize motion data in order to reduce

the inﬂuence of unreliable points, such as outliers.

All of these efforts help maintain the data quality

so that model performance and accuracy of pre-

dictions can be improved during the training pro-

cess.

• TKAN Model. This model was selected for its

efﬁcacy in modeling temporal data by capturing

intricate nonlinear features and structures of the

trafﬁc pattern. It would improve the capability

of the model for scalability in large data and, in

essence, a dynamic urban setting with robust tra-

jectory predictions.

• Particle Swarm Optimization (PSO) for Hy-

perparameter Tuning. PSO is applied for hy-

perparameter optimization to further improve the

efﬁciency and precision of the model. This intelli-

gent parameter search not only accelerates model

tuning but also returns a model optimized for ef-

fective analysis and prediction.

The remainder of this paper is organized as follows.

Section 2 summarizes the related work by situat-

ing our approach with respect to the existing meth-

ods. Section 3 provides a detailed description of the

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

587

methodology, covering the data processing pipeline,

model architecture, and temporal enhancement tech-

niques. Section 4 presents a full empirical evaluation,

and conclusions are given in Section 5.

2 RELATED WORK

Trajectory prediction has been one of the most salient

facial features in ITS research and has made sufﬁcient

progress along three main lines: vehicle trajectory

prediction, human trajectory forecasting, and multi-

agent trajectory modeling. These enable improved

safety, optimized ﬂow, and better trafﬁc management.

The following section tries to identify signiﬁcant ﬁnd-

ings and methodological variety in these areas that re-

cent research addresses with evolving challenges for

trafﬁc prediction.

2.1 Vehicle Trajectory Prediction

Trajectory prediction has been considered an impor-

tant task within the scope of ITSs, especially for very

dense and complex urban scenarios. Advanced ML

and DL models increasingly contribute to the neces-

sary analysis of extensive trafﬁc datasets for the detec-

tion of complex trafﬁc movement patterns and the im-

proved prediction of vehicle behavior. For instance,

Mohebbi et al. (M. R. Mohebbi and Yamnenko, 2024)

presented how the usage of liquid neural networks

combined with UAV-derived data signiﬁcantly im-

proves trajectory predictions in crowded scenarios by

offering a wider perspective and enabling the capabil-

ity to predict urban trafﬁc ﬂow more precisely.

Scalability is an important factor in the predic-

tive models that should be used in handling large

datasets of trafﬁc for various instant applications. An-

other approach has proposed a framework that is ef-

ﬁcient in data management for higher volumes of

data, enabling the application of such data in var-

ious urban contexts with complex adaptive func-

tions that are required by autonomous vehicle sys-

tems (P. Rathore and Bezdek, 2019). This makes the

frameworks highly effective in dynamic urban envi-

ronments, as they can manage large volumes of traf-

ﬁc data without losing accuracy. Another research

under the objective-oriented approach of prediction

demonstrated that the focus on particular trafﬁc pre-

diction goals, such as maintaining the network in a

ﬂuid state, considerably improved predictability to-

gether with system responsiveness (H. Zhao and Li,

2021).

In the prospect of having more accurate long-term

predictions, researchers have turned to using spatio-

temporal algorithms, which enable them to forecast

more about extended trajectory paths. The strategic

beneﬁts are perceived in terms of timely and data-

driven decisions for trafﬁc management and urban

planning (T. Wu and Chen, 2022). One of the recent

effective methods adopted includes the use of Varia-

tional Autoencoders (VAEs), which study historical

trafﬁc data for patterns that enable highly accurate

prediction of vehicle behavior in future scenarios by

learning from past trajectories (M.

A. De Miguel and

Garcia, 2022).

Recently, hybrid models have gained promi-

nence that try to render the predictive models more

amenable to dynamic and ﬂuctuating conditions. For

instance, networked trafﬁc data illustrated promis-

ing results considering the identiﬁcation and analy-

sis of dynamic movement patterns by integrating the

LSTM network with adaptive chirp mode decompo-

sition. This downloaded hybrid method may improve

the extraction of features and improve the precision

of prediction in capturing complex temporal struc-

tures in urban trafﬁc ﬂow (Z. Wang and Jiang, 2024).

In the meantime, some lightweight DL models have

been developed to ﬁt the usage demand of applica-

tions in dense trafﬁc scenarios in recent years. This

is important because both high processing speed and

accurate predictions are usually needed for real-time

applications (C. Wang and Lu, 2022).

In some research, such as Li et al. (Y. Li and Wu,

2024) focused on the interaction of the vehicle with

other urban elements, the results obtained showed

that environmental factors and, mainly, the presence

of pedestrians and infrastructure played an important

role in increasing the accuracy of the prediction re-

sults of the methods studied. The following interac-

tions were embedded into predictive models; boosting

prediction accuracy increases the robustness of the

model for changes in the environment, as shown in

recent work about environment-aware prediction ap-

proaches. Similarly, social attention mechanisms that

are integrated with LSTM networks have also shown

how modeling interactions among road users can be

used to further reﬁne the prediction results (S. Qiao

and Zhao, 2023).

Other techniques incorporated plate recognition

data to further improve the accuracy of urban trafﬁc

ﬂow forecasts by accurately estimating the tendencies

of vehicle motion at busy intersections (X. Shan and

Zhang, 2023). Zhou et al. (H. Zhou and Fan, 2023)

have combined Conditional VAEs with social aware-

ness models and found substantial improvements in

accuracy, particularly in highly populated urban re-

gions where vehicular interactions are even more in-

volved.

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

588

This stream of research elicits a continuous shift

toward more adaptive and scalable trajectory predic-

tion models that are important to help foster relevant

ITS in modern urban environments. These models

will grant appropriate ﬂow, safety, and responsiveness

of trafﬁc management systems based on new learning

techniques, scalability of frameworks, and considera-

tion of the urban context.

2.2 Pedestrian Trajectory Prediction

Pedestrian trajectory prediction has become a focal

area in urban trafﬁc management, especially in im-

proving safety and streamlined movement in highly

populous settings. This enables accurate trajectory

predictions for pedestrians. In this way, proactive

interventions in trafﬁc are possible. These would

go a long way to improving the safety of pedestri-

ans. Advanced models demonstrate greater accuracy

in these predictions, far above what traditional meth-

ods are capable of. For instance, the application of

diffusion-based autoregressive models has been made

in the prediction of complex pedestrian paths, though

these have tended to realize a substantial improve-

ment in accuracy due to their ability to model multi-

dimensional aspects of human movement (K. Lv and

Ni, 2024).

In a broad examination of human trajectory pre-

diction techniques, research highlights a wide array of

approaches that are effective in diverse trafﬁc scenar-

ios. With the incorporation of spatial-temporal inter-

action, graph-based network models have gained the

lead in efﬁciency for modeling dynamic pedestrian

environments. They are able to analyze how pedes-

trians interact with both people and vehicles, hence

obtaining more exact predictions for places with high

ﬂow (A. Rudenko and Arras, 2020; R. Wang and Cui,

2022).

In addition, clustering-based methods combined

with LSTM networks improve the prediction accu-

racy by clustering certain groups of pedestrian paths

according to their respective shared characteristics;

this helps the model cope with large variations in in-

dividual behavior (H. Xue and Reynolds, 2020). The

challenge of environmental complexity is the core is-

sue in the development of pedestrian movement mod-

eling. Considering genuine complexities from the real

world, such as viewpoint distortion, advanced meth-

ods have been suggested that will deliver high accu-

racy even in complicated settings. In one work, the

use of spatial distortions to adjust a prediction yields

more robust trajectory forecasts, which will be highly

useful in urban areas where pedestrian movements are

highly variable (S. Gundreddy and Bakshi, 2023).

Another front where developments are being made

involves the use of attention mechanisms in sequence

prediction for pedestrians. An attention-driven model

has shown tendencies to recognize subtle dependen-

cies and time-based sequences of pedestrian paths.

This allows for ﬁne-grained improvements in predic-

tions of crowded urban environments, where this at-

tention mechanism acts to give a further added ad-

vantage in enhancing the model in capturing intri-

cate movement patterns. (E. Zhang and Malhan,

2022). Moreover, cross-attention-based predictive

models have improved multistep human movement

predictions for scenarios that involve the interaction

of complicated trafﬁc situations where other tradi-

tional models struggle to maintain accuracy (W. Zhu

and Yi, 2023).

Other recent works further stress the sole impor-

tance of multi-modal approaches, as they represent

variants of different behavior patterns among individ-

ual pedestrians. Such models signiﬁcantly improve

the reliability of prediction. For example, multi-

modal modeling techniques can be used to account

for a wide range of pedestrian behaviors and have bet-

ter eventual prediction performance (L. Shi and Hua,

2023). More recently, conditional ﬂow normaliza-

tion has been shown to increase the real-time accu-

racy of pedestrian trajectory predictions to better meet

the speciﬁc challenges of urban areas with dynamic

pedestrian densities (J. Sun and Lu, 2021).

2.3 Multi-Agent Trajectory Prediction

Trajectory prediction in multi-agent systems is dis-

tinctive and challenging due to a multitude of diverse

and often unpredictable behaviors among agents.

Several recent extensions proposed predictive models

incorporating situational awareness and risk assess-

ment, yielding notable improvements in the accuracy

of multi-agent trajectory prediction. For example, the

Self Attention-LSTM model, designed for dynamic

risk and situational context assessment, has been use-

ful in enhancing trajectory forecasts capturing sev-

eral agent behaviors in uncertain scenarios (Y. Ma

and Manocha, 2019). Similarly, DL models in het-

erogeneous trafﬁc behavior have proven their useful-

ness while handling diverse agent dynamics within

complex urban settings (A. Kutsch and Bogenberger,

2024).

The importance of diverse, high-quality data

sources for multi-agent trajectory prediction is

also increasingly recognized, as highlighted by the

TUMDOT-MUC dataset. This dataset was developed

to capture all details of trafﬁc interaction using UAVs

and provided comprehensive details on agent move-

ment for training predictive models on more subtle

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

589

representations of multi-agent interactions (A. Kutsch

and Bogenberger, 2024). Such integration of diverse

data allows predictive models to better account for

and model interactions between agents, mainly when

these are in dense urban settings.

It is also underscored by advances in the ﬁeld of

dynamic system modeling that provide the basis for

neural networks speciﬁcally engineered to manipu-

late the dynamics of nonlinear interactions. Among

these, the TKAN represents yet another big stride, op-

timized for analyzing intricate behaviors in dynamic

environments. Such networks are based on chaos the-

ory and nonlinear dynamics for extracting temporal

and spatial patterns in the data of dynamic scenar-

ios. It can be used to simulate complex, nonlinear

interactions. The model demonstrates high accuracy

in various applications that involve dynamic predic-

tion, movement analysis, and behavioral simulation,

which makes it suitable for real-time forecasting tasks

(K. Xu and Wang, 2024),(Kasheﬁ, 2024).

One of the studies underlined that TKAN excelled

in spatial-temporal integration regarding multi-agent

dynamics, while the model considers several agents at

once and therefore can ensure higher predictive accu-

racy compared to traditional models (Genet and Inzir-

illo, 2024). Furthermore, TKAN has an excellent ap-

plication for dynamic and high-density environments,

such as those generated by the urban trafﬁc system,

because it is quite ﬂexible when adaptation speedily

copes with complex nonlinear patterns.

In summary, recent multi-agent trajectory pre-

diction contributions have marked a current trend

for models that incorporate contextually aware, rich

sources of data, and use state-of-the-art neural ar-

chitectures such as the TKAN. These improvements

enable multi-agent predictive models to guarantee

much higher accuracy, helping trafﬁc management

and safety policing in complex urban settings.

3 DATASET AND

METHODOLOGY

This section elaborated on the methodology adopted

for trajectory prediction with a discussion on the

preparation of the dataset, the extraction of features,

and the modeling framework. The proposed model

consists of the TKAN optimized with PSO for devel-

oping an object trajectory prediction framework.

3.1 Dataset Overview

This work leveraged the rich TUMDOT-MUC dataset

for trafﬁc analysis in urban environments. The cap-

ture has been performed using 12 UAVs equipped

with cameras continuously observing a segment of a

700-meter-long busy urban roadway in Munich, Ger-

many, over two afternoons during peak trafﬁc times

on Thursday, October 6th, 2022, and Wednesday, Oc-

tober 12th, 2022. Figure 1 illustrates the speciﬁc

location of the monitored road segment in Munich,

providing context to the high-density trafﬁc dynamics

captured in the dataset.

Each recording session captures the road environ-

ment and trafﬁc behaviors for several hours, focus-

ing on periods of high trafﬁc density. The ﬁrst-day

recordings were from 15:35 to 18:45, and for the sec-

ond day from 15:00 to 18:25. This data forms origi-

nal material with useful insights into trafﬁc dynamics

and a wide range of road user behaviors, thus offer-

ing strong data material upon which to ground tra-

jectory prediction. The extracted data covers a broad

range of road users, including cars, buses, trucks,

trams, motorcycles, bicycles, pedestrians, kick scoot-

ers, and trailers. It contains detailed information such

as the category of each object, its unique identiﬁca-

tion, its location, its velocity, acceleration, and orien-

tation across consecutive frames.

UAVs positioned to capture the road segment from

different heights and angles allow smooth tracking

across multiple frames. The interactions between traf-

ﬁc participants in mixed-trafﬁc conditions have var-

ied well, which allows the dataset to contain complex

trafﬁc dynamics that are relevant for trajectory predic-

tion models.

3.2 Feature Extraction and Description

This part outlines the primary and derived features

that were critical for accurate trajectory prediction,

with descriptions of each feature provided in Table

1. The primary features derived from the dataset in-

clude the basic data points: stage time, object class,

object unique ID, x-spatial and y-spatial coordinates,

speed, and acceleration. Our task is to develop further

features to add to the model so that it can enhance its

performance. Features apart from these basic ones are

extracted to derive further insight and improve the ac-

curacy of trajectory prediction. The features that have

been extracted, along with their relevance and useful-

ness in the approach taken to solve the problem, are

as follows:

• Distance Traveled. The distance traveled by each

object, scaled between 0 and 1 based on the min-

imum and maximum distances within the dataset,

is given by:

traveled

− min

traveled

max

traveled

− min

traveled

, (1)

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

590

Figure 1: Location of monitored road segment in Munich.

• Average Velocity (v

avg

). The average velocity is

calculated as the distance traveled over time, rep-

resenting the mean velocity of each object:

avg

traveled

time

, (2)

• Manhattan Distance. The Manhattan distance

between two vehicles is the sum of the absolute

differences of their coordinates:

Manhattan Distance = |x

− x

| + |y

− y

|, (3)

where (x

, y

) and (x

, y

) represent the coordi-

nates of two vehicles.

• Manhattan Average Speed. The Manhattan Av-

erage Speed represents the average speed of the

surrounding vehicles for each frame, indicating

local congestion:

Manhattan Average Speed =

∑

i=1

, (4)

where N is the number of nearby vehicles and v

denotes the speed of the i-th nearby vehicle.

• Yaw Rate. The yaw rate, representing the angular

velocity around the z-axis, is calculated as the rate

of change in the z-axis rotation:

Yaw Rate =

∆rotation

∆t

, (5)

• Jerk. Jerk captures sudden changes in accelera-

tion, deﬁned as the time derivative of acceleration:

Jerk =

∆acceleration

∆t

, (6)

• Acceleration Magnitude. Acceleration Magni-

tude is the intensity of acceleration, given by the

magnitude of the acceleration vector:

Acceleration Magnitude =

+ a

, (7)

• Rotation Magnitude. Rotation Magnitude cap-

tures the total rotation change rate, calculated as

the magnitude of the rotation vector:

Rotation Magnitude =

+ r

, (8)

• Heading. The heading represents the direction of

movement of the object, determined using the arc-

tangent of the velocity components:

Heading = arctan





. (9)

3.3 TKAN for Trajectory Prediction

The approaches used TKAN since they are capable

of capturing the complex, temporal dependencies in

such a sequence-based data type, which is best suited

for object trajectory prediction. The model utilizes

multi-layer neural networks that map dynamic inputs

over time, capturing non-linear relationships in the

data, hence offering ﬁne-grained, temporal tracking

of objects.

TKAN models the complex temporal dependen-

cies of data in a hierarchical neural network layer

manner. These layers map object positions across

frames and create trajectories with which the model

can predict not just the immediate future locations but

also the long-term trajectory paths. With each pre-

diction, previous states are considered by the TKAN

project’s future states, highlighting the essential de-

tails in both space and time of motion for applica-

tions such as forecasting changes in ﬂow. The TKAN

model represents complex functions as:

f (x

, x

, . . . , x

) =

∑

p,q

)

. (10)

where:

• ϕ

p,q

are univariate functions mapping each input

variable x

• Φ

aggregates these mappings, facilitating the

identiﬁcation of temporal patterns across data se-

quences.

3.4 PSO for Model Optimization

To improve TKAN performance, the PSO method

was adopted to optimize hyperparameters (layer

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

591

Table 1: Extensive feature set of original and extracted features.

Feature Unit Symbol Brief Description Base Feature Derived Feature

Timestamp [s] t Frame time Yes No

Category – cat Agent type (9 categories) Yes No

Track ID – ID Unique agent identiﬁer Yes No

Translation [m] [x, y, z] Agent’s ground center position Yes No

Dimension [m] [l, w, h] Agent’s 3D bounding box Yes No

Velocity [m/s] [vx, vy, vz] Agent velocity vector Yes No

Acceleration [m/s

] [ax, ay, az] Agent acceleration vector Yes No

Distance Traveled – d

travel

Distance traveled by agents No Yes

Average Velocity [m/s] v

avg

Mean velocity of agent No Yes

Manhattan Distance [m] d

man

Distance between agents No Yes

Manhattan Average Speed [m/s] v

man

Average speed of surrounding agents No Yes

Yaw Rate [rad/s] ψ Angular velocity around z-axis No Yes

Jerk [m/s

] j Sudden changes in acceleration No Yes

Displacement [m] ∆d Agent’s movement in 3D space No Yes

Acceleration Magnitude [m/s

] a

mag

Overall acceleration intensity No Yes

Rotation Magnitude – r

mag

Total rotation change rate No Yes

Heading [rad] θ Agent’s movement direction No Yes

depth, neuron number, and learning rate) by simulat-

ing particles moving in the hyperparameter solution

space. The particles corresponding to the candidate

solutions are deﬁned by a combination of TKAN hy-

perparameters. The PSO process iterates as follows:

• Particle Position Representation. the current

position of each particle x

represents a unique set

of TKAN hyperparameters.

• Velocity Update. Each particle updates its veloc-

ity v

according to the equation:

(k + 1) = c

· v

(k) + c

· r

· (p

− x

)

+ c

· r

· (p

− x

)

(11)

Here, c

is the inertia coefﬁcient that maintains the

current momentum of the particle, c

is the cogni-

tive coefﬁcient that directs the particle towards its

best known position p

(personal best), and c

the social coefﬁcient that attracts the particle to-

ward the best global position p

. The random fac-

tors r

and r

introduce stochasticity, ensuring a

diverse exploration of the solution space.

• Position Update. Using the updated velocity, the

position of each particle is adjusted according to:

(k + 1) = x

(k) + v

(k + 1). (12)

This new position corresponds to a revised set of

hyperparameters for the TKAN model.

The function repeatedly adjusts velocity and po-

sition so that the particles move down to an optimal

combination of hyperparameters to reduce the error

of the model. This enhances the learning stages of

TKAN, which increases predictive accuracy and de-

creases error rates, allowing for a faster training pro-

cess. The use of PSO therefore simpliﬁes both TKAN

optimization and hyperparameter tuning. Each itera-

tion of this optimization pass is summarized in pseu-

docode form in Algorithm 1.

Algorithm 1: PSO for TKAN Hyperparameter Tuning.

Data: Initialize particles with random positions and

velocities

Result: Optimal hyperparameters for TKAN

for each particle do

Set p best to current position;

Evaluate ﬁtness (error metric for TKAN);

end

Set g best as best among all particles;

repeat

for each particle do

Update velocity:

(k + 1) = c

· v

(k)

+ c

· r

· (p best − x

)

+ c

· r

· (g best − x

)

Update position:

(k + 1) = x

(k) + v

(k + 1)

Evaluate ﬁtness for TKAN model;

if new ﬁtness < p best then

Update p best;

end

if any particle’s ﬁtness < g best then

Update g best;

end

until convergence criteria met;

Return g best as optimal hyperparameters for TKAN

4 EXPERIMENTAL RESULTS

This section describes the results obtained after opti-

mization of the TKAN performance using PSO and

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

592

further analyzes the results of the trajectory predic-

tion. These ﬁndings reveal the improved performance

of the model in complex urban conditions in the real

world.

4.1 TKAN-PSO Model

The PSO algorithm was used to enhance the precision

of the TKAN anchored primitives in predicting trajec-

tories. This was used to ﬁne-tune the crucial hyper-

parameters of the model for dynamic urban environ-

ments where accurate and timely prediction of object

trajectories have to be computed in a cluttered scene.

For PSO implementation, each particle represents

the position of one TKAN hyperparameter conﬁgu-

ration in the swarm of particles. The deﬁned search

space was set up in intervals of interest to search for

optimal levels that enhance the attention mechanisms

and temporal performances of TKAN. These are the

most essential parameters of TKAN, which are crucial

to handle such complexity of temporal dependencies

and are highlighted in Table 2.

Table 2: Hyperparameter ranges for TKAN optimization.

Hyperparameter Range

Attention Layer Dimension 32 to 128 units

Dropout Rate (Attention Layer) 0.1 to 0.5

Sequence Length 10 to 100 steps

Time-Window Size 5 to 20 steps

To direct PSO toward an effective conﬁguration,

the ﬁtness function was designed to minimize the

Root Mean Squared Error (RMSE) on the validation

dataset. This function reliably assesses model accu-

racy, guiding PSO to converge on hyperparameter set-

tings that enhance the prediction precision of TKAN.

The ﬁtness function used in the optimization is de-

ﬁned as:

Fitness =

RMSE

(13)

The PSO conﬁguration was made up of 30 par-

ticles, with each position of the particle being itera-

tively updated for 100 rounds. The inertia weight w

was set to 0.7, while the cognitive and social coef-

ﬁcients (c

and c

) were both set to 1.5. These pa-

rameters were tuned to balance the algorithm between

exploration and convergence, ultimately contributing

to the model’s improvement. In Figure 2, the PSO

convergence process is visualized over successive it-

erations, highlighting the steady reduction in RMSE

as the optimal performance approaches.

The ﬁnal optimized conﬁguration, displayed in

Table 3, led to a substantial reduction in trajectory

prediction error, markedly improving the real-time

performance capabilities of the TKAN.

Figure 2: PSO convergence in TKAN hyperparameter opti-

mization.

Table 3: Optimized hyperparameters for TKAN.

Hyperparameter Optimized Value

Attention Layer Dimension 96 units

Dropout Rate (Attention Layer) 0.3

Sequence Length 40 steps

Time-Window Size 15 steps

Figure 3 presents the optimized TKAN-PSO

model, which outperforms the baseline by 10% in

RMSE. This conﬁguration enhances TKAN, making

it more effective for real-time trajectory prediction in

dynamic environments, such as rapidly changing ur-

ban settings. The TKAN-PSO model efﬁciently re-

duces error rates, well-suited for fast-paced scenarios.

Figure 3: Comparison of RMSE Between Baseline TKAN

and TKAN-PSO.

4.2 Visualization of Prediction Results

In this section, the experimental results of the op-

timized TKAN-PSO model are analyzed and per-

formed within a complex multi-agent urban scenario.

In a busy city space monitored by a set of UAVs, dif-

ferent types of entities, from vehicles to pedestrians,

are tracked. In these multi-agent settings, the behav-

ior of one entity may inﬂuence the other, where ve-

hicles change lanes to avoid pedestrians, and pedes-

trians are deterred when they see vehicles approach-

ing or other nearby agents. The proﬁciency and ca-

pacity of the model to predict these dynamics driven

by interactions between agents is evaluated through

these visualizations, demonstrating its effectiveness

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

593

in predicting trajectories in high-density natural en-

vironments.

In Figure 4, a heat map of the test area is pre-

sented, where 12 UAVs control vehicles and pedes-

trians. The movements across multiple lanes and

walkways have been captured by each UAV, provid-

ing a comprehensive view of high-trafﬁc zones. This

heatmap serves to identify areas of increased interac-

tion and potential congestion, which are essential for

real-time monitoring and predictive analysis.

Figure 4: Heatmap of urban monitoring area captured by

UAVs.

Figure 5 presents a combined view of predicted

versus actual trajectories for all agent classes. The

close matching between the predicted and the actual

paths within this larger scene underscores the reliabil-

ity of the model to generate persistent predictions in

highly variable urban scenarios. Further, its compe-

tencies toward adaptation to various settings and con-

ditions highlight the robustness of the model for ac-

curate predictions of motions in crowded, interacting

areas.

Figure 5: Comparison of predicted and actual trajectories

for all agents.

Figure 6 illustrates the 3D visualization and fur-

ther emphasizes the effectiveness of the model in

capturing lateral and vertical movements, crucial in

multi-agent scenarios where both dimensions are inte-

gral to situational awareness. This three-dimensional

view validates the accuracy of the model in spatial

tracking, which is essential in scenarios that require

real-time positional adjustments.

Figure 6: 3D visualization of predicted and actual trajecto-

ries across urban terrain.

The predicted trajectories of different agent

classes, overlaid against the ground truth trajectories

of the model, are shown in Figures 7.a to 7.c. Speciﬁ-

cally, predicted trajectories for the car, pedestrian, and

truck classes are shown in red, green, and brown, re-

spectively; their ground truth trajectories are shown

in the matching colors of blue, purple, and orange,

and follow each other nearby. This optimization from

PSO tuning reﬂects in these predictions and points to

the precision of the model in high-trafﬁc, dynamic ur-

ban settings where accurate real-time adjustments are

important for instant decision-making.

Finally, Figure 8 demonstrates how the model is

able to avoid collisions in real-time in the reactive

path adjustment. The historical data (indicated by

red points) aligns closely with the observed move-

ments (blue points), demonstrating how real-time ad-

justments are predicted by the model. When an ob-

ject encounters nearby entities, such as another vehi-

cle or a pedestrian, the predictive paths of the model

(shown in green) adapt to prevent collisions while ac-

curately projecting future paths. It therefore allows

for the capability to make interaction-based adjust-

ments that support suitability in real time for crowded

settings where smooth ﬂow and safety are of con-

cern. It should be noted that the representation of the

predicted results reﬂects the accuracy and ﬂexibility

achieved by the TKAN-PSO model. The consistent

alignment with real-world trajectories in complex ur-

ban scenarios illustrates the impact of PSO tuning,

which enhances the capacity of the model to handle

real-time, multi-agent environments. This precision

provides the TKAN-PSO model with considerable

use value for application in urban trafﬁc management,

autonomous navigation, and multi-agent surveillance

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

594

Figure 7: Predicted vs. actual trajectories for multi-agents.

Figure 8: Illustration of historical, real, and predicted trajectories of a car agent.

systems where reliable and efﬁcient decision-making

is essential.

4.3 Model Evaluation Metrics

To evaluate the performance of predictive models,

key metrics provide information on each accuracy,

efﬁciency, and overall effectiveness of the model

(M. R. Mohebbi and Yamnenko, ). This section

presents the performance of the proposed TKAN-

PSO model in comparison with other ML and DL

models. Performance evaluation is detailed in essen-

tial metrics: Mean Squared Error (MSE), Root Mean

Squared Error (RMSE), Mean Absolute Error (MAE),

R² score, and Training Time per Epoch, highlighting

the strengths and weaknesses of the model for urban

trajectory prediction.

MAE =

∑

i=1

− A

|, (14)

MSE =

∑

i=1

− A

)

, (15)

RMSE =

∑

i=1

− Q

)

, (16)

= 1 −

∑

i=1

− Q

)

∑

i=1

−

, (17)

MPE =

∑

i=1

− Q

× 100. (18)

where O

denotes the predicted value of i

, A

rep-

resents the actual value of i

, and N signiﬁes the num-

ber of test samples, and in R² score formula,

A is the

mean of observed values, and N is the total number of

samples.

These metrics collectively offer a comprehensive

evaluation framework that balances error magnitude,

ﬁt quality, and training efﬁciency. By analyzing

these aspects, practitioners can better understand the

strengths and weaknesses of the model, enabling in-

formed decisions for further optimization and im-

provement.

Table 4 presents the results of each model across

these metrics. With an R² score of 0.98, an RMSE of

39.85, and an MSE of 1588.5, the TKAN-PSO model

emerged as the most accurate in trajectory prediction.

Other models, including LSTM and GRU, performed

well, but TKAN-PSO consistently delivered superior

results across all metrics, reinforcing its reliability for

urban, multi-agent trajectory prediction.

Table 4: Performance comparison of all models on evalua-

tion metrics.

Model MAE RMSE

MPE Time

TKAN-

PSO

13.75 39.85 0.98 -12.9 128

LSTM 21.3 52.9 0.93 -15.7 152

GRU 22.6 55.3 0.91 -37.4 144

KNN 27.4 67.0 0.85 -21.1 97

RNN 24.9 60.7 0.89 -18.2 125

MLP 26.3 63.5 0.91 -19.9 107

SVR 28.7 69.8 0.83 -49.7 93

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

595

Table 5 shows the performance of the TKAN-PSO

model when predicting trajectory for speciﬁc agent

types (cars, pedestrians, and trucks), highlighting an

improved accuracy for cars due to the larger dataset

for this category. Conversely, pedestrians and trucks

show slightly lower R² score values, reﬂecting data

variability and the challenges inherent in predicting

these trajectories of agents.

Table 5: Performance of the TKAN-PSO model across var-

ious agent types and all agents.

Agent MAE RMSE

MPE Time

Cars 12.6 37.45 0.98 -11.75 122

Pedestrians 19.8 51.6 0.93 -15.68 137

Truck 18.1 50.7 0.93 -14.50 135

All Agents 13.75 39.85 0.98 -13.93 128

Figure 9.a visually compares the MSE-based

models, highlighting the clear advantage of the pro-

posed model over the others. In Figure 9.b, the pre-

diction accuracy of the TKAN-PSO model is shown

for each agent and the overall accuracy in the urban

trafﬁc environment, underlining its effectiveness in

predicting urban trajectory. This underlines the effec-

tiveness of the TKAN-PSO model in predicting the

trajectory of urban settings viewed in this work. Ad-

ditionally, this is due to the diversity of data and struc-

tural optimization using PSO that enhances the abil-

ity of TKAN to make reliable, real-time predictions

within complex environmental settings. These analy-

ses help the practitioner ﬁnd where the best model is

strong and where it will need improvement to support

decisions toward further optimization and enhancing

predictive performance.

5 CONCLUSIONS

The proposed TKAN-PSO model realizes marked im-

provements in multi-agent trajectory prediction. This

proves that the model is able to handle multiple urban

agents, such as vehicles, pedestrians, and trucks, with

signiﬁcant accuracy in dynamic and high-density en-

vironments. The inclusion of PSO allows for the ﬁne-

tuning of hyperparameters and therefore contributes

to major improvements in predictive accuracy and

error minimization. This optimization strategy en-

hances the resilience of the proposed model to the var-

ied spatial and temporal dependencies present in ur-

ban data, ensuring robust performance across diverse

scenarios. Central to the effectiveness of the model is

a thoughtful feature extraction process that systemat-

ically incorporates critical attributes. These features

capture key spatial and temporal patterns that allow

TKAN-PSO to predict complex trajectories for indi-

Figure 9: Comparison of model prediction accuracy by

MSE and agent type.

vidual agents while considering the broader interac-

tions among multiple agents in real-time. The multi-

agent predictive ability of TKAN-PSO will have wide

implications in the ITS area, which is crucial for real-

time decision support, urban trafﬁc management, and

object tracking, all key elements in much safer and

more efﬁcient mobility. Performing accurate predic-

tion and adaptation to the complex circumstances of

urban settings, TKAN-PSO enables intelligent mo-

bility. It provides a key tool that improves ﬂow and

safety and reduces congestion in modern cities and

towns.

ACKNOWLEDGEMENTS

The ﬁnancial support by the Austrian Federal Min-

istry of Labour and Economy, the National Founda-

tion for Research, Technology and Development and

the Christian Doppler Research Association is grate-

fully acknowledged.

REFERENCES

A. Kutsch, M. M. and Bogenberger, K. (2024). Tum-

dot–muc: Data collection and processing of multi-

modal trajectories collected by aerial drones. Data

Science for Transportation, 6(2):15.

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

596

A. Rudenko, L. Palmieri, M. H. K. M. K.-D. M. G. and

Arras, K. O. (2020). Human motion trajectory predic-

tion: A survey. The International Journal of Robotics

Research, 39(8):895–935.

B. Yang, S. Sun, J. L. X. L. and Tian, Y. (2019). Trafﬁc

ﬂow prediction using lstm with feature enhancement.

Neurocomputing, 332:320–327.

C. Arti, G. Sharad, K. P. P. C. and Kumar, S. (2022).

Urban trafﬁc congestion: Its causes-consequences-

mitigation. Research Journal of Chemistry and En-

vironment, 26(12):164–176.

C. Wang, H. L. and Lu, W. (2022). Fast prediction of

vehicle driving intentions and trajectories based on

lightweight methods. IEEE Journal of Radio Fre-

quency Identiﬁcation, 6:917–921.

Deo, N. and Trivedi, M. M. (2017). Learning and predict-

ing on-road pedestrian behavior around vehicles. In

2017 IEEE 20th International Conference on Intelli-

gent Transportation Systems (ITSC), pages 1–6. IEEE.

E. Zhang, N. Masoud, M. B. J. L. and Malhan, R. K. (2022).

Step attention: Sequential pedestrian trajectory pre-

diction. IEEE Sensors Journal, 22(8):8071–8083.

Genet, R. and Inzirillo, H. (2024). Tkan: Tempo-

ral kolmogorov-arnold networks. arXiv preprint

arXiv:2405.07344.

H. Xue, D. Q. H. and Reynolds, M. (2020). Poppl:

Pedestrian trajectory prediction by lstm with auto-

matic route class clustering. IEEE Transactions on

Neural Networks and Learning Systems, 32(1):77–90.

H. Zhao, J. Gao, T. L. C. S. B. S. B. V. Y. S. Y. C. C. S. and

Li, C. (2021). Tnt: Target-driven trajectory predic-

tion. In Conference on Robot Learning, pages 895–

904. PMLR.

H. Zhou, X. Yang, D. R. H. H. and Fan, M. (2023). Csir:

Cascaded sliding cvaes with iterative socially-aware

rethinking for trajectory prediction. IEEE Transac-

tions on Intelligent Transportation Systems.

J. Sun, Z. Wang, J. L. and Lu, C. (2021). Uniﬁed and fast

human trajectory prediction via conditionally parame-

terized normalizing ﬂow. IEEE Robotics and Automa-

tion Letters, 7(2):842–849.

K. Lv, L. Y. and Ni, X. (2024). Learning autoencoder diffu-

sion models of pedestrian group relationships for mul-

timodal trajectory prediction. IEEE Transactions on

Instrumentation and Measurement.

K. Xu, L. C. and Wang, S. (2024). Kolmogorov-arnold net-

works for time series: Bridging predictive power and

interpretability. arXiv preprint, arXiv:2406.02496.

Kasheﬁ, A. (2024). Kolmogorov-arnold pointnet: Deep

learning for prediction of ﬂuid ﬁelds on irregular ge-

ometries. arXiv preprint, arXiv:2408.02950.

L. Shi, L. Wang, C. L. S. Z. W. T. N. Z. and Hua, G. (2023).

Representing multimodal behaviors with mean loca-

tion for pedestrian trajectory prediction. IEEE Trans-

actions on Pattern Analysis and Machine Intelligence,

45(9):11184–11202.

M. R. Mohebbi, J. Klinger, M. D. J. M. N. A. and Tava-

soli, M. (2024). Advanced driving behavior analy-

sis through kolmogorov-arnold network and uav traf-

ﬁc data. Accepted, to appear.

M. R. Mohebbi, M. Tavasoli, M. D. J. M. N. A. M. J. H. Z.

and Yamnenko, I. (2024). Vehicle trajectory predic-

tion in congested urban trafﬁc leveraging liquid neural

network and uav data. Accepted, to appear.

M. R. Mohebbi, M. J. Hassan Zada, M. R.-S. M. D. and

Yamnenko, I. Network trafﬁc co-movement assess-

ment via oriented basis signal processing and ensem-

ble decision trees. Not Published.

A. De Miguel, J. M. A. and Garcia, F. (2022). Vehi-

cles trajectory prediction using recurrent vae network.

IEEE Access, 10:32742–32749.

P. Rathore, D. Kumar, S. R. M. P. and Bezdek, J. C. (2019).

A scalable framework for trajectory prediction. IEEE

Transactions on Intelligent Transportation Systems,

20(10):3860–3874.

R. SenthilPrabha, D. Sasikumar, G. S. K. N. and Har-

ish, P. (2023). Smart trafﬁc management system

through optimized network architecture for the smart

city paradigm shift. In 2023 International Confer-

ence on Intelligent Systems for Communication, IoT

and Security (ICISCoIS), pages 700–705. IEEE.

R. Wang, X. Song, Z. H. and Cui, Y. (2022). Spatio-

temporal interaction aware and trajectory distribution

aware graph convolution network for pedestrian mul-

timodal trajectory prediction. IEEE Transactions on

Instrumentation and Measurement, 72:1–1.

S. Gundreddy, R. Ramkumar, R. R. K. M. and Bakshi, S.

(2023). Perspective distortion model for pedestrian

trajectory prediction for consumer applications. IEEE

Transactions on Consumer Electronics.

S. Qiao, F. Gao, J. W. and Zhao, R. (2023). An enhanced ve-

hicle trajectory prediction model leveraging lstm and

social-attention mechanisms. IEEE Access.

T. Wu, P. Lei, F. L. and Chen, J. (2022). Space-time tree

search for long-term trajectory prediction. IEEE Ac-

cess, 10:117745–117756.

W. Zhu, Y. Liu, M. Z. and Yi, Y. (2023). Reciprocal con-

sistency prediction network for multi-step human tra-

jectory prediction. IEEE Transactions on Intelligent

Transportation Systems, 24(6):6042–6052.

X. Kong, Z. Xu, G. S. J. W. Q. Y. and Zhang, B. (2016). Ur-

ban trafﬁc congestion estimation and prediction based

on ﬂoating car trajectory data. Future Generation

Computer Systems, 61:97–107.

X. Shan, W. Yu, Z. L. C. W. Y. R. and Zhang, J. (2023).

Vehicle trajectory-based trafﬁc volume prediction on

urban roads with fast-communication license plate

recognition data. IEEE Transactions on Intelligent

Transportation Systems, 25(3):2768–2778.

Y. Li, Y. Jiang, Z. X. and Wu, X. (2024). Improv-

ing interaction-based vehicle trajectory prediction via

handling sensing failures. IEEE Sensors Journal.

Y. Lv, Y. Duan, W. K. Z. L. and Wang, F. Y. (2014). Traf-

ﬁc ﬂow prediction with big data: A deep learning ap-

proach. IEEE Transactions on Intelligent Transporta-

tion Systems, 16(2):865–873.

Y. Ma, X. Zhu, S. Z. R. Y. W. W. and Manocha, D. (2019).

Trafﬁcpredict: Trajectory prediction for heteroge-

neous trafﬁc-agents. In Proceedings of the AAAI Con-

ference on Artiﬁcial Intelligence, volume 33, pages

6120–6127.

Z. Wang, H. Zhang, C. Q. B. L. Y. C. and Jiang, M. (2024).

A hybrid lstm network for long-range vehicle trajec-

tory prediction based on adaptive chirp mode decom-

position. IEEE Sensors Journal.

Multi-Agent Trajectory Prediction for Urban Environments with UAV Data Using Enhanced Temporal Kolmogorov-Arnold Networks with

Particle Swarm Optimization

597