Prediction-Based Selective Negotiation for Reﬁning

Multi-Agent Resource Allocation

Madalina Croitoru

, Cornelius Croitoru

and Gowrishankar Ganesh

University of Montpellier, LIRMM, France

Faculty of Computer Science, Al. I. Cuza Univerisity, Iasi, Romania

LIRMM, CNRS (National Center of Scientiﬁc Research), France

Keywords:

Collective Decision Making, Multi Agent Systems, Negotiation.

Abstract:

This paper proposes a 2-stage framework for multi-agent resource allocation. Following a Borda-based al-

location, machine learning predictions about agent preferences are used to selectively choose agent pairs to

perform negotiations to swap resources. We show that this selective negotiation improves overall satisfaction

towards the resource redistribution.

1 INTRODUCTION

Resource allocation (Ibaraki and Katoh, 1988) is a

core problem in a wide range of real-world multi-

agent applications (Chevaleyre et al., 2005), from

distributing medical supplies in emergency situations

(Zhang et al., 2016) to assigning computational re-

sources in cloud computing (Vinothina et al., 2012)

or allocating advertising slots to bidders in digital

marketplaces (Li et al., 2018). Such problems re-

quire dividing limited resources among competing

agents, each with their own preferences. Finding opti-

mal solutions to such problems is typically NP-hard,

with linear programming methods used for approxi-

mate solutions (Katoh and Ibaraki, 1998; Croitoru and

Croitoru, 2011).

Advancements in generative Artiﬁcial Intelligence

(AI) and autonomous systems are shaping hybrid so-

cieties where human agents and artiﬁcial systems are

deeply interconnected. In these environments, sub-

jective, context-dependent human preferences must

integrate with the pre-coded preferences of artiﬁcial

agents. Resource allocation problems involving hu-

man agents should consider contextual factors such as

socioeconomic status or geographic location, as these

can inﬂuence priorities that may evolve during the al-

location process (Dafoe et al., 2020). For instance,

in digital marketplaces, preferences for goods or ser-

vices can be inferred from browsing behavior or de-

mographic proﬁles. Similarly, in education, predic-

tive models can guide resource allocation to students

based on their speciﬁc learning needs.

In hybrid societies, achieving the global good re-

quires systems that facilitate fair trade-offs among di-

verse stakeholders, balancing individual preferences

with collective outcomes. This paper examines a

straightforward approach to negotiation: the swap-

ping of goods. This method serves as a form

of compromise, enabling mutually acceptable out-

comes through direct exchanges while supporting the

broader collective interest. We propose a 2-stage

framework reﬁning multi-agent resource allocation:

1. In the ﬁrst step, individual preferences are ag-

gregated using the Borda voting method to cre-

ate a collective preference ranking. This aggre-

gated ranking, combined with the individual pref-

erences of each agent, is used to allocate goods in

a greedy manner. Starting with the most preferred

good in the Borda aggregated ranking, goods are

allocated to the maximum number of agents, pro-

ceeding sequentially to less preferred goods.

2. In the second step, following a Condorcet-like ap-

proach, pairs of agents are identiﬁed based on ma-

chine learning predictions of their features, indi-

cating potential for compromise. These agents

are then considered for swapping their allocated

goods to enhance overall societal satisfaction by

aligning the allocations with predicted prefer-

ences.

In this work, our contributions are threefold:

• We formalize the 2-stage framework for

prediction-based resource allocation.

• We design an algorithm that integrates preference

prediction and preference aggregation for overall

agent satisfaction.

656

Croitoru, M., Croitoru, C. and Ganesh, G.

Prediction-Based Selective Negotiation for Reﬁning Multi-Agent Resource Allocation.

DOI: 10.5220/0013368000003890

In Proceedings of the 17th International Conference on Agents and Artiﬁcial Intelligence (ICAART 2025) - Volume 1, pages 656-662

ISBN: 978-989-758-737-5; ISSN: 2184-433X

• Third, we analyse the satisfaction guarantees pro-

vided by the proposed framework.

The paper is structured as follows. Section 2 pro-

vides a motivating example illustrating how the use

of swaps can improve outcomes compared to a sim-

ple naive allocation. The naive allocation relies on

a lexicographical ordering of agents, assigning each

their most preferred goods in sequence until all goods

are allocated. Section 3 formally deﬁnes the resource

allocation problem and introduces the pseudocode for

the naive algorithm used in the motivating example

presented in Section 2. Section 4 details the Borda

voting method, which serves as the basis for the re-

ﬁned allocation process described in the paper, and

the Condorcet voting method, which relies on pair-

wise comparisons of preferences. The intuition be-

hind Condorcet’s pairwise comparisons is used to jus-

tify why agents might swap goods. Section 5 intro-

duces our two-step framework and its theoretical sat-

isfaction guarantees. The framework (i) uses Borda

aggregation for an initial allocation, followed by (ii)

optimization through swaps between agents, guided

by feature-based preference predictions. Section 6

concludes the paper.

2 MOTIVATING EXAMPLE

This section shows an illustrative example involving

10 agents and 5 goods, each with a maximum avail-

ability of 4 units. The scenario explains how a simple

greedy initial allocation, while respecting multiplicity

constraints, may fail to achieve optimal satisfaction.

Using predicted preferences provided by a machine

learning model, we are then able to target the agents

that might engage in negotiation to improve the allo-

cation by augmenting overall satisfaction.

We consider 10 children A = {a

,. . . ,a

} and

a pool of 5 goods G = {g

}, each with a

multiplicity of 4. Each child has a valuation function

: G → {15,10,5,0}, representing their preferences

for the goods. The top three preferences for each child

are shown in Table 1.

An initial allocation respecting multiplicities (no

good allocated to more than 4 children) is:

= {g

}, O

= {g

}, O

= {g

}, O

= {g

}, O

= {g

}, O

= {g

}, O

= {g

}, O

= {g

The overall initial allocation satisfaction is:

(O) =

∑

i=1

)

= 15 + 15 + 15 + 15 + 10 + 15 + 15 + 10

+ 15 + 0 = 135.

Using negotiations children can adjust the alloca-

tion to improve overall satisfaction:

• a

, receiving g

(value 10), negotiates with a

swap g

for g

, as g

is a

’s top preference and g

is a

’s second preference.

• a

, currently receiving g

(value 0), negotiates

with a

to swap g

for g

, as g

is a

’s top pref-

erence and g

is in a

’s top three.

After negotiation, the ﬁnal allocation is:

∗

= {g

}, O

∗

= {g

}, O

∗

= {g

∗

= {g

}, O

∗

= {g

},O

∗

= {g

}, O

∗

= {g

∗

= {g

}, O

∗

= {g

}, O

∗

= {g

The overall ﬁnal allocation satisfaction is:

∗

) =

∑

i=1

∗

) = 150.

3 RESOURCE ALLOCATION

A resource allocation problem is deﬁned as a tuple

(A, G, M,V ), where:

• A = {a

,. . . ,a

} is the set of n ≥ 2 agents.

• G = {g

,. . . ,g

} is the set of m ≥ 1 goods.

• M = {m

,. . . ,m

} is the multiplicity m

≥ 1

of each good g

∈ G, representing the maximum

number of agents that can receive g

. The total

availability of goods is

∑

j=1

• V = {v

,. . . ,v

} is the set of ordinal preference

functions, one for each agent a

∈ A. Each v

de-

ﬁnes a strict ranking over the goods G, such that

for any two goods g

∈ G, v

) < v

) im-

plies that g

is strictly preferred to g

by agent a

An allocation O = {O

,. . . ,O

} is a mapping

of goods to agents, where O

⊆ G denotes the subset

of goods allocated to agent a

. The allocation must

satisfy the multiplicity constraints:

∑

∈A

∈O

≤ m

, ∀g

∈ G,

where 1

∈O

is an indicator function that equals 1 if

∈ O

and 0 otherwise.

The (individual) satisfaction of agent a

is a func-

tion S

that associates to each subset G

′

⊆ G of goods

Prediction-Based Selective Negotiation for Reﬁning Multi-Agent Resource Allocation

657

Table 1: Top 3 goods and their valuations for each child.

Child Top 3 Goods (Values v

(g))

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

) = 15, v

) = 10, v

) = 5

a positive real value S

′

). This value is strongly de-

pendent of the preferences values v

(g), for g ∈ G

′

The satisfaction of an allocation O is the sum of the

satisfactions of the agents for their respective alloca-

tions: S(O) =

∑

i=1

). The objective is to ﬁnd

an allocation O

∗

that maximizes satisfaction over all

possible allocations.

A straightforward allocation algorithm can allo-

cate goods to agents based on their ordinal prefer-

ences in a sequential manner. For each agent a

∈ A

(in a pre-deﬁned order) the algorithm will allocate the

most preferred good g

(1)

of a

that is still available,

if g

(1)

is no longer available, allocate the next most

preferred good g

(2)

, and so on, until either a good is

allocated or all preferred goods are unavailable.

This greedy approach ensures that each agent re-

ceives the best possible good based on availability,

prioritizing agents in the order they are processed. In

order to keep the code simple, we used the command

Proceed to the next agent which tries to be equitable

enough and which can restart from the ﬁrst agent if

there are unallocated goods and also agents a

with

̸= G. Obviously, this algorithm may not result in

optimal overall satisfaction.

4 PREDICTIVE-BASED

RESOURCE ALLOCATION

In this section, we introduce the concepts that under-

pin the two-step framework for reﬁning multi-agent

resource allocation. The ﬁrst step relies on the aggre-

gated preferences of agents, calculated using social

choice methods outlined in Section 4.1 and detailed

in (Brandt et al., 2016). The second step involves

the use of classiﬁers (Jordan and Mitchell, 2015), dis-

cussed in Section 4.2, to identify pairs of agents for

good-swapping, inspired by Condorcet principles.

Input: A = {a

,. . . ,a

} (agents in a

predeﬁned order),

G = {g

,. . . , g

} (set of goods),

M = {m

,. . . , m

} (multiplicities of goods),

V = {v

,. . . , v

} (ordinal preference

functions for each agent).

Output: An allocation O = {O

,. . . , O

}

where O

is the set of goods allocated

to agent a

Initialize O

←

0 for all a

∈ A ;

No goods ←

∑

j=1

;

while No goods > 0 do

foreach a

∈ A do

foreach g

∈ G in order given by v

if m

> 0 and g

/∈ O

then

← O

∪ {g

} ;

← m

− 1 ;

No goods ← No goods − 1 ;

Proceed to the next agent;

end

return O

Algorithm 1: Simple Allocation Algorithm.

4.1 Social Choice

Let us now deﬁne an aggregated preference v

that

combines the individual preferences v

,. . . ,v

all agents A = {a

,. . . ,a

} into a single collective

preference over the goods G = {g

,. . . ,g

}. The

aggregated preference v

can be formally expressed

as:

= Aggregation(v

,. . . ,v

where, Aggregation is the function (e.g., Borda or

Condorcet) used to combine the individual prefer-

ences into a collective ranking.

In the Borda method, a score is assigned to goods

based on their ordinal rankings in each agent’s prefer-

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

658

ence. The aggregated preference v

is deﬁned as:

) =

∑

i=1

(n − v

) + 1),

• v

) is the rank of g

in agent a

’s preference list

(lower ranks indicate higher preference),

• (n − v

) + 1) converts the rank into a score

(higher scores indicating higher preference).

The goods are then ordered in descending order of

) to form the aggregated preference.

In the Condorcet method, the aggregated pref-

erence is determined through pairwise comparisons.

For each pair of goods (g

, a preference matrix M

is constructed:

M(g

) =

∑

i=1

)<v

)

• M(g

) is the number of agents preferring g

• 1

)<v

)

= 1 if g

is ranked higher than g

agent a

, and 0 otherwise.

A good g

is a Condorcet winner if it is preferred to

every other good in pairwise comparisons:

M(g

) > M(g

) ∀g

̸= g

An aggregation function for collective prefer-

ences should adhere to several key principles to en-

sure fairness and rationality: Pareto efﬁciency, Non-

Dictatorship, Independence of Irrelevant Alternatives

(IIA), and Anonymity.

Pareto Efﬁciency requires that if all agents prefer

a good g

over another good g

, the collective prefer-

ence v

ranks g

higher than g

) < v

) ∀i ∈ A,

then the aggregated preference must satisfy:

) < v

Non-Dictatorship ensures that no single agent a

∈

A can unilaterally determine the collective preference

, unless their preferences align with all agents unan-

imous preference:

∃g

∈ G such that v

) < v

) and

) < v

for at least one pair of goods g

and a

Independence of Irrelevant Alternatives (IIA) en-

sures that the collective ranking of two goods g

and

depends on their relative rankings in individual

preferences, unaffected by the presence or absence of

other goods. Formally, if:

) < v

) ∀i ∈ A,

then the aggregated preference must also preserve this

ordering:

) < v

Anonymity requires that the aggregation mechanism

treats all agents equally, i.e. the outcome is invariant

to the permutation of agent indices.

The Condorcet method satisﬁes Pareto efﬁciency, as

any unanimously preferred good will dominate oth-

ers in pairwise comparisons. It also respects Non-

Dictatorship, as no single agent can dictate out-

comes unless their preferences align with the unan-

imous preference of all agents. Condorcet also sat-

isﬁes Anonymity, as all agents are treated symmet-

rically in the aggregation process. However, Con-

dorcet violates IIA because the addition or removal

of other goods can alter pairwise comparison results.

A Condorcet winner might not always exist (Arrow,

2012). The Borda method equally violates IIA and re-

spects Anonymity, as the scoring mechanism treats all

agents equally. However, Borda fails to satisfy Pareto

efﬁciency because the collective ranking can assign

higher aggregate scores to a good g

that is unani-

mously less preferred than g

. Furthermore, Borda

does not adhere to the Condorcet criterion, as it can

select a good that loses in pairwise comparisons to

another (Arrow, 2012).

As an example consider four agents A =

} and three goods G = {g

} with

the following preferences:

: g

≻ g

, a

: g

≻ g

: g

≻ g

, a

: g

≻ g

Using Borda points are assigned as 2 for ﬁrst

place, 1 for second, and 0 for third:

: 3, g

: 4, g

: 5.

The Borda ranking is:

≻ g

For Condorcet each pair of goods is compared

across agents:

vs. g

: 2 votes for each (Tie),

vs. g

: 2 votes for each (Tie),

vs. g

: 2 votes for each (Tie).

Since no good wins all pairwise comparisons, no

Condorcet winner exists.

4.2 Preference Classiﬁers

Classiﬁers in Machine Learning analyse data fea-

tures to learn patterns that distinguish between differ-

ent categories. Features are measurable attributes or

Prediction-Based Selective Negotiation for Reﬁning Multi-Agent Resource Allocation

659

properties of the data that are considered relevant for

the classiﬁcation task. The classiﬁer evaluates these

features to identify patterns or decision boundaries

that separate classes.

A classiﬁer can be formalized as a function f :

X → Y , where:

• X is the input space, representing the set of possi-

ble feature vectors, x = (x

,. . . ,x

) ∈ X , where

d is the number of features.

• Y is the output space, which contains the set

of possible labels or classes, typically Y =

{1,2,. . . ,C}, where C is the number of classes.

• f (x;θ) is the decision function, parameterized by

θ, which maps input features x to a predicted class

ˆy ∈ Y .

The classiﬁer is trained on a dataset D =

{(x

)}

i=1

, where x

∈ X are feature vectors and

∈ Y are the corresponding ground truth labels. The

objective during training is to optimize the parameters

θ by minimizing a loss function L, i.e. the discrepancy

between the predicted labels ˆy

= f (x

;θ) and the true

labels y

θ = argmin

∑

i=1

L( f (x

;θ), y

The classiﬁer can also incorporate a hypothesis

space H , representing the set of all possible decision

functions f that can be chosen given the parameteri-

zation:

f ∈ H , H = { f (x; θ) | θ ∈ Θ},

where, Θ is the parameter space.

Once trained, the classiﬁer predicts the label for a

new input x

∗

by computing:

ˆy = argmax

y∈Y

P(y | x

∗

;

θ),

where, P(y | x

∗

;

θ) represents the model’s estimated

probability of class y for the input x

∗

, given the opti-

mized parameters

θ.

The Predictive-Based Resource Allocation Problem

extends the traditional resource allocation problem by

incorporating classiﬁers to predict preferences based

on agent feature as follows:

(A, G, M,F , C ),

where:

• A = {a

,. . . ,a

} is the set of n agents.

• G = {g

,. . . ,g

} is the set of m goods.

• M = {m

,. . . ,m

} speciﬁes the maximum

number of agents m

that can receive each good

, satisfying

∑

j=1

≥ n.

• F is the feature space, where each agent a

is as-

sociated with a feature vector x

= ( f

,. . . , f

• C is the set of classiﬁers, where each classiﬁer

: X → Y predicts the preference rankings of

agents based on their features.

For each agent a

, the predicted preference rank-

ing over goods is given by:

ˆv

= f

where f

is a classiﬁer applied to x

, and ˆv

deﬁnes

the predicted strict ranking of goods G.

An allocation O = {O

,. . . ,O

} maps subsets of

goods to agents, satisfying the multiplicity:

∑

∈A

∈O

≤ m

, ∀g

∈ G,

where 1

∈O

= 1 if g

∈ O

, and 0 otherwise.

The satisfaction of agent a

is a function S

that as-

sociates to each subset G

′

⊆ G of goods a positive real

value S

′

). Note that here, the individual satisfac-

tion S

of agent a

depends on the predicted preference

ranking ˆv

. The satisfaction of an allocation O is the

sum of the satisfaction of the agents for their respec-

tive allocations: S(O) =

∑

i=1

). The objective is

to ﬁnd an allocation O

∗

that maximizes satisfaction.

5 OPTIMISING THE

ALLOCATION

The algorithm introduced in this section operates in

four phases, each presented as a distinct algorithm

for lisibility. The ﬁrst phase, which corresponds to

the naive allocation based on the lexicographical or-

der of agents, is identical to the algorithm described

in Section 3 and is repeated here for readability. In

the second phase, the Borda aggregation is com-

puted, which is then used in the third phase to per-

form a Borda-based allocation of goods. Finally, in

the fourth phase, the allocation is reﬁned through

classiﬁer-driven swaps to further enhance satisfaction.

The last two phases form the two-step framework for

prediction-based reﬁnement of the allocation.

In the second phase, the Borda method is applied

to aggregate individual preferences into a collective

ranking of goods. Each good receives a score based

on its rank in the agents’ preference lists. The goods

are then ordered by their scores to produce a collective

preference ranking that reﬂects the priorities of the

group.

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

660

Input: A = {a

,. . . , a

}: Set of agents;

G = {g

,. . . , g

}: Set of goods;

M = {m

,. . . , m

}: Multiplicities of goods;

,. . . , v

}: Agents’ preferences over

goods;

A lexicographical order on A;

Output: Initial allocation O = {O

,. . . , O

}

Construct allocation O using Algorithm in

Section 3 or using any else Heuristic;

Compute overall satisfaction S(O) of the initial

allocation;

return O;

Algorithm 2: Phase 1: Initial Allocation.

Input: A = {a

,. . . , a

}: Set of agents;

G = {g

,. . . , g

}: Set of goods;

,. . . , v

}: Agents’ preferences over

goods;

Output: Aggregated preference v

foreach g

∈ G do

Compute Borda score:

B(g

) =

∑

i=1

score(g

in v

)

end

Sort G in descending order of Borda scores to

form aggregated preference v

;

return v

;

Algorithm 3: Phase 2: Aggregated Preferences (Borda

Method).

In the third phase, the algorithm reﬁnes the Borda

allocation to improve overall satisfaction according to

the collective preferences. Starting with the most pre-

ferred good in the collective ranking, the algorithm

allocates goods to agents in a way that maximizes the

number of agents receiving goods they rank highly.

This process continues for successive goods until ei-

ther the goods are exhausted or further allocations no

longer align with the collective ranking.

In the ﬁnal phase, predictive models are used to

adjust the allocation further. For each agent, a classi-

ﬁer predicts preferences based on their feature proﬁle.

Agents whose predicted preferences differ from their

true preferences are identiﬁed, and the algorithm pro-

poses swaps with other agents in order to increase the

overall satisfaction of the allocation. The swaps are

done iteratively to improve the alignment of the allo-

cation. More precisely, a swap between agents a

and

means that goods ginO

and g

′

∈ O

are identiﬁed

such that

((O

− {g}) ∪ {g

′

}) + S

((O

− {g

′

}) ∪ {g}) >

) + S

Input: A = {a

,. . . ,a

}: Set of agents;

G = {g

,. . . ,g

}: Set of goods;

Aggregated preference v

;

Current allocation O;

′

,. . . ,m

′

}: Current multiplicity of

available (m

′

> 0) goods;

Output: Updated allocation O

foreach g

∈ G (in v

-descending order) do

foreach a

∈ A do

if m

′

> 0 and g

/∈ O

then

Allocate g

to a

: O

← O

∪ {g

};

Update m

′

← m

′

− 1;

end

return O;

Algorithm 4: Phase 3: Satisfaction Aggregated Preference

Boosting.

Input: A = {a

,. . . ,a

}: Set of agents;

G = {g

,. . . ,g

}: Set of goods;

Feature space F ;

Classiﬁers C ; Current allocation O;

Output: Optimized allocation O

∗

foreach a

∈ A do

Predict preferences using classiﬁer:

ˆv

= f

)

if ˆv

̸= v

then

Identify agents a

such that a swap with

is possible;

Propose swaps between a

and a

improve satisfaction;

Update O

and O

if swaps are accepted;

end

return O

∗

= O;

Algorithm 5: Phase 4: Optimization.

The algorithm outputs an optimized allocation that

improves on the naive approach by incorporating both

collective preferences and predictions. In each of the

Phases 1, 3 and 4, an implicit assumption is made:

each (individual) satisfaction function S

of the agent

is monotone:

if G

⊆ G

⊆ G then S

) ≤ S

Then, it is not difﬁcult to see that the following theo-

rem holds.

Theorem 1. Let O be the naive initial allocation and

∗

the ﬁnal allocation produced by the Predictive-

Based Resource Allocation Algorithm. If all satisfac-

tion function S

of the agents are monotone, then

S(O

∗

) ≥ S(O),

where S(·) is the overall satisfaction of an allocation.

Prediction-Based Selective Negotiation for Reﬁning Multi-Agent Resource Allocation

661

6 FUTURE WORK

In this paper, we proposed a predictive-based resource

allocation algorithm that combines machine learning

predictions with preference aggregation techniques to

optimize resource allocation in multi-agent systems.

• Experimental validation is necessary to assess the

framework performance in diverse practical set-

tings (cloud resource allocation, logistics, pub-

lic goods distribution etc.). This includes a for-

mal computational complexity analysis evaluating

scalability of the proposed algorithm as well as

execution time, memory usage, and performance

to assess the scalability of your approach while

varying distributions of preferences and goods.

• Apart from the theoretical study of the properties

such as envy-freeness or equitability another no-

tion that should be investigated is that of “stable”

allocation (the agents do not have any incentive to

further swap).

• When several agents are eligible for a swap based

on their proﬁles and predicted preferences, crite-

ria for choosing the most appropriate participants

need to be deﬁned. This decision introduces po-

tential concerns regarding ethics and bias, particu-

larly when prioritizing agents could inadvertently

favor certain groups over others (Hurwicz, 1973).

• Furthermore, the notion of overall satisfaction

could be reﬁned to incorporate subpopulation-

speciﬁc goals. For instance, rather than optimiz-

ing global satisfaction, the algorithm could priori-

tize improving the satisfaction of speciﬁc subpop-

ulations (e.g., AI-agents vs humans in hybrid soci-

eties) based on implicit or explicit norms (Aldew-

ereld et al., 2016). Similarly to as above, this issue

is directly related to the fairness and ethical con-

cerns of our approach.

REFERENCES

Aldewereld, H., Dignum, V., and Vasconcelos, W. W.

(2016). Group norms for multi-agent organisations.

ACM Transactions on Autonomous and Adaptive Sys-

tems (TAAS), 11(2):1–31.

Arrow, K. J. (2012). Social choice and individual values,

volume 12. Yale university press.

Brandt, F., Conitzer, V., Endriss, U., Lang, J., and Procac-

cia, A. D. (2016). Handbook of computational social

choice. Cambridge University Press.

Chevaleyre, Y., Dunne, P. E., Endriss, U., Lang, J.,

Lemaitre, M., Maudet, N., Padget, J., Phelps, S.,

Rodr

ıgues-Aguilar, J. A., and Sousa, P. (2005). Issues

in multiagent resource allocation.

Croitoru, M. and Croitoru, C. (2011). Generalised net-

work ﬂows for combinatorial auctions. In 2011

IEEE/WIC/ACM International Conferences on Web

Intelligence and Intelligent Agent Technology, vol-

ume 2, pages 313–316. IEEE.

Dafoe, A., Hughes, E., Bachrach, Y., Collins, T., Mc-

Kee, K. R., Leibo, J. Z., Larson, K., and Graepel,

T. (2020). Open problems in cooperative ai. arXiv

preprint arXiv:2012.08630.

Hurwicz, L. (1973). The design of mechanisms for resource

allocation. The American Economic Review, 63(2):1–

30.

Ibaraki, T. and Katoh, N. (1988). Resource allocation prob-

lems: algorithmic approaches. MIT press.

Jordan, M. I. and Mitchell, T. M. (2015). Machine learn-

ing: Trends, perspectives, and prospects. Science,

349(6245):255–260.

Katoh, N. and Ibaraki, T. (1998). Resource allocation

problems. Handbook of Combinatorial Optimization:

Volume1–3, pages 905–1006.

Li, J., Ni, X., Yuan, Y., and Wang, F.-Y. (2018). A hier-

archical framework for ad inventory allocation in pro-

grammatic advertising markets. Electronic Commerce

Research and Applications, 31:40–51.

Vinothina, V. V., Sridaran, R., and Ganapathi, P. (2012). A

survey on resource allocation strategies in cloud com-

puting. International Journal of Advanced Computer

Science and Applications, 3(6).

Zhang, J., Zhang, M., Ren, F., and Liu, J. (2016). An in-

novation approach for optimal resource allocation in

emergency management. IEEE Transactions on Com-

puters.

ICAART 2025 - 17th International Conference on Agents and Artiﬁcial Intelligence

662