Fault Tolerance through Interaction and Mutual Cooperation in Hierarchical Multi-Agent Systems

Rade Stanković, Maja Štula


Multi-Agent Systems (MASs) are well suited to the development of complex, distributed systems. In essence, a MAS is a distributed system consisting of multiple agents that work together to solve common problems. Failure handling is an important property of a large-scale MAS because the failure rate grows with the number of hosts, the number of deployed agents, and the duration of the agents' task execution. Numerous approaches have been introduced to deal with particular aspects of failure handling. However, the absence of centralized control and the large number of individual intelligent components make it difficult to detect and treat errors. The risk of uncontrolled fault propagation is high, and such propagation can seriously degrade system performance. Although existing research has been extensive, it has not yet addressed the MAS failure handling problem in all its aspects, which keeps the topic open and of continued interest. We propose a concept of agent interaction that enables any hierarchical MAS to become fault tolerant, regardless of the agent framework used.


