cases where redundant additional information worsens
learning accuracy and identified priorities for additional
information in fully and partially observable
In this paper, experiments were conducted with
additional information limited to agent observations
and actions. Future research should consider applying
this approach to other types of additional information.
Furthermore, the information selection
method in this study was defined at runtime and remained
fixed throughout the learning process. Given
the complexity of multi-agent reinforcement learning
(MARL), the critical information is likely to vary
with the learning stage. Therefore,
dynamic selection based on the progress of learning
would be a valuable direction for future work.
