For every example, the LLM could reason and in-
fer the task to be performed, defining a plan of action
to achieve it. This showed the potential of LLMs as
reasoning and planning mechanisms in ad hoc envi-
ronments.
For future works, we intend to adjust the prompt to
prevent the LLM from ’cheating,’ which is adjusting
the functionalities to address unanticipated tasks that
are not entirely equivalent. We also want to include
more complex scenarios and more examples, espe-
cially the ones in real-world settings, receiving data
from different sensors (cameras and microphones)
and connecting the tools to the robot operating sys-
tem so that it can actually perform the actions.
ACKNOWLEDGEMENTS
This work was partially funded by the National Coun-
cil for Scientific and Technological Development
(CNPQ), under grant number 141809/2020-5. In ad-
dition, this material is based upon work supported
by the Air Force Office of Scientific Research under
award number FA9550-22-1-0475.
REFERENCES
Barrett, S., Rosenfeld, A., Kraus, S., and Stone, P. (2017).
Making friends on the fly: Cooperating with new
teammates. Artificial Intelligence, 242:132–171.
Briggs, J. and Ingham, F. (2023). Langchain: Introduction
and getting started. Pinecone.
Brohan, A., Brown, N., Carbajal, J., Chebotar, Y., Chen,
X., Choromanski, K., Ding, T., Driess, D., Dubey,
A., Finn, C., et al. (2023). Rt-2: Vision-language-
action models transfer web knowledge to robotic con-
trol. arXiv preprint arXiv:2307.15818.
Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K.
(2019). Bert: Pre-training of deep bidirectional trans-
formers for language understanding.
Huang, S., Jiang, Z., Dong, H., Qiao, Y., Gao, P., and Li, H.
(2023). Instruct2act: Mapping multi-modality instruc-
tions to robotic actions with large language model.
arXiv preprint arXiv:2305.11176.
Melo, F. S. and Sardinha, A. (2016). Ad hoc teamwork
by learning teammates’ task. Autonomous Agents and
Multi-Agent Systems, 30:175–219.
Min, S. Y., Chaplot, D. S., Ravikumar, P., Bisk, Y., and
Salakhutdinov, R. (2021). Film: Following instruc-
tions in language with modular methods. arXiv
preprint arXiv:2110.07342.
Mirsky, R., Carlucho, I., Rahman, A., Fosong, E., Macke,
W., Sridharan, M., Stone, P., and Albrecht, S. V.
(2022). A survey of ad hoc teamwork research. In
European Conference on Multi-Agent Systems (EU-
MAS).
OpenAI (2023). Gpt-4 technical report.
Ravula, M. C. R. (2019). Ad-hoc teamwork with behavior-
switching agents. PhD thesis, University of Texas.
Ribeiro, J. G., Faria, M., Sardinha, A., and Melo, F. S.
(2021). Helping people on the fly: Ad hoc team-
work for human-robot teams. In Progress in Artificial
Intelligence: 20th EPIA Conference on Artificial In-
telligence, EPIA 2021, Virtual Event, September 7–9,
2021, Proceedings 20, pages 635–647. Springer.
Ribeiro, J. G., Rodrigues, G., Sardinha, A., and Melo,
F. S. (2023). Teamster: Model-based reinforcement
learning for ad hoc teamwork. Artificial Intelligence,
324:104013.
Vemprala, S., Bonatti, R., Bucker, A., and Kapoor, A.
(2023). Chatgpt for robotics: Design principles and
model abilities. Microsoft Auton. Syst. Robot. Res,
2:20.
Xiong, H., Bian, J., Yang, S., Zhang, X., Kong, L., and
Zhang, D. (2023). Natural language based context
modeling and reasoning with llms: A tutorial. arXiv
preprint arXiv:2309.15074.
Xu, M., Huang, P., Yu, W., Liu, S., Zhang, X., Niu, Y.,
Zhang, T., Xia, F., Tan, J., and Zhao, D. (2023). Cre-
ative robot tool use with large language models.
Zhao, P., Jin, Z., and Cheng, N. (2023). An in-depth survey
of large language model-based artificial intelligence
agents. arXiv preprint arXiv:2309.14365.
Zheng, Z., Wang, M., and Zixia Jia, B. T. (2023). Langsuit
e: Controlling, planning, and interacting with large
language models in embodied text environments.
Experimenting with Planning and Reasoning in Ad Hoc Teamwork Environments with Large Language Models
445