2026, Conference, ACL 2026
Reliability Evaluation of Tool-Augmented Multilingual Agents
Taro Yamada, Alice Smith, Kenji Sato
LLM, Agents, Multilingual
We analyze failure modes of tool-augmented LLM agents in multilingual settings and introduce a reliability benchmark.