Publications

Publication list sorted by year with filters for type and tags.

2026 | Conference | ACL 2026

Reliability Evaluation of Tool-Augmented Multilingual Agents

Taro Yamada, Alice Smith, Kenji Sato

Tags: LLM, Agents, Multilingual

We analyze failure modes of tool-augmented LLM agents in multilingual settings and introduce a reliability benchmark.

Links: PDF, DOI, arXiv, Code

2025 | Journal | TACL

Hallucination Control with Evidence-Grounded Evaluation

Mika Tanaka, Taro Yamada, Robert Lee

Tags: Hallucination, Evaluation, RAG

We reduce factual hallucinations by explicitly training models in evidence-grounded reasoning.

Links: PDF, DOI, arXiv

2024 | Preprint | arXiv

Safe Response Optimization for Dialogue Models

Taro Yamada, Emi Kobayashi

Tags: Safety, Dialogue

We propose a safe response optimization method combining human feedback and constrained decoding.

Links: PDF, arXiv, Code

2023 | Conference | EMNLP 2023

Domain Adaptation of Japanese LLMs for Low-Resource Settings

Haruto Watanabe, Taro Yamada, Yuki Ito

Tags: Japanese, Domain Adaptation, Low-resource

We evaluate continual pretraining and instruction tuning on low-resource domain corpora.