← 返回大厅
arXiv (CS.CL) 2026-06-25 12:00 DOI: arXiv:2606.25436

Evaluating Japanese Dialect Robustness Across Speech and Text-based Large Language Models

摘要 / Abstract

Dialogue systems based on large language models (LLMs) have advanced significantly in recent years. However, dialectal variation remains a major challenge, particularly for systems that process spoken input. LLM-based speech language models (SLMs), which integrate LLMs with speech processing components, show promise for spoken language tasks, yet their ability to comprehend dialects has not been sufficiently studied. Moreover, it remains unclear how the dialectal understanding of the base LLM affects SLM performance. This study investigates the dialectal robustness of both LLMs and SLMs using Japanese dialects as a test case. We define robustness as the ratio of performance on dialectal versus standard inputs, enabling fair comparisons. Our experiments show that SLM robustness correlates with that of their text-based counterparts. Furthermore, training with dialectal data and fine-tuning the speech encoder each improves robustness in SLMs.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。