arXiv (CS.CL) 2026-06-19 12:00 DOI: arXiv:2605.17443

Analyzing Error Propagation in Korean Spoken QA with ASR-LLM Cascades

Abstract

We analyze how automatic speech recognition (ASR) errors propagate through ASR-LLM cascades in Korean spoken question answering (SQA), focusing on downstream semantic failures that conventional ASR metrics cannot fully capture. Our analysis shows that the relative downstream degradation caused by ASR errors is consistent across LLMs with different absolute performance, suggesting that cascade degradation largely tracks ASR-stage information loss. We further identify single-character Korean ASR errors as a Korean-specific loss channel, where even a minimal transcription difference can change the intended question and degrade downstream QA performance. Finally, an auxiliary comparison shows that a large audio language model outperforms an ASR-LLM cascade with an approximately matched language backbone in noisy Korean SQA, indicating the potential of direct audio input to mitigate transcript-induced information loss.
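To illustrate why a single-character error can slip past conventional ASR metrics, the following minimal sketch computes character error rate (CER) with a plain Levenshtein distance. The sentence pair is a hypothetical example chosen for illustration and is not taken from the paper's data; the helper names `edit_distance` and `cer`, and the choice to ignore spaces, are likewise assumptions.

```python
import unicodedata


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance over characters (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate, ignoring spaces (an assumption of this sketch)."""
    # NFC keeps each Hangul syllable as a single code point.
    ref = unicodedata.normalize("NFC", ref).replace(" ", "")
    hyp = unicodedata.normalize("NFC", hyp).replace(" ", "")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical pair: one syllable differs, but the question changes.
reference = "몇 시에 가요"   # "What time do you go?"
hypothesis = "몇 시에 자요"  # "What time do you sleep?"

print(edit_distance(reference.replace(" ", ""), hypothesis.replace(" ", "")))
print(cer(reference, hypothesis))
```

Here one substituted syllable out of five changes the verb, and with it the question an LLM would be asked to answer, which is the kind of semantic failure a transcript-level error rate alone does not expose.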
