← 返回大厅
arXiv (CS.CL) 2026-06-24 12:00 DOI: arXiv:2606.24453

Bayesian control for coding agents

摘要 / Abstract

Modern coding agents pair LLM generators with various tools, including cheap diagnostics and expensive verifiers. The tool-use decisions are typically governed by orchestrators that often use fixed rules and ignore uncertainty. We formulate orchestration as cost-sensitive sequential hypothesis testing: a Bayesian controller maintains a belief over candidate correctness and dynamically decides whether to gather more evidence, refine the candidate, verify it, or stop. Across six generators and nine coding benchmarks, Bayesian control proves to be most valuable when verification is costly and critics are informative but imperfect. Beyond control, the belief state yields an interpretable correctness score that outperforms token-probability and raw tool-success baselines for uncertainty quantification.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。