← 返回大厅
arXiv (CS.CL) 2026-06-18 12:00 DOI: arXiv:2508.04086

ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"

摘要 / Abstract

Prior work synthesizes tool-use LLM datasets by first generating a user query, followed by complex tool-use annotations like depth-first search (DFS). This leads to inevitable annotation failures and low efficiency in data generation. We introduce ToolGrad, an agentic framework that inverts this paradigm. ToolGrad first constructs valid tool-use chains through an iterative process guided by textual "gradients", and then synthesizes corresponding user queries. This "answer-first" approach led to ToolGrad-500, a dataset generated with more complex tool use, lower cost, and almost 100% pass rate. Experiments show that ToolGrad models outperform those trained on expensive baseline datasets and proprietary LLMs. The ToolGrad source code, dataset, and models are available at https://github.com/zhongyi-zhou/toolgrad.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。