← 返回大厅
arXiv (CS.CL) 2026-06-15 12:00 DOI: arXiv:2606.14325

Achieving Precise Text-To-Cypher Via Grounded Knowledge Graph Data Generation

摘要 / Abstract

Property Graphs are rapidly being adopted as database frameworks for representing heterogeneous data sources. To enable precise access to the information contained in them we need conversational interfaces based on Text-To-Cypher (Text2Cypher) parsers. This paper presents an automatic synthetic data generation method that can be leveraged to fine-tune small LLMs for this task. We conduct experiments on all the major Text-To-Cypher benchmarks, demonstrating that with our synthetic data generation approach we can significantly increase the performance of small LLMs, allowing them to compete with much larger proprietary models. This means that in settings in which models must be locally deployed we can ensure data-sovereignty without sacrificing accuracy and without costly annotation campaigns.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。