arXiv (CS.AI) 2026-06-11 12:00 DOI: arXiv:2606.11672

Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

Abstract

arXiv:2606.11672v1 (announce type: cross)

This paper explores the value of agentic AI tools for cybersecurity purposes. We evaluate the efficacy of a general-purpose agent based on a generative AI (GenAI) large language model (LLM) when powered, in turn, by each of three Ollama-hosted, general-purpose open-source models. We assess each agent's performance using precision, recall, false positive count, and a composite score derived from the interplay of these metrics, and compare it against the baseline performance of Bandit, an existing, vetted Static Application Security Testing (SAST) tool. Our findings refute the notion that a modern open-source GenAI LLM-based agent is currently suitable for the specialized task of SAST scanning under realistic conditions.
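
To make the named metrics concrete, the following is a minimal sketch of how precision, recall, and false positive count could be computed for one scanner against a labeled set of known vulnerabilities. It is not the authors' evaluation harness: the `Finding` type, the `score` function, the matching rule (exact file, line, and rule label), and the sample data are all assumptions made for illustration. The abstract does not define the composite score, so it is left out.

```python
from typing import Iterable, NamedTuple


class Finding(NamedTuple):
    """Hypothetical record for one reported or known issue."""
    path: str
    line: int
    rule: str  # illustrative rule label, e.g. a Bandit-style test ID


def score(reported: Iterable[Finding], ground_truth: Iterable[Finding]) -> dict:
    """Compare a scanner's findings with labeled ground truth.

    Assumption: a finding counts as a true positive only if its
    file, line, and rule label all match a ground-truth entry.
    """
    reported_set = set(reported)
    truth_set = set(ground_truth)

    true_positives = len(reported_set & truth_set)
    false_positives = len(reported_set - truth_set)
    false_negatives = len(truth_set - reported_set)

    # Guard against division by zero when nothing is reported or known.
    flagged = true_positives + false_positives
    actual = true_positives + false_negatives
    precision = true_positives / flagged if flagged else 0.0
    recall = true_positives / actual if actual else 0.0

    return {
        "precision": precision,
        "recall": recall,
        "false_positives": false_positives,
    }


# Made-up sample data: four known issues, three reported findings.
truth = [
    Finding("app.py", 12, "B602"),
    Finding("app.py", 40, "B105"),
    Finding("db.py", 7, "B608"),
    Finding("cache.py", 21, "B301"),
]
reported = [
    Finding("app.py", 12, "B602"),
    Finding("db.py", 7, "B608"),
    Finding("util.py", 3, "B101"),  # not in the ground truth
]
result = score(reported, truth)
```

With this sample data, two of the three reported findings are correct and two of the four known issues are found, so a scanner can look precise while still missing half of the vulnerabilities. This is why the false positive count and recall are reported alongside precision rather than folded into a single number.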
