Nature Medicine
2026-06-17 00:00
DOI:
HASH:c74fef73248ceb35532e1704f305e5bf
General-purpose chatbots outperform clinical AI tools on physicians’ real-world questions
Authors:
Unknown author
Abstract
Specialized clinical AI tools are entering medical practice with little independent testing. In a head-to-head evaluation across two public benchmarks and real questions from physicians, three general-purpose frontier large language models outperformed two leading clinical AI tools, which in turn performed no better than Google Search's AI Overview.