How well does ChatGPT understand spine clinical guidelines?

Different versions of ChatGPT addressed clinical questions about lumbar disc herniation with radiculopathy differently, according to a study published in the September issue of the North American Spine Society Journal.

Advertisement

Researchers asked ChatGPT 3.5 and ChatGPT 4.0 15 questions from the 2012 NASS Clinical Guidelines for diagnosing and treating lumbar disc herniation with radiculopathy. Two independent authors assessed the accuracy, over-conclusiveness, supplementary and incompleteness of their outputs.

Of the 15 responses from ChatGPT 3.5, 47% were accurate, 47% were over-conclusive and 40% were incomplete. All were supplementary. The study found a “statistically significant difference in supplementary information” between ChatGPT 3.5 and ChatGPT 4.0.

The study concluded that “ChatGPT-4.0 provided less supplementary information and overall higher accuracy in question categories than ChatGPT-3.5. ChatGPT showed reasonable concordance to NASS guidelines, but clinicians should caution use of ChatGPT in its current state as it fails to safeguard against misinformation.”

At the Becker's 23rd Annual Spine, Orthopedic and Pain Management-Driven ASC + The Future of Spine Conference, taking place June 11-13 in Chicago, spine surgeons, orthopedic leaders and ASC executives will come together to explore minimally invasive techniques, ASC growth strategies and innovations shaping the future of outpatient spine care. Apply for complimentary registration now.

Advertisement

Next Up in Spine

  • Becker’s reported on multiple spine and neurosurgeons moved to new practices or added to their titles in May. Note: This…

  • Spine surgeon Babajide Ogunseinde, MD, published his PML Trilogy,, according to a May 28 news release. The three-book series covers…

  • Becker’s reported on six debut spine milestones at the regional and global scale in May from new devices and techniques.…

Advertisement

Comments are closed.