How well does ChatGPT understand spine clinical guidelines?

Researchers asked ChatGPT 3.5 and ChatGPT 4.0 15 questions from the 2012 NASS Clinical Guidelines for diagnosing and treating lumbar disc herniation with radiculopathy. Two independent authors assessed the accuracy, over-conclusiveness, supplementary and incompleteness of their outputs.

Of the 15 responses from ChatGPT 3.5, 47% were accurate, 47% were over-conclusive and 40% were incomplete. All were supplementary. The study found a “statistically significant difference in supplementary information” between ChatGPT 3.5 and ChatGPT 4.0.

The study concluded that “ChatGPT-4.0 provided less supplementary information and overall higher accuracy in question categories than ChatGPT-3.5. ChatGPT showed reasonable concordance to NASS guidelines, but clinicians should caution use of ChatGPT in its current state as it fails to safeguard against misinformation.”

At the Becker’s 32nd Annual Meeting: The Business and Operations of ASCs, taking place October 29-31 in Chicago, ASC leaders, surgeons and healthcare executives will explore strategies to drive growth, enhance operational performance, navigate reimbursement challenges and prepare for the future of ambulatory surgery. Apply for complimentary registration now.

Next Up in Spine

How well does ChatGPT understand spine clinical guidelines?

160 ambulatory leaders just ranked the EHR as the single system most overdue for AI reinvention

Next Up in Spine

15 rising stars in neurosurgery

The next spine practices likely to sell

Ohio State gets $1.2M grant to study spinal cord immunity

160 ambulatory leaders just ranked the EHR as the single system most overdue for AI reinvention

Next Up in Spine

15 rising stars in neurosurgery

The next spine practices likely to sell

Ohio State gets $1.2M grant to study spinal cord immunity

Join the spine leaders & surgeons who start their day with Becker's