Large language models excel in tests yet struggle to guide real patient decisions
A randomized study of 1,298 UK adults found that while large language models perform well on medical tasks alone, they do not improve and can worsen decision-making when used by…