AI models fall short in clinical conversations: Harvard study

Large language models like ChatGPT have performed well on medical exams, but they struggle with diagnostic accuracy in real-world clinical interactions.  Becker's Hospital Review