AgentClinic puts medical AI through a more realistic diagnostic test
AgentClinic is a multimodal benchmark that tests clinical AI agents in simulated, dialogue-driven diagnostic settings rather than static medical question-answer formats. The study found that model performance varied sharply by…