The performance of generative A.I. models for clinical reasoning are not holding up to increased scrutiny https://
arxiv.org/abs/2509.18234 @MSFTResearch https://
ai.nejm.org/doi/full/10.10
56/AIdbp2500120
… @NEJM_AI @AdamRodmanMD @LiamGMcCoy
Generative AI Clinical Reasoning Models Fail Under Scrutiny
By
–
