Towards conversational diagnostic artificial intelligence

Tao Tu, Mike Schaekermann, Anil Palepu, Khaled Saab, Jan Freyberg, Ryutaro Tanno, Amy Wang, Brenna Li, Mohamed Amin, Yong Cheng, Elahe Vedadi, Nenad Tomašev, Shekoofeh Azizi, K. K. Singhal, Le Hou...

Nature Portfolio (2025) • Volume 642, Issue 8067, Pages 442-450

Method-Tool • RCT • PDF Available • Grade Eligible • ⚠️ Moderate Risk Flags

Overall Assessment

Adequate Methodological Quality

Assessment created by PaperScorers Medical AI v0.1.0 on Dec 22, 2025

C- (59/100)

Key Takeaways

  • AMIE outperformed primary care physicians (PCPs) on top-k differential diagnosis (DDx) accuracy across 159 OSCE scenarios (for all k; FDR-corrected P < 0.05).
  • Patient-actors and specialist physicians rated AMIE higher than PCPs on most communication and management axes.
  • The design was a randomised, double-blind crossover study; the statistical analysis used bootstrap and Wilcoxon tests with false discovery rate (FDR) correction.
  • Transparency is limited: there is no preregistration, the code is closed, and the evaluation data are partly restricted.
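
The takeaways above lean on two quantities, top-k accuracy and FDR-corrected P values, without showing how either is computed. The following is a minimal sketch, not the paper's implementation. It assumes that a differential diagnosis counts as a hit when the ground-truth label appears verbatim in the first k entries, and it assumes Benjamini-Hochberg as the FDR procedure (this summary does not name the method used). The function names `top_k_accuracy` and `benjamini_hochberg` are hypothetical.

```python
from typing import List, Sequence


def top_k_accuracy(ranked_ddx: Sequence[Sequence[str]],
                   truths: Sequence[str], k: int) -> float:
    """Fraction of scenarios whose true diagnosis is among the first k entries.

    Assumption: a hit is an exact string match; a real evaluation may rely
    on expert judgment of whether two diagnoses are equivalent.
    """
    hits = sum(truth in ddx[:k] for ddx, truth in zip(ranked_ddx, truths))
    return hits / len(truths)


def benjamini_hochberg(p_values: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

A comparison is then reported as significant at the 5% level when its adjusted value falls below 0.05. Correcting across all values of k, rather than testing each k in isolation, is what keeps the "all k" claim from being inflated by multiple comparisons.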

Conclusion

A strong, carefully analysed OSCE experiment for a medical LLM; promising performance but limited generalisability and openness.

Quality Dimensions

Integrity & Transparency

Premise

Literature Positioning

Study Provenance

Methodological Assessment

Disclaimer: This assessment is generated by AI and should not be the sole basis for clinical or research decisions. Always review the original paper and consult with domain experts.

