Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Performance of 5 AI Models on United States Medical Licensing Examination Step 1 Questions: Comparative Observational Study

2026·0 Zitationen·JMIR AIOpen Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2026

Jahr

Abstract

AI models showed varying strengths across domains, with Grok demonstrating the highest accuracy and consistency in this dataset, particularly for image-based and reasoning-heavy questions. Although ChatGPT-4 remains widely used, newer models like Grok and Copilot also performed competitively. Continuous evaluation is essential as AI tools rapidly evolve.

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationExplainable Artificial Intelligence (XAI)Intelligent Tutoring Systems and Adaptive Learning

Volltext beim Verlag öffnen

Performance of 5 AI Models on United States Medical Licensing Examination Step 1 Questions: Comparative Observational Study

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen