Journal of Medical Systems

Keywords: Humans; Educational Measurement; China; Education, Medical

Towards A Fair Duel: Reflections on the Evaluation of DeepSeek-R1 and ChatGPT-4o in Chinese Medical Education.

Shangxuan Li

Published: 2025
DOI: 10.1007/s10916-025-02316-7

Abstract

The recent study by Wu et al. (2025) comparing DeepSeek-R1 and ChatGPT-4o on the Chinese National Medical Licensing Examination (CNMLE) provides an important contribution to understanding large language model (LLM) performance in non-English medical…
