Journal of medical systemsHumansEducational MeasurementChinaEducationMedical
Towards A Fair Duel: Reflections on the Evaluation of DeepSeek-R1 and ChatGPT-4o in Chinese Medical Education.
Shangxuan Li
Published: 202510.1007/s10916-025-02316-7
Abstract
The recent study by Wu et al. (2025) comparing DeepSeek-R1 and ChatGPT-4o on the Chinese National Medical Licensing Examination (CNMLE) provides an important contribution to understanding large language model (LLM) performance in non-English medical…
Preview only. Read the full abstract at the source