MRI-Derived Radiomics For Classifying Breast Cancer Molecular Subtypes: a Modeling Approach.
Tran Thi Hue, Nguyen Thu Huong, Tran Quoc Long, Nguyen Duy Hung
Abstract
Open AccessBackground: Breast cancer is a biologically heterogeneous disease with four major molecular subtypes that determine prognosis and treatment strategies. MRI-based radiomics provides a non-invasive method to predict these subtypes by quantifying tumor heterogeneity. Objective: To develop and validate a logistic-regression model using MRI-derived radiomic features to predict four molecular subtypes of invasive breast cancer: luminal A (LA), luminal B (LB), HER2-enriched (HER2), and triple-negative breast cancer (TNBC). Methods: A retrospective cohort of 169 patients with histologically proven invasive breast carcinoma who underwent pre-treatment dynamic contrast-enhanced MRI (DCE-MRI, 3.0 T) was analyzed. Tumors were manually segmented; radiomic texture features were extracted with LIFEx and standardized by z-score normalization. Feature selection was performed using L1-regularized logistic regression (LASSO). Four one-vs-rest logistic-regression models were trained with 5-fold cross-validation. Performance metrics included AUC, sensitivity, specificity, accuracy, and precision. Results: The models achieved AUCs of 0.840 (TNBC), 0.788 (HER2), 0.661 (LA), and 0.635 (LB). TNBC showed the highest accuracy (0.923), whereas LB had the lowest sensitivity (0.393). Confusion matrices revealed good classification for TNBC and HER2 but frequent misclassification between LA and LB. TNBC-related features were largely intensity- and entropy-based. Conclusion: MRI-derived radiomic signatures can non-invasively differentiate breast-cancer molecular phenotypes, with particularly strong performance for TNBC and HER2. Although LA-LB separation remains limited, the LASSO-logistic-regression framework offers moderate-to-high diagnostic accuracy and potential value as a complementary precision-oncology decision-support tool.