Purpose: The aim of this work is to compare different machine learning models for predicting pathological complete response (pCR) to neoadjuvant chemotherapy (NAC) in breast cancer using radiomics features from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI). Method: The study included 55 patients with breast cancer, among whom 18 achieved pCR and 37 did not respond completely to NAC (non-pCR). After some pre-processing steps, 1446 features were extracted and corrected for batch effects using ComBat. Five machine learning algorithms, namely random forest (RF), decision tree (DT), logistic regression (LR), k-nearest neighbors (k-NN), and extreme gradient boosting (XGB), were evaluated using area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, and F1 score as classification metrics. A Leave-Group-Out cross validation (LGOCV) was applied in the outer loop. Results: RF and DT models exhibited the highest performances compared to the other algorithms. DT achieved an accuracy of 0.96 ± 0.07, and RF achieved 0.95 ± 0.05. The AUC values for RF and DT were 0.98 ± 0.06 and 0.94 ± 0.07, respectively. LR and k-NN demonstrated lower performance across all metrics, while XGB showed competitive results but slightly lower than RF and DT. Conclusions: This study demonstrates the potential of radiomics and machine learning for predicting pCR to NAC in breast cancer. RF and DT models proved to be the most effective in capturing underlying patterns in radiomics data. Further research is required to validate and strengthen the proposed approach and explore its applicability in diverse radiomics datasets and clinical scenarios.

Comparative analysis of machine learning models for predicting pathological complete response to neoadjuvant chemotherapy in breast cancer: An MRI radiomics approach

D'Anna A.;Gueli A. M.;Pulvirenti A.;Stella G.
2025-01-01

Abstract

Purpose: The aim of this work is to compare different machine learning models for predicting pathological complete response (pCR) to neoadjuvant chemotherapy (NAC) in breast cancer using radiomics features from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI). Method: The study included 55 patients with breast cancer, among whom 18 achieved pCR and 37 did not respond completely to NAC (non-pCR). After some pre-processing steps, 1446 features were extracted and corrected for batch effects using ComBat. Five machine learning algorithms, namely random forest (RF), decision tree (DT), logistic regression (LR), k-nearest neighbors (k-NN), and extreme gradient boosting (XGB), were evaluated using area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, and F1 score as classification metrics. A Leave-Group-Out cross validation (LGOCV) was applied in the outer loop. Results: RF and DT models exhibited the highest performances compared to the other algorithms. DT achieved an accuracy of 0.96 ± 0.07, and RF achieved 0.95 ± 0.05. The AUC values for RF and DT were 0.98 ± 0.06 and 0.94 ± 0.07, respectively. LR and k-NN demonstrated lower performance across all metrics, while XGB showed competitive results but slightly lower than RF and DT. Conclusions: This study demonstrates the potential of radiomics and machine learning for predicting pCR to NAC in breast cancer. RF and DT models proved to be the most effective in capturing underlying patterns in radiomics data. Further research is required to validate and strengthen the proposed approach and explore its applicability in diverse radiomics datasets and clinical scenarios.
2025
Breast cancer
Dynamic contrast-enhanced magnetic resonance imaging
Neoadjuvant chemotherapy
Pathological complete response
Predictive models
Radiomics
Treatment optimization
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S1120179725000419-main.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 2.13 MB
Formato Adobe PDF
2.13 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/662249
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact