Introduction

Cervical cancer is the fourth most common malignancy among women worldwide [1]. The two primary histological subtypes are squamous cell carcinoma (SCC) and adenocarcinoma (AC), accounting for approximately 70-75% and 10-25% of cases, respectively. Notably, the incidence of AC has been steadily rising over recent decades [2]. Understanding the distinctions between AC and SCC from multiple perspectives is essential, as these differences can inform more personalized treatment strategies. Wu et al. demonstrated that treatment modality influences survival outcomes in cervical cancer depending on histopathological subtype, with surgery offering a survival advantage in AC [3].

Pathological examination remains the gold standard for diagnosing cervical cancer [4]. Considering the limited availability of pathologists, delays in pathology reporting, substantial workload demands, observer variability, and the additional time and cost associated with immunohistochemical analyses, there is a growing need for tools capable of providing faster and more standardized diagnostic outcomes. Distinguishing between AC and SCC using the current standard imaging methods – such as magnetic resonance imaging (MRI), positron emission tomography/computed tomography (PET/CT), and ultrasound (USG) – remains a challenge for radiologists, often resulting in low interobserver consistency [5,6].

MRI is the preferred imaging modality for the pretreatment staging of cervical cancer and plays a central role in contemporary management algorithms. Its superior soft-tissue contrast enables accurate evaluation of tumor size, depth of stromal invasion, parametrial and vaginal extension, as well as adjacent organ involvement – features that constitute key components of the International Federation of Gynecology and Obstetrics staging system and serve as critical determinants in treatment planning [7].

Radiomics, an emerging and rapidly evolving field of research, addresses this challenge by extracting quantitative information from medical images and transforming it into high-dimensional datasets for advanced analysis. This approach allows a more comprehensive characterization of disease features and provides substantial support for clinical decision-making [8,9]. Medical imaging is widely used to facilitate the diagnosis and staging of numerous conditions; in cervical cancer, pretreatment MRI is typically performed to evaluate the tumor, monitor its progression, and assess treatment response [10,11]. However, radiological interpretation remains inherently subjective and cannot provide the definitive, objective accuracy offered by histopathological evaluation. There remains a notable lack of studies evaluating postoperative outcomes in patients with SCC compared with those with non-SCC subtypes.

In this research, we aimed to examine the potential use of multiparametric MRI-based radiomics analysis in distinguishing cervical cancer histopathological subtypes, particularly SCC versus non-SCC.

Material and methods

Study subjects and subject screening

Between May 2019 and September 2024, 113 patients who underwent MRI for suspected cervical mass or histopathologically confirmed cervical cancer were identified. Among those, 25 were excluded, either due to loss to follow-up after imaging (n = 10), or non-diagnostic image artifacts on cervical protocol MRI (n = 5), or millimetric tumors not detectable on MRI (n = 10) (Figure 1). MRI examinations, including T1-weighted imaging (T1WI), T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI) and dynamic contrast-enhanced (DCE)-MRI, were performed prior to surgery. Ultimately, 88 patients with cervical cancer (47 SCC and 41 non-SCC) were included in the study. This retrospective study was approved by the institutional ethics committee (approval No. 2/25/889).

Figure 1

Flowchart of patient selection

MRI – magnetic resonance imaging

https://www.polradiol.com/f/fulltexts/216134/PJR-21-216134-g001_min.jpg

MRI acquisition and processing

All patients underwent SIGNA 3T MRI scans prior to surgical treatment. Patients were advised to fast for 8-10 hours prior to examination and were positioned supine with a partially full bladder. The scanning range was appropriately aligned with the cervix, extending from the upper edge of the iliac crest to the level of the bilateral femoral neck, with the axial scanning plane perpendicular to the long axis of the cervix. The imaging sequences included T1WI, T2WI, DWI (b = 50, 800, 1400), and DCE-MRI. Sagittal T2WI was acquired with a repetition time (TR)/echo time (TE) of 2980/106 ms, field of view of 256 × 320 mm and a slice thickness of 3.5 mm. Axial T2WI, oriented perpendicular to the long axis of the cervical canal, was obtained with a TR/TE of 4600/114 ms, field of view of 384 × 288 mm and a slice thickness of 3.3 mm. DWI was performed in the axial plane using single-shot echo-planar imaging with b-values of 50, 800, 1400 s/mm², TR/TE of 2000/79 ms, field of view of 110 × 44 mm, and a slice thickness of 5 mm.

The MRI images (sagittal T2WI, axial T2WI, DWI) of the patients included in the study were uploaded from the hospital’s picture archiving and communication system to the 3DSlicer Version 5.2.2 program stored in the hospital’s internal network. The region of interest was obtained from the area where the largest dimension of the lesion was located on each sequence. Tumor segmentation was performed by a radiologist with 5 years of experience in gynecological imaging and reviewed by a senior radiologist with over 15 years of experience in gynecological imaging. MRI images and segmentations of patients diagnosed with different types of cervical cancer are shown in Supplement Images 1, 2 and 3.

Statistical analysis

Dataset

To evaluate the combined impact of multimodal data fusion, a combined dataset was created which included features from all three sequences. Using a typical radiomics pipeline comprising shape, first-order statistics, and texture features – such as gray-level co-occurrence matrix (GLCM), gray-level run length matrix, and gray-level size zone matrix – the dataset contained 945 extracted radiomic features per patient. Age was a structured variable in the clinical information. For modeling purposes, the target variable was converted into a binary classification problem. The mean imputation technique was used to impute all missing variables. To guarantee uniform scaling across characteristics, radiomic and clinical features were then independently normalized using z-score normalization (standard scaler).

Model training and feature selection

Using radiomic and clinical characteristics, three supervised machine learning (ML) classifiers were used to distinguish between distinct subtypes of cervical cancer: random forest (RF), support vector machine (SVM), and logistic regression (LR). These classifiers were trained using the combined dataset and each MRI sequence (axial T2WI, sagittal T2WI, and DWI). The least absolute shrinkage and selection operator (LASSO) approach was used for feature selection, as it is particularly well suited for high-dimensional radiomic data and helps minimize overfitting while improving model interpretability. By reducing the weights of less informative features to zero, LASSO promotes sparsity by applying an L1 penalty to the regression coefficients. The definition of the optimization problem is:

minβ{12ni=1n(yiXiβ)2+λj=1p|βj|}

where yiis the binary target label, Xi is the feature vector, βj are the model coefficients and l is the regularization parameter [12]. Hyperparameter optimization for all models was performed using GridSearchCV with 5-fold stratified cross-validation, optimizing for receiver operating characteristic (ROC) area under the curve (AUC). The dataset was split into 80% training and 20% test sets, maintaining the label distribution using stratified sampling. All implementations were carried out in Python (v3.9) using the Scikit-learn library.

Evaluation metrics

Both probabilistic (ROC-AUC) and threshold-based metrics (accuracy, precision, recall, and F1-score) were used to evaluate classification performance. All metrics are based on the confusion matrix components:

Accuracy=TP+TNTP+FP+TN+FN
Precision=TPTP+FP
Recall=TPTP+FN
F1-score=2×(Precision×Recall)(Precision+Recall)

False negative (FN), true negative (TN), false positive (FP) and true positive (TP) samples were used for calculation. The F1-score, which is particularly useful for imbalanced datasets and is calculated as the harmonic mean of precision and recall, was used. For imbalanced classes, the area under the precision–recall curve (AUC-PRC), which evaluates the trade-off between precision and recall across thresholds and provides greater utility than ROC-AUC, was used [13]. From the ML classifiers, feature importance was derived for the radiomic features that contributed most. To identify multicollinearity and possible redundancy, radiomic characteristics were correlated using Pearson correlation analysis. Because of the nature of the data (no repeated measures design), Pearson correlation analysis was used instead of intra-class correlation coefficient analysis to evaluate dependability among radiomic characteristics.

Results

The best-performing parameters for the axial T2WI sequence were found as follows: RF with 100 trees, maximum depth of 5, and minimum samples per split set to 5; LR with a regularization strength C = 100; and SVM with a linear kernel and C = 1. For the sagittal T2WI sequence, optimal parameters included: RF with the same configuration as above; LR with C = 100; and SVM with radial basis function (RBF) kernel and C = 0.1. On the DWI sequence, the best settings were: RF with 100 estimators, maximum depth of 5, and minimum samples per split set to 2; LR with C = 100; and SVM with an RBF kernel and C = 10. For the combined model incorporating features from all sequences, the RF achieved the best results with 200 estimators, maximum depth of 5, and minimum samples per split of 2; LR performed best with a more regularized setting C = 0.01; and the optimal SVM configuration used an RBF kernel with C = 1. To impose sparsity and reduce overfitting, LASSO regularization was used for the initial high-dimensional radiomic features. The RF, SVM, and LR models were then trained on all MRI sequences using the 88 predictive features with non-zero coefficients that were produced by this process.

The classification performance for several MRI sequences and classifiers is summarized in Table 1. In the axial T2WI sequence, RF models had low precision (0.60) compared to LR and SVM (0.86), and also showed poor ROC-AUC (0.464), despite achieving higher accuracy and F1-scores in other MRI sequences.

Table 1

Overall test performances of the machine learning architectures for different magnetic resonance imaging (MRI) sequences

MRI SequenceClassifierAccuracyPrecisionRecallF1-scoreROC-AUC
Axial T2WIRF0.780.60.780.680.464
LR0.830.860.830.790.393
SVM0.830.860.830.790.375
Sagittal T2WIRF0.780.600.780.680.750
SVM0.830.820.830.820.839
LR0.780.780.780.780.732
DWIRF0.830.820.830.820.634
SVM0.670.780.670.690.679
LR0.780.740.780.750.500
CombinedRF0.920.930.920.920.958
SVM0.910.920.910.900.953
LR0.910.910.910.910.951

[i] AUC – area under the curve, LR – logistic regression, RF – random forest, ROC – receiver operating characteristic, SVM – support vector machine, T2WI – T2-weighted imaging

In the sagittal T2WI sequence, SVM achieved the highest accuracy (83%), F1-score (82%), and ROC-AUC (0.839). It was also associated with strong balance between recall and precision. LR and RF both achieved moderate performance, with accuracy around 78% and lower F1-scores. However, these values were inferior to SVM. The ROC-AUC values for all classifiers in sagittal T2WI (0.732-0.839) were markedly higher than those obtained from axial T2WI (0.375-0.464).

RF achieved the highest performance among the classifiers for DWI, with an accuracy of 83%, F1-score of 82%, and ROC-AUC of 0.634. Comparing across sequences, DWI was inferior to sagittal T2WI in terms of ROC-AUC (0.634-0.679 vs. 0.732-0.839), but still superior to axial T2WI (0.375-0.464).

Among all the classifiers, the combined sequence model that incorporated features from axial, sagittal, and DWI images demonstrated the best classification performance. RF achieved the highest accuracy (92%), with excellent precision (93%), recall (92%), F1-score (92%), and a near-perfect ROC-AUC of 0.958 for combined sequence. SVM also performed extremely well, with an accuracy of 91%, precision of 92%, recall of 91%, F1-score of 90%, and ROC-AUC of 0.953 in combined sequence. LR yielded comparable performance, with accuracy of 91%, precision of 91%, recall of 91%, F1-score of 91%, and ROC-AUC of 0.951. In contrast to the variable and sometimes limited results of the individual sequences, the combined approach produced uniformly high values across all metrics, with minimal differences between classifiers.

Strong classification performance with reasonably balanced sensitivity and specificity is demonstrated by the confusion matrices of the ML classifiers trained on combined radiomics features, as shown in Figure 2. With high TP and TN rates, the RF model achieved the optimum agreement, correctly detecting 28 out of 28 SCC instances and 21 out of 25 non-SCC cases. With a slightly higher number of FN (5), the LR model likewise demonstrated strong performance, accurately identifying all SCC instances (28/28) and 20 of the 25 non-SCC cases. With just one FN and four FP, the SVM model also showed good discrimination; in terms of sensitivity, it approximately matched RF.

Figure 2

Confusion matrix for machine learning (ML) architectures of combined sequences

https://www.polradiol.com/f/fulltexts/216134/PJR-21-216134-g002_min.jpg

All three ML classifiers trained on LASSO-selected radiomics features from the concatenated sequences show high discriminative ability, as indicated by the ROC curves in Figure 3. Accordingly, the RF, LR, and SVM classifiers achieved ROC-AUC values of 0.96, 0.95, and 0.95, respectively, in distinguishing between SCC and non-SCC groups.

Figure 3

Receiver operating characteristic (ROC) for machine learning (ML) architectures of combined sequences

https://www.polradiol.com/f/fulltexts/216134/PJR-21-216134-g003_min.jpg

In the RF model trained on the combined MRI sequences, the most informative radiomic features included Strength, GrayLevelNonUniformityNormalized, Busyness, ClusterTendency, LargeAreaLowGrayLevelEmphasis, and JointEnergy, as illustrated in Figure 4.

Figure 4

Top 20 Radiomics Feature Importance obtained from random forest (RF) combined sequence

https://www.polradiol.com/f/fulltexts/216134/PJR-21-216134-g004_min.jpg

The Pearson correlation heatmap for the top 20 radiomic characteristics with the highest inter-feature correlations is shown in Figure 5. The matrix shows that a large number of features have substantial correlations, with correlation coefficients higher than 0.95.

Figure 5

Top 20 Correlated Features

https://www.polradiol.com/f/fulltexts/216134/PJR-21-216134-g005_min.jpg

Discussion

The main findings of this study indicate that the sagittal T2WI sequence provides greater diagnostic value than the axial sequence. In particular, the SVM classifier not only achieved accurate classification in most cases but also demonstrated a strong balance between sensitivity and precision on sagittal T2WI, making it especially effective for distinguishing SCC from non-SCC cases across a range of probability thresholds. Moreover, the integration of multiple MRI sequences consistently yielded superior performance, regardless of the classifier used. When features from axial T2WI, sagittal T2WI, DWI, and potentially DCE-MRI were combined, all ML classifiers showed markedly improved performance compared with models trained on any single sequence alone. In the combined model, RF had a slightly higher AUC compared to LR and SVM, but the difference was minimal.

The most accurate image of the typical regional anatomy of the cervix is found on T2WI, where the mucosa and central endocervical glands exhibit hyperintense signal intensity, surrounded by hypointense fibrous stroma and loose stroma that extends to the parametrium with intermediate signal intensity [14]. Cervical tumors, when identifiable on imaging, are most clearly visualized on T2WI, typically presenting as lesions with intermediate signal intensity that stand out against the low-signal cervical stroma. DWI can further facilitate tumor detection, particularly in cases where the lesion appears isointense relative to the surrounding cervical tissue. Several studies have highlighted the incremental benefit of DWI for identifying tumors at initial staging – especially small lesions that may be subtle or isointense on T2WI – as well as for improving the detection of recurrent cervical cancer [15]. In line with the most recent European Society of Urogenital Radiology guidelines, T2WI and DWI constitute the core sequences for cervical cancer MRI. These sequences should ideally be acquired with matching planes, field of view, and slice thickness to enable optimal side-by-side assessment. They are considered essential for initial staging, evaluation of treatment response, and surveillance for recurrence. By contrast, the same recommendations indicate that DCE-MRI may be used selectively and remains optional [16]. Consistent with current recommendations, our study incorporated axial and sagittal T2WI together with DWI.

Tumor heterogeneity is an important prognostic factor, as greater heterogeneity is often associated with enhanced aggressiveness, resistance to therapy, and poorer clinical outcomes [17]. Cervical cancer, in particular, shows considerable heterogeneity at both genetic and histopathological levels [8,18]. Among prognostic factors, histopathological subtype plays a crucial role, as different subtypes are characterized by distinct biological behaviors [19]. Although current National Comprehensive Cancer Network guidelines do not provide different treatment strategies for SCC and AC [20], multiple studies have indicated that AC is associated with a less favorable prognosis [21-23]. Compared to SCC at the same stage, AC tends to be more aggressive, less responsive to chemoradiotherapy, and more prone to metastasis, resulting in poorer outcomes and lower survival rates [21-23]. Therefore, acknowledging these differences is essential for developing precise and individualized treatment strategies, highlighting the importance of subtype-oriented clinical approaches.

Although pathological examination remains the gold standard for subtype differentiation, it has several limitations, including inherent subjectivity, interobserver variability, the need for additional consultation among pathologists, and, in certain cases, the requirement for supplementary immunohistochemical staining – all of which increase both time and cost. Moreover, as pathology relies on small tissue samples, inadequate specimens may necessitate repeated biopsies, leading to additional invasive procedures and further delays. In contrast, radiomics-based, non-invasive imaging may provide potential advantages such as reducing biopsy-related risks, capturing the entire tumor heterogeneity, rather than a limited sample, enabling earlier treatment planning, and potentially reducing time and cost in the long term. However, distinguishing cervical cancer subtypes and assessing tumor heterogeneity remain difficult tasks when relying on conventional non-invasive imaging techniques [8]. Several studies have employed MRI with DWI or fluorodeoxyglucose PET/CT [5,21,24-28]. Some studies showed that the apparent diffusion coefficient (ADC) values were significantly lower in SCC compared to AC [5,21,24], whereas others found no statistically significant differences between the two subtypes [28]. Non-SCC tumors tend to be less conspicuous on MRI, making them more difficult to detect compared to SCC. Consequently, attempts have been made to separate SCC from non-SCC tumors using various imaging modalities [29-31]. This reduced visibility may lead to false negative interpretations, potentially resulting in an underestimation of the required surgical resection. More recently, radiomics has gained attention as a promising tool, enabling the extraction of high-dimensional quantitative features from imaging platforms such as MRI, CT, PET, and USG. By capturing subtle imaging patterns beyond visual assessment, radiomics provides deeper insights into tumor biology and holds considerable potential for diagnostic, prognostic, and predictive applications [32].

In a study by Wang et al., radiomic features derived from three MRI sequences, including T2WI and ADC maps, demonstrated that ACs exhibit greater tissue heterogeneity compared to SCCs; however, shape-based features did not show significant differences between the two histological subtypes [33]. A total of 105 radiomic features were extracted, many of which showed significant differences between AC and SCC. Notably, the sagittal T2WI sequence achieved the highest individual AUC of 0.86. Furthermore, a radiomics model integrating features from five MRI sequences outperformed any single sequence, achieving an AUC of 0.89, with an accuracy of 0.81, sensitivity of 0.67, and specificity of 0.94, highlighting the enhanced discriminative potential of a multi-parametric radiomics approach [33]. In a retrospective study conducted by Wang et al. [34], a multi-parameter MRI-based radiomic signature was developed and validated for the diagnosis of low-differentiated and high-differentiated cervical SCC. Radiomic signatures derived from the combination of DWI and T2WI demonstrated better clinical utility. To distinguish between low-differentiated and high-differentiated cervical SCC, the AUC reached 0.844 in the training cohort, and 0.822 in the validation cohort. In our study, the accuracy for distinguishing SCC from non-SCC subtypes was high (0.92), and the performance of the ML models, along with the evaluation metrics, further demonstrated strong overall diagnostic capability. Unlike previous studies, our analysis additionally included precision, recall, F1-score, and ROC-AUC metrics, resulting in promising outcomes. In our study, combined MRI sequences, based on the combination of all three sequences, instead of a single MRI sequence, yielded the best results through LR, SVM, and RF classifiers from ML models. Among these models, the RF model derived from combined sequences demonstrated the best performance, achieving accuracy, precision, recall, F1-score, and ROC-AUC values of 0.92, 0.93, 0.92, 0.92, and 0.958, respectively. The RF classifier achieved excellent performance, with the highest ROC-AUC of 0.958 and an accuracy of 0.91, outperforming the SVM (0.953) and LR (0.951) models, though the difference was minimal. These results represent the highest performance reported in the literature for the differentiation of cervical cancer subtypes.

GLCM-based features, particularly cluster tendency, busyness, and strength, highlight that textural heterogeneity is critical for differentiating SCC from other histopathological types. Tumor homogeneity and structural irregularity, reflected in zone emphasis and gray-level non-uniformity metrics, serve as important diagnostic indicators. These findings demonstrate that multi-sequence MRI-derived radiomic signatures can identify subtle but distinct imaging biomarkers linked to the pathophysiology of cervical tumors. GLCM features capture comprehensive information about spatial distribution, including direction, distance, gray-level intensity, and structural patterns, with higher scores indicating greater heterogeneity [35].

Unlike previous MRI studies, our analysis revealed significant redundancy among certain radiomic features, as highlighted in the Pearson correlation heatmap. Features such as Difference Entropy, Interquartile Range, Mean Absolute Deviation, and 90th Percentile frequently exhibited strong mutual correlations. To prevent multicollinearity and overfitting in ML models, dimensionality reduction or feature selection methods such as LASSO are necessary. Overall, cervical tumor radiomic profiles appear to be dominated by specific texture and intensity patterns, reflected in tightly clustered feature groups.

Limitations

This study has several limitations. First, it was conducted retrospectively; selection bias cannot be excluded. Second, due to its single-center design, the sample size was relatively limited; however, given that the study was conducted at a tertiary cancer center with a high patient volume and experienced radiology staff, the sample size was still reasonable for a single-center study. Variability in MRI acquisition and segmentation protocols may also affect feature reproducibility. Despite using LASSO for feature selection, the high-dimensional radiomic data carry an inherent risk of overfitting. Nevertheless, the study demonstrates notable strengths. By integrating multi-sequence MRI radiomics (axial T2WI, sagittal T2WI, and DWI), ML classifiers (RF, SVM, LR) achieved excellent accuracy, precision, recall, F1-score, and ROC-AUC values, with RF reaching a near-perfect ROC-AUC of 0.958. Comprehensive evaluation metrics and analysis of GLCM-based texture features provided detailed insights into tumor heterogeneity and biologically relevant imaging biomarkers. The use of LASSO reduced feature redundancy and multicollinearity, enhancing model robustness, while the non-invasive approach captures whole-tumor characteristics compared to limited biopsy samples.

Conclusions

Our study demonstrates that multi-sequence MRI-based radiomics, combined with advanced ML classifiers, can effectively differentiate cervical SCC from non-SCC subtypes with unprecedented accuracy. The integration of axial T2WI, sagittal T2WI, and DWI sequences consistently outperformed single-sequence models, with the RF classifier achieving near-perfect ROC-AUC and balanced performance across all evaluation metrics. GLCM-based texture features and other radiomic biomarkers captured subtle yet clinically relevant tumor heterogeneity, highlighting the potential of radiomics to provide biologically meaningful insights beyond conventional imaging. Those underscore the promise of non-invasive, quantitative imaging for precise tumor characterization, potentially guiding personalized management strategies and reducing reliance on invasive biopsies. These results pave the way for future prospective, multi-center studies to validate and expand the clinical applicability of MRI radiomics in cervical cancer.