Original Article

Evaluation of Radiomics and Machine Learning for Classifying Pulmonary Nodules in CT Images

Abstract

Lung cancer is a deadly disease which has high occurrence and death rates, worldwide. Computed Tomography (CT) imaging is being widely used by clinicians for detection of lung cancer. Radiomics extracted from medical images together with Machine Learning (ML) platform has given encouraging results in lung cancer diagnosis. Therefore, this study is proposed with the aim to efficiently apply and evaluate radiomics and ML techniques to classify pulmonary nodules in CT images. Lung Image Data Consortium is utilized in which nodules are given malignancy score 1 through 5 i.e. benign through malignant. Three scenarios are created randomly using these groups: G54 Vs G12, G543 Vs G12, and G54 Vs G123. Radiomics are extracted using Shape, Gray Level Co-occurrence Method, Gray Level Difference Method, and Gray Level Run Length Matrix along with Wavelet Packet Transform. To select a relevant set of features, four techniques i.e. Chi-square test, Analysis of variance, boosted ensemble classification tree and bagged ensemble classification tree are applied. The classification of nodule into benign or malignant is evaluated by using six models of Support vector machine. The results, in Scenario 1, show that MGSVM+Chi-square yields the best outcome compared to rest of the models with 75.3% accuracy, 77.9% sensitivity and 71.5% specificity. In Scenario 2, QSVM+Chi-square yields the best outcome compared to rest of the models with 74.7% accuracy, 70.3% sensitivity and 77.4% specificity. And in Scenario 3, CSVM+BACET yields comparatively better results with70.3% accuracy, 70.6% sensitivity and 62.1% specificity.

1- Ziyad, S.R., Radha, V., Vaiyapuri, T., "Noise removal in lung LDCT images by novel discrete wavelet-based denoising with adaptive thresholding technique." International Journal of E-Health and Medical Communications (IJEHMC), 12(5), pp.1-15, 2021. http://doi.org/10.4018/IJEHMC.20210901.oa1
2- Orozco, H.M., et al., "Automated system for lung nodules classification based on wavelet feature descriptor and support vector machine." Biomedical Engineering Online, 14(1), pp. 1-20, (2015). https://doi.org/10.1186/s12938-015-0003-y
3- Donga, H.V., Karlapati, J.S.A.N., Desineedi, H.S.S., Periasamy, P., TR, S.,"Effective Framework for Pulmonary Nodule Classification from CT Images Using the Modified Gradient Boosting Method" Applied Sciences, 12(16), pp.8264, (2022). https://doi.org/10.3390/app12168264
4- Alzubaidi, M. A., Mwaffaq, O., Jaradat, H., "Comprehensive and comparative global and local feature 1- Ziyad, S.R., Radha, V., Vaiyapuri, T., "Noise removal in lung LDCT images by novel discrete wavelet-based denoising with adaptive thresholding technique." International Journal of E-Health and Medical Communications (IJEHMC), 12(5), pp.1-15, 2021. http://doi.org/10.4018/IJEHMC.20210901.oa1
2- Orozco, H.M., et al., "Automated system for lung nodules classification based on wavelet feature descriptor and support vector machine." Biomedical Engineering Online, 14(1), pp. 1-20, (2015). https://doi.org/10.1186/s12938-015-0003-y
3- Donga, H.V., Karlapati, J.S.A.N., Desineedi, H.S.S., Periasamy, P., TR, S.,"Effective Framework for Pulmonary Nodule Classification from CT Images Using the Modified Gradient Boosting Method" Applied Sciences, 12(16), pp.8264, (2022). https://doi.org/10.3390/app12168264
4- Alzubaidi, M. A., Mwaffaq, O., Jaradat, H., "Comprehensive and comparative global and local feature resonance imaging”, Dialogues in Clinical Neuroscience, 6, pp. 243–250, (2004). https://doi.org/10.31887/DCNS.2004.6.2/amaterka
22- O’Connor, J. P. B. et al., “Imaging intratumor heterogeneity: Role in therapy response, resistance, and clinical outcome”, Clinical Cancer Research, 21, 249–257, (2015). doi:10.1158/1078-0432.CCR-14-0990
23- Mir, A.H., Hanmandlu, M., Tandon, S.N., “Texture analysis of CT images” IEEE Engineering in Medicine and Biology Magazine 14 (6), pp. 781-786, (1995). doi: 10.1109/51.473275.
24- Garcia, C., Zikos, G., Tziritas, G., “Wavelet packet analysis for face recognition Image and Vision Computing”, 18, pp. 289–297, (2000). https://doi.org/10.1016/S0262-8856(99)00056-6.
25- Han, J.G., Ren, W.X., Sun, Z.S.,”Wavelet packet based damage identification of beam structures”, International Journal of Solids and Structures, 42(26), 2005, pp. 6610-6627, (2005). https://doi.org/10.1016/j.ijsolstr.2005.04.031
26- Perlibakas, V., “Face Recognition Using Principal Component Analysis of the Wavelet Packet Decomposition”, Science Direct Working Paper No S1574-034X(04)70005-8, April (2004).
27- Daubechies, I., Alex, G., Yves, M., "Painless nonorthogonal expansions", Journal of Mathematical Physics, 27(5), pp. 1271-1283, (1986).
28- Pudjihartono, N., Fadason, T., Kempa-Liehr, A. W., O'Sullivan J.M., “A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction”, Frontiers in Bioinformatics, vol 2, (2022). https://doi.org/10.3389/fbinf.2022.927312 29- Bommert, A., Welchowski, T., Schmid, M., Rahnenführer, J., "Benchmark of filter methods for feature selection in high-dimensional gene expression survival data." Briefings in Bioinformatics, 23(1), bbab354, (2022). https://doi.org/10.1093/bib/bbab354
30- Zhang, Y., et al., "Alzheimer’s disease multiclass diagnosis via multimodal neuroimaging embedding feature selection and fusion." Information Fusion, 66, pp. 170-183, (2021).
31- Shehu, Ina., "Testing Statistical Hypothesis on Learning Effectiviness: pre-and post-COVID 19." South East European Journal of Sustainable Development, 6(2), (2022).
32- Yu, Z., et al., "Beyond t test and ANOVA: applications of mixed-effects models for more rigorous statistical analysis in neuroscience research.", Neuron, 110(1), pp. 21-35, (2022). DOI: 10.1016/j.neuron.2021.10.030
33- Hatwell, J., Gaber, M.M., Azad, R.M.A., "gbt-hips: Explaining the classifications of gradient boosted tree ensembles.", Applied Sciences, 11(6). Pp. 2511, (2021). https://doi.org/10.3390/app11062511
34- Mosavi, A.,et al., "Ensemble boosting and bagging based machine learning models for groundwater potential prediction.", Water Resources Management , 35, pp. 23-37, (2021).
35- Khan, Y. F., Kaushik, B., Chowdhary C.L., Srivastava, G, "Ensemble model for diagnostic classification of Alzheimer’s disease based on brain anatomical magnetic resonance imaging.", Diagnostics, 12(12), pp. 3193, (2022). https://doi.org/10.3390/diagnostics12123193
36- Goudjil, M., Koudil, M., Bedda, M. et al. “A Novel Active Learning Method Using SVM for Text Classification.” Int. J. Autom. Comput., 15, pp. 290–298, (2018). https://doi.org/10.1007/s11633-015-0912-z
37- Khan, Y.F., Kaushik, B., Mir, B.A., "Computational Intelligent Models for Alzheimer's Prediction Using Audio Transcript Data." Computing and Informatics 41(6), pp. 1589-1624.
https://doi.org/10.31577/cai_2022_6_1589
38- Vapnik, V. N., "An overview of statistical learning theory." IEEE Transactions on Neural Networks, 10(5), pp. 988-999, (1999).
39- Pisner, D.A., Schnyer, D.M., Support vector machine. In Machine Learning; Elsevier: Amsterdam, The Netherlands, pp. 101–121, (2020).
Files
IssueVol 12 No 4 (2025) QRcode
SectionOriginal Article(s)
DOI https://doi.org/10.18502/fbt.v12i4.19811
Keywords
Lung Cancer, LIDC, Radiomics, SVM, Feature Selection, Machine Learning.

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
1.
Nissar A, Mir AH. Evaluation of Radiomics and Machine Learning for Classifying Pulmonary Nodules in CT Images. Frontiers Biomed Technol. 2025;12(4):742-756.