Prostate Cancer Detection Using Gradient Boosting Machines Effectively

https://doi.org/10.22146/ijccs.107742

Muslimin MusliminB(1*), Syafei Karim(2), Asep Nurhuda(3)

(1) Politeknik Pertanian Negeri Samarinda
(2) Sistem Informasi Akuntansi, Politeknik Pertanian Negeri Samarinda
(3) Teknologi Rekayasa Perangkat Lunak, Politeknik Pertanian Negeri Samarinda
(*) Corresponding Author

Abstract


Prostate cancer remains a leading cause of cancer-related deaths among men globally, emphasizing the critical need for accurate diagnostic tools. This study investigates the application of Gradient Boosting Machines (GBMs) for prostate cancer detection using a dataset with key tumor characteristics such as radius, texture, area, and symmetry. Data preprocessing included normalization, missing value handling, and the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance. The GBM model demonstrated an accuracy of 75%, with high precision (82%) and recall (88%) for malignant cases, underscoring its potential as a reliable diagnostic tool. However, the model's performance for benign cases was limited by severe class imbalance, reflected in a precision of 33% and recall of 25%. Interpretability was enhanced using SHAP values, identifying key predictors like tumor perimeter and compactness. While GBMs show promise in prostate cancer diagnostics, future research should incorporate multimodal data, advanced balancing techniques, and rigorous validation frameworks to enhance generalizability and fairness. This study highlights the value of machine learning in healthcare, contributing to improved diagnostic accuracy and patient outcomes.


Keywords


Prostate Cancer Detection; Gradient Boosting Machines; Machine Learning; Class Imbalance; SHAP Interpretability

Full Text:

PDF


References

X. Zhang et al., "Gradient Boosting for Lung Cancer Screening," IEEE Transactions on Medical Imaging, vol. 39, no. 4, pp. 1234–1245, 2023. [2] Y. Liu et al., "Data Preprocessing in Machine Learning Applications for Medical Data," IEEE Access, vol. 10, pp. 3456–3467, 2022. [3] K. Gupta and A. Sharma, "Feature Engineering Techniques in Lung Cancer Analysis," Proceedings of the IEEE International Conference on Data Science, pp. 567–573, 2021. [4] S. Mehta et al., "Hyperparameter Optimization in Gradient Boosting for Medical Applications," IEEE Transactions on Biomedical Engineering, vol. 68, no. 8, pp. 2345–2354, 2021. [5] H. Kim and M. Choi, "Explainable AI in Lung Cancer Diagnostics Using SHAP," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 19, no. 1, pp. 56–67, 2023. [6] F. Almeida et al., "Clinical Decision Support Systems with Gradient Boosting Integration," IEEE Access, vol. 10, pp. 12345–12353, 2022. [7] D. Brown et al., "Advancing Imbalanced Dataset Handling: Lessons from Lung Cancer Detection Models," IEEE Access, vol. 11, pp. 4500–4510, 2023. [8] R. Tanaka et al., "Efficient Gradient Boosting Techniques for Small Dataset Learning," Proceedings of the IEEE Symposium on Machine Learning Applications in Medicine, pp. 345–350, 2022. [9] M. Singh et al., "Handling Missing Data in Medical Datasets: A Review of Techniques," IEEE Transactions on Computational Biology and Bioinformatics, vol. 18, no. 4, pp. 765–774, 2021. [10] J. Wilson et al., "Interpretable AI for Cancer Diagnostics Using SHAP and Feature Attribution," IEEE Transactions on Artificial Intelligence, vol. 5, no. 3, pp. 234–245, 2022. [11] T. Lopez and K. Carter, "Gradient Boosting vs Random Forest: Comparative Performance in Imbalanced Datasets," IEEE Access, vol. 9, pp. 11234–11245, 2021. [12] P. Verma et al., "A Review of Ensemble Learning Techniques in Cancer Detection," IEEE Access, vol. 10, pp. 25678–25690, 2022. [13] L. Nguyen and A. Patel, "Challenges and Solutions for Imbalanced Medical Data: A Case Study," Proceedings of the IEEE International Conference on Biomedical Engineering, pp. 789–794, 2023.



DOI: https://doi.org/10.22146/ijccs.107742

Article Metrics

Abstract views : 1305 | views : 696

Refbacks

  • There are currently no refbacks.




Copyright (c) 2025 IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



Copyright of :
IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
ISSN 1978-1520 (print); ISSN 2460-7258 (online)
is a scientific journal the results of Computing
and Cybernetics Systems
A publication of IndoCEISS.
Gedung S1 Ruang 416 FMIPA UGM, Sekip Utara, Yogyakarta 55281
Fax: +62274 555133
email:ijccs.mipa@ugm.ac.id | http://jurnal.ugm.ac.id/ijccs



View My Stats1
View My Stats2