Document Type: Original Research

Authors

1 PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran

2 PhD Candidate, Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran

3 MD, PhD, Department of obstetrics and gynaecology, preventative gynaecology Research centre, Shahid Beheshti University of Medical Sciences Tehran, Iran

Abstract

Background: Compared to other genital cancers, cervical cancer is the most prevalent and the main cause of mortality in females in third-world countries, affected by different factors, including smoking, poor nutritional status, immune-deficiency, long-term use of contraceptives and so on.
Objective: The present study was conducted to predict cervical cancer and identify its important predictors using machine learning classification algorithms.
Material and Methods: In a cross-sectional study, the data of 145 patients with 23 attributes, which referred to Shohada Hospital Tehran, Iran during 2017–2018, were analyzed by machine learning classification algorithms which included SVM, QUEST, C&R tree, MLP and RBF. The criteria measurement used to evaluate these algorithms included accuracy, sensitivity, specificity and area under the curve (AUC).
Results: The accuracy, sensitivity, specificity and AUC of Quest and C&R tree were, respectively 95.55, 90.48, 100, and 95.20, 95.55, 90.48, 100, and 95.20, those of RBF 95.45, 90.00, 100 and 91.50, those of SVM 93.33, 90.48, 95.83 and 95.80 and those of MLP 90.90, 90.00, 91.67 and 91.50 percentage. The important predictors in all the algorithms were found to comprise personal health level, marital status, social status, the dose of contraceptives used, level of education and number of caesarean deliveries.
Conclusion: This investigation confirmed that ML can enhance the prediction of cervical cancer. The results of this study showed that Decision Tree algorithms can be applied to identify the most relevant predictors. Moreover, it seems that improving personal health and socio-cultural level of patients can be causing cervical cancer prevention.

Keywords

  1. Jaime G, Morales-Vásquez F, Meneses-Garcia A, editors. Cervical cancer. Springer; 2017. doi: 10.1007/978-3-319-45231-9.
  2. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. A Cancer Journal for Clinicians. 2018;68(6):394-424. doi: 10.3322/caac.21492.
  3. De Martel C, Plummer M, Vignat J, Franceschi S. Worldwide burden of cancer attributable to HPV by site, country and HPV type. Int J Cancer. 2017;141(4):664-70. doi: 10.1002/ijc.30716.
  4. Momenimovahed Z, Salehiniya H. Cervical cancer in Iran: integrative insights of epidemiological analysis. BioMedicine. 2018;8(3):18. doi: 10.1051/bmdcn/2018080318. PubMed PMID: 30141405. PubMed PMCID: PMC6108227.
  5. Globocan W. Estimated cancer incidence and mortality and prevalenc worldwide in 2018. Globocan; 2018 [cited 11 March 2019]. Available from: http://gco.iarc.fr/today/data/factsheets/populations/364-iran-islamic-republic-of-fact-sheets.pdf.
  6. Burchell AN, Winer RL, De Sanjosé S, Franco EL. Epidemiology and transmission dynamics of genital HPV infection. Vaccine. 2006;24:S52-61. doi: 10.1016/j.vaccine.2006.05.031.
  7. Aminisani N, Armstrong BK, Canfell K. Cervical cancer screening in Middle Eastern and Asian migrants to Australia: a record linkage study. Cancer epidemiology. 2012;36(6):e394-400. doi: 10.1016/j.canep.2012.08.009.
  8. Mohaghegh F, Ahmadlou M. A Study of the Prevalence of Cervical Cancer among Married Wemon in Arak, 2013. Journal of Arak university of Medical Sciences. 2015;18(4):65-70.
  9. Kurniawati YE, Permanasari AE, Fauziati S. Comparative study on data mining classification methods for cervical cancer prediction using pap smear results. 1st International Conference on Biomedical Engineering (IBIOMED); Yogyakarta, Indonesia: IEEE; 2016. p. 1-5. doi: 10.1109/IBIOMED.2016.7869827.
  10. Do HH, Taylor VM, Yasui Y, Jackson JC, Tu SP. Cervical cancer screening among Chinese immigrants in Seattle, Washington. Journal of immigrant health. 2001;3(1):15-21. doi: 10.1023/A:1026606401164.
  11. Ramondetta L. What is the appropriate approach to treating women with incurable cervical cancer?. Journal of the National Comprehensive Cancer Network. 2013;11(3):348-55.
  12. Esmati E, Kalaghchi B. Uterine cervix carcinoma: pathologic characteristics, treatment and follow-up evaluation.TUMJ. 2008;65(11):55-9.
  13. Le Borgne G, Mercier M, Woronoff AS, Guizard AV, Abeilard E, Caravati-Jouvenceaux A, et al. Quality of life in long-term cervical cancer survivors: a population-based study. Gynecologic oncology. 2013;129(1):222-8. doi: 10.1016/j.ygyno.2012.12.033.
  14. Latha DS, Lakshmi PV, Fathima S. Staging Prediction in Cervical Cancer Patients–A Machine Learning Approach. International Journal of Innovative Research and Practices. 2014;2(2):14-23.
  15. Mandelblatt J, Andrews H, Kerner J, Zauber A, Burnett W. Determinants of late stage diagnosis of breast and cervical cancer: the impact of age, race, social class, and hospital type. Am J Public Health. 1991;81(5):646-9. doi: 10.2105/ajph.81.5.646. PubMed PMID: 2014871. PubMed PMCID: PMC1405079.
  16. Workowski KA, Bolan GA. Sexually Transmitted Diseases Treatment Guidelines, 2015. Morbidity and Mortality Weekly Report (MMWR); Atlant: CDC; 2015. p. 1-137.
  17. Schiffman M, Wentzensen N. A suggested approach to simplify and improve cervical screening in the United States. Journal of lower genital tract disease. 2016;20(1):1. doi: 10.1097/LGT.0000000000000170. PubMed PMID: 26704326. PubMed PMCID: PMC4692178.
  18. Dietrich AJ, Tobin JN, Cassells A, Robinson CM, Greene MA, Sox CH, Beach ML, DuHamel KN, Younge RG. Telephone care management to improve cancer screening among low-income women: a randomized, controlled trial. Annals of internal medicine. 2006;144(8):563-71. doi: 10.7326/0003-4819-144-8-200604180-00006.
  19. Kusy M, Obrzut B, Kluska J. Application of gene expression programming and neural networks to predict adverse events of radical hysterectomy in cervical cancer patients. Medical & biological engineering & computing. 2013;51(12):1357-65. doi: 10.1007/s11517-013-1108-8.
  20. Sokouti B, Haghipour S, Tabrizi AD. A framework for diagnosing cervical cancer disease based on feedforward MLP neural network and ThinPrep histopathological cell image features. Neural Computing and Applications. 2014;24(1):221-32. doi: 10.1007/s00521-012-1220-y.
  21. Qiu X, Tao N, Tan Y, Wu X. Constructing of the risk classification model of cervical cancer by artificial neural network. Expert Systems with Applications. 2007;32(4):1094-9. doi: 10.1016/j.eswa.2006.02.020.
  22. Benazir B, Nagarajan A. An Expert System for Predicting the Cervical Cancer using Data Mining Techniques. International Journal of Pure and Applied Mathematics. 2018;118(20):1971-87.
  23. Tseng CJ, Lu CJ, Chang CC, Chen GD. Application of machine learning to predict the recurrence-proneness for cervical cancer. Neural Computing and Applications. 2014;24(6):1311-6. doi: 10.1007/s00521-013-1359-1.
  24. Fatlawi HK. Enhanced classification model for cervical cancer dataset based on cost sensitive classifier. International Journal of Computer Techniques. 2017;4(4):115-20.
  25. Chang CC, Cheng SL, Lu CJ, Liao KH. Prediction of recurrence in patients with cervical cancer using mars and classification. International Journal of Machine Learning and Computing. 2013;3(1):75.
  26. Alam TM, Khan MM, Iqbal MA, Abdul W, Mushtaq M. Cervical cancer prediction through different screening methods using data mining. International Journal of Advanced Computer Science and Applications. 2019;10(2):388-96. doi: 10.14569/IJACSA.2019.0100251.
  27. Zhang J, Liu Y. Cervical cancer detection using SVM based feature screening. International Conference on Medical Image Computing and Computer-Assisted Intervention; Berlin, Heidelberg: Springer; 2004. p. 873-880.
  28. Mukhopadhyay S, Kurmi I, Dey R, Das NK, Pradhan S, Pradhan A, Ghosh N, et al. Optical diagnosis of colon and cervical cancer by support vector machine. Biophotonics: Photonic Solutions for Better Health Care V, 98870U; Brussels, Belgium: International Society for Optics and Photonics; 2016. doi: 10.1117/12.2227316.
  29. Nematollahi M, Akbari R, Nikeghbalian S, Salehnasab C. Classification models to predict survival of kidney transplant recipients using two intelligent techniques of data mining and logistic regression. International journal of organ transplantation medicine. 2017;8(2):119-122. PubMed PMID: 28959387. PubMed PMCID: PMC5611541.
  30. Rezaianzadeh A, Dastoorpoor M, Sanaei M, Salehnasab C, Mohammadi MJ, Mousavizadeh A. Predictors of length of stay in the coronary care unit in patient with acute coronary syndrome based on data mining methods. Clinical Epidemiology and Global Health. 2019. doi: 10.1016/j.cegh.2019.09.007.
  31. Haykin S. Neural Networks: A comprehensive Foundation. New Jersey: Prentice Hall PTR Upper Saddle River; 1999;7458:161-75.
  32. West D. Neural network credit scoring models. Computers & Operations Research. 2000;27(11-12):1131-52. doi: 10.1016/S0305-0548(99)00149-5.
  33. Quinlan JR. Induction of decision trees. Machine learning. 1986;1(1):81-106. doi: 10.1007/BF00116251.
  34. Han J, Pei J, Kamber M. Data mining: concepts and techniques. Elsevier; 2011.
  35. Kim JW, Lee BH, Shaw MJ, Chang HL, Nelson M. Application of decision-tree induction techniques to personalized advertisements on internet storefronts. International Journal of Electronic Commerce. 2001;5(3):45-62. doi: 10.1080/10864415.2001.11044215.
  36. Burges CJ. A tutorial on support vector machines for pattern recognition. Data mining and knowledge discovery. 1998;2(2):121-67. doi: 10.1023/A:1009715923555.
  37. Burges CJ, Smola AJ, Scholkopf B. Advances in kernel methods. Support Vector Learning. London, England: MIT press; 1999.
  38. Schiffman M, Castle PE, Jeronimo J, Rodriguez AC, Wacholder S. Human papillomavirus and cervical cancer. The Lancet. 2007;370(9590):890-907. doi: 10.1016/S0140-6736(07)61416-0.
  39. Vidya R, Nasira GM. Knowledge extraction in medical data mining: a case based reasoning for gynecological cancer an expert diagnostic method. ARPN Journal of Engineering and Applied Sciences. 2006;10(9):3997-4001.
  40. Hemalatha K, Rani DU. Improvement of multilayer perceptron classification on cervical pap smear data with feature extraction. International Journal of Innovative Research in Science, Engineering, and Technology. 2016:20419-24.