CORRELATION OF COVID-19 TO LUNG INFECTIONS AND PREDICTION OF LUNG INFECTIONS IN COVID-19 PATIENTS IN IRAQ USING DATA MINING METHODS

Authors

  • Shivan S. Alomadi Department of Computer Science, College of Science, University of Duhok, Kurdistan Region-Iraq
  • Jihan A. Ahmed Rasool Department of Computer Science, College of Science, University of Duhok, Kurdistan Region-Iraq

DOI:

https://doi.org/10.25271/sjuoz.2024.12.2.1273

Keywords:

Covid-19, lung infections, Iraq, correlation, data mining, machine learning, Bagging, Boosting, Naïve Bayes, K-Nearest Neighbour, J48 decision tree, Random Resonance Theory, Binary Logistic Regression

Abstract

The Covid-19 pandemic emerged as an unforeseen global crisis, exerting a profound influence on various aspects of human life. Hence, the need for collaborative efforts and scholarly investigations to address and alleviate the challenges arising from this crisis is crucial. One notable concern pertains to lung infections, which are recognized as a highly perilous consequence of the aforementioned virus. Thus, this study aims to investigate the potential correlation between Covid-19 and lung infections, and test the efficacy of various algorithms in predicting lung infections amongst Covid-19 patients. For this purpose, data has been procured from multiple health institutions in Iraq. Using this data, a robust correlation between Covid-19 and lung infection cases was found and the bagging, boosting, naïve Bayes, K-Nearest Neighbour, J48, random forest, PART, and logistic regression algorithms showcased a high accuracy in prediction lung infection in Covid-19 patients, with naïve Bayes achieving the highest accuracy of 93.41 percent.

References

Abdulhafedh, A. (2017). Incorporating the Multinomial Logistic Regression in Vehicle Crash Severity Modeling: A Detailed Overview. Journal of Transportation Technologies, 07(03), 279–303. https://doi.org/10.4236/jtts.2017.73019

Abdulrahman, M.S., and Rasool, J.A. (2020). Using Data Mining Algorithms to Predict Recommendations on Products. International Journal of Advanced Science and Technology, 29(3), 4370 - 4381. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/5263

Altman, N. S. (1992). An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. The American Statistician, 46(3), 175–185. https://doi.org/10.1080/00031305.1992.10475879

Annoni, A. D., Conte, E., Mancini, M. E., Gigante, C., Agalbato, C., Formenti, A., Muscogiuri, G., Mushtaq, S., Guglielmo, M., Baggiano, A., Bonomi, A., Pepi, M., Pontone, G., and Andreini, D. (2021). Quantitative Evaluation of COVID-19 Pneumonia Lung Extension by Specific Software and Correlation with Patient Clinical Outcome. Diagnostics, 11(2), 265. https://doi.org/10.3390/diagnostics11020265

Bhargava, N., Sharma, G., Bhargava, R., and Mathuria, M. (2013). Decision Tree Analysis on J48 Algorithm for Data Mining. Computer Science and Software Engineering. https://www.academia.edu/4375403/Decision_Tree_Analysis_on_J48_Algorithm_for_Data_Mining

Biau, G., and Scornet, E. (2016). A random forest guided tour. TEST, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7

Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/a:1010933404324

Cao, Y., and Wu, J. (2002). Projective ART for clustering data sets in high dimensional spaces. Neural Networks, 15(1), 105–120. https://doi.org/10.1016/s0893-6080(01)00108-3

Chen, L.-D., Zhang, Z.-Y., Wei, X.-J., Cai, Y.-Q., Yao, W.-Z., Wang, M.-H., Huang, Q.-F., and Zhang, X.-B. (2020). Association between cytokine profiles and lung injury in COVID-19 pneumonia. Respiratory Research, 21(1). https://doi.org/10.1186/s12931-020-01465-2

Chen, R.-C., and Chuang, C.-H. (2008). Automating construction of a domain ontology using a projective adaptive resonance theory neural network and Bayesian network. Expert Systems, 25(4), 414–430. https://doi.org/10.1111/j.1468-0394.2008.00476.x

Dawoud, M. M., Dawoud, T. M., Ali, N. Y. A., and Nagy, H. A. (2020). Chest CT in COVID-19 pneumonia: a correlation of lung abnormalities with duration and severity of symptoms. Egyptian Journal of Radiology and Nuclear Medicine, 51(1). https://doi.org/10.1186/s43055-020-00359-z

Duzgun, S. A., Durhan, G., Demirkazik, F. B., Akpinar, M. G., and Ariyurek, O. M. (2020). COVID-19 pneumonia: the great radiological mimicker. Insights into Imaging, 11(1). https://doi.org/10.1186/s13244-020-00933-z

Francone, M., Iafrate, F., Masci, G. M., Coco, S., Cilia, F., Manganaro, L., Panebianco, V., Andreoli, C., Colaiacomo, M. C., Zingaropoli, M. A., Ciardi, M. R., Mastroianni, C. M., Pugliese, F., Alessandri, F., Turriziani, O., Ricci, P., and Catalano, C. (2020). Chest CT score in COVID-19 patients: correlation with disease severity and short-term prognosis. European Radiology,30(12),6808–6817. https://doi.org/10.1007/s00330-020-07033-y

GÜNER, R., HASANOĞLU, İ., and AKTAŞ, F. (2020). COVID-19: Prevention and control measures in community. TURKISH JOURNAL OF MEDICAL SCIENCES, 50(SI-1), 571–577. https://doi.org/10.3906/sag-2004-146

Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., and Witten, I. H. (2009). The WEKA data mining software. ACM SIGKDD Explorations Newsletter, 11(1), 10–18. https://doi.org/10.1145/1656274.1656278

Harapan, H., Itoh, N., Yufika, A., Winardi, W., Keam, S., Te, H., Megawati, D., Hayati, Z., Wagner, A. L., and Mudatsir, M. (2020). Coronavirus disease 2019 (COVID-19): A literature review. Journal of Infection and Public Health, 13(5), 667–673. https://doi.org/10.1016/j.jiph.2020.03.019

Harding, J. H., Shahbaz, M., Srinivas, and Kusiak, A. (2006). Data Mining in Manufacturing: A Review. Journal of Manufacturing Science and Engineering-Transactions of The Asme, 128(4), 969–976. https://doi.org/10.1115/1.2194554

Hinton, P., McMurray, I., and Brownlow, C. (2014). SPSS Explained. Taylor and Francis.

Hussain, S., Muhammad, L. J., Ishaq, F. S., Yakubu, A., and Mohammed, I. A. (2019). Performance Evaluation of Various Data Mining Algorithms on Road Traffic Accident Dataset. Information and Communication Technology for Intelligent Systems, 67–78. https://doi.org/10.1007/978-981-13-1742-2_7

MacIntyre, C. R., and Wang, Q. (2020). Physical distancing, face masks, and eye protection for prevention of COVID-19. The Lancet, 395(10242), 1950–1951. https://doi.org/10.1016/s0140-6736(20)31183-1

Mahammedi, A., Ramos, A., Bargalló, N., Gaskill, M., Kapur, S., Saba, L., Carrete, H., Sengupta, S., Salvador, E., Hilario, A., Revilla, Y., Sanchez, M., Perez-Nuñez, M., Bachir, S., Zhang, B., Oleaga, L., Sergio, J., Koren, L., Martin-Medina, P., … Vagal, A. (2021). Brain and Lung Imaging Correlation in Patients with COVID-19: Could the Severity of Lung Disease Reflect the Prevalence of Acute Abnormalities on Neuroimaging? A Global Multicenter Observational Study. American Journal of Neuroradiology, 42(6), 1008–1016. https://doi.org/10.3174/ajnr.a7072

Maimon, O., and Rokach, L. (2005). Data mining and knowledge discovery handbook. Choice Reviews Online, 48(10), 48–5729. https://doi.org/10.5860/choice.48-5729

McHugh, M. M. (2013). The Chi-square test of independence. Biochemia Medica, 143–149. https://doi.org/10.11613/bm.2013.018

Olivieri, D., and Scoditti, E. (2005). Impact of environmental factors on lung defences. European Respiratory Review, 14(95), 51–56. https://doi.org/10.1183/09059180.05.00009502

Rasool, J. A. A. (2018). Analysis the Relationship between Social Media and Education System in Kurdistan region of Iraq Using Chi-Square Test. Academic Journal of Nawroz University, 7(4), 133–138. https://doi.org/10.25007/ajnu.v7n4a282

Santoso, P., Fauziah, F., and Nurhayati, N. (2020). Application Of Data Mining Classification For COVID-19 Infected Status Using Algortima Naïve Method. Jurnal Mantik, 4(1), 267–275. http://iocscience.org/ejournal/index.php/mantik/article/view/740

Skopljanac, I., Ivelja, M. P., Barcot, O., Brdar, I., Dolic, K., Polasek, O., and Radic, M. (2021). Role of Lung Ultrasound in Predicting Clinical Severity and Fatality in COVID-19 Pneumonia. Journal of Personalized Medicine, 11(8), 757. https://doi.org/10.3390/jpm11080757

Talib, H. J. (2021). Predicting the Correct Procedure of COVID-19 Patients in Hospitals Using Machine Learning. M.Sc. Thesis, Applied science private university, Amman-Jordan.

Tung-Chen, Y., de Gracia, M., Díez-Tascón, A., Alonso-González, R., Agudo-Fernández, S., Parra-Gordo, M. L., Ossaba-Vélez, S., Rodríguez-Fuertes, P., and Llamas-Fuentes, R. (2020). Correlation between Chest Computed Tomography and Lung Ultrasonography in Patients with Coronavirus Disease 2019 (COVID-19). Ultrasound in Medicine and Biology, 46(11), 2918–2926. https://doi.org/10.1016/j.ultrasmedbio.2020.07.003

Vieira, J. M., Ricardo, O. M. de P., Hannas, C. M., Kanadani, T. C. M., Prata, T. dos S., and Kanadani, F. N. (2020). What do we know about COVID-19? A review article. Revista Da Associação Médica Brasileira, 66(4), 534–540. https://doi.org/10.1590/1806-9282.66.4.534

Wang, X., Che, Q., Ji, X., Meng, X., Zhang, L., Jia, R., Lyu, H., Bai, W., Tan, L., and Gao, Y. (2021). Correlation between lung infection severity and clinical laboratory indicators in patients with COVID-19: a cross-sectional study based on machine learning. BMC Infectious Diseases, 21(1). https://doi.org/10.1186/s12879-021-05839-9

Witten, I., Frank, E., and Hall, M. (2011). Data Mining: Practical Machine Learning Tools and Techniques (The Morgan Kaufmann Series in Data Management Systems) (3rd ed.). Morgan Kaufmann.

Wu, B., Wang, X., Shen, H., and Zhou, X. (2012). Feature selection based on max–min-associated indices for classification of remotely sensed imagery. International Journal of Remote Sensing, 33(17), 5492–5512. https://doi.org/10.1080/01431161.2012.663111

Yağmur, A. R., Akbal Çufalı, Ş., Aypak, A., Köksal, M., Güneş, Y. C., and Özcan, K. M. (2021). Correlation of olfactory dysfunction with lung involvement and severity of COVID-19. Irish Journal of Medical Science (1971 -), 191(4), 1843–1848. https://doi.org/10.1007/s11845-021-02732-x

Yu, F., Du, L., Ojcius, D. M., Pan, C., and Jiang, S. (2020). Measures for diagnosing and treating infections by a novel coronavirus responsible for a pneumonia outbreak originating in Wuhan, China. Microbes and Infection, 22(2), 74–79. https://doi.org/10.1016/j.micinf.2020.01.003

Downloads

Published

2024-06-23

How to Cite

Alomadi, S. S., & Ahmed Rasool, J. A. (2024). CORRELATION OF COVID-19 TO LUNG INFECTIONS AND PREDICTION OF LUNG INFECTIONS IN COVID-19 PATIENTS IN IRAQ USING DATA MINING METHODS. Science Journal of University of Zakho, 12(2), 244–249. https://doi.org/10.25271/sjuoz.2024.12.2.1273

Issue

Section

Science Journal of University of Zakho