CORRELATION OF COVID-19 TO LUNG INFECTIONS AND PREDICTION OF LUNG INFECTIONS IN COVID-19 PATIENTS IN IRAQ USING DATA MINING METHODS
DOI:
https://doi.org/10.25271/sjuoz.2024.12.2.1273Keywords:
Covid-19, lung infections, Iraq, correlation, data mining, machine learning, Bagging, Boosting, Naïve Bayes, K-Nearest Neighbour, J48 decision tree, Random Resonance Theory, Binary Logistic RegressionAbstract
The Covid-19 pandemic emerged as an unforeseen global crisis, exerting a profound influence on various aspects of human life. Hence, the need for collaborative efforts and scholarly investigations to address and alleviate the challenges arising from this crisis is crucial. One notable concern pertains to lung infections, which are recognized as a highly perilous consequence of the aforementioned virus. Thus, this study aims to investigate the potential correlation between Covid-19 and lung infections, and test the efficacy of various algorithms in predicting lung infections amongst Covid-19 patients. For this purpose, data has been procured from multiple health institutions in Iraq. Using this data, a robust correlation between Covid-19 and lung infection cases was found and the bagging, boosting, naïve Bayes, K-Nearest Neighbour, J48, random forest, PART, and logistic regression algorithms showcased a high accuracy in prediction lung infection in Covid-19 patients, with naïve Bayes achieving the highest accuracy of 93.41 percent.
References
Abdulhafedh, A. (2017). Incorporating the Multinomial Logistic Regression in Vehicle Crash Severity Modeling: A Detailed Overview. Journal of Transportation Technologies, 07(03), 279–303. https://doi.org/10.4236/jtts.2017.73019
Abdulrahman, M.S., and Rasool, J.A. (2020). Using Data Mining Algorithms to Predict Recommendations on Products. International Journal of Advanced Science and Technology, 29(3), 4370 - 4381. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/5263
Altman, N. S. (1992). An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. The American Statistician, 46(3), 175–185. https://doi.org/10.1080/00031305.1992.10475879
Annoni, A. D., Conte, E., Mancini, M. E., Gigante, C., Agalbato, C., Formenti, A., Muscogiuri, G., Mushtaq, S., Guglielmo, M., Baggiano, A., Bonomi, A., Pepi, M., Pontone, G., and Andreini, D. (2021). Quantitative Evaluation of COVID-19 Pneumonia Lung Extension by Specific Software and Correlation with Patient Clinical Outcome. Diagnostics, 11(2), 265. https://doi.org/10.3390/diagnostics11020265
Bhargava, N., Sharma, G., Bhargava, R., and Mathuria, M. (2013). Decision Tree Analysis on J48 Algorithm for Data Mining. Computer Science and Software Engineering. https://www.academia.edu/4375403/Decision_Tree_Analysis_on_J48_Algorithm_for_Data_Mining
Biau, G., and Scornet, E. (2016). A random forest guided tour. TEST, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7
Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/a:1010933404324
Cao, Y., and Wu, J. (2002). Projective ART for clustering data sets in high dimensional spaces. Neural Networks, 15(1), 105–120. https://doi.org/10.1016/s0893-6080(01)00108-3
Chen, L.-D., Zhang, Z.-Y., Wei, X.-J., Cai, Y.-Q., Yao, W.-Z., Wang, M.-H., Huang, Q.-F., and Zhang, X.-B. (2020). Association between cytokine profiles and lung injury in COVID-19 pneumonia. Respiratory Research, 21(1). https://doi.org/10.1186/s12931-020-01465-2
Chen, R.-C., and Chuang, C.-H. (2008). Automating construction of a domain ontology using a projective adaptive resonance theory neural network and Bayesian network. Expert Systems, 25(4), 414–430. https://doi.org/10.1111/j.1468-0394.2008.00476.x
Dawoud, M. M., Dawoud, T. M., Ali, N. Y. A., and Nagy, H. A. (2020). Chest CT in COVID-19 pneumonia: a correlation of lung abnormalities with duration and severity of symptoms. Egyptian Journal of Radiology and Nuclear Medicine, 51(1). https://doi.org/10.1186/s43055-020-00359-z
Duzgun, S. A., Durhan, G., Demirkazik, F. B., Akpinar, M. G., and Ariyurek, O. M. (2020). COVID-19 pneumonia: the great radiological mimicker. Insights into Imaging, 11(1). https://doi.org/10.1186/s13244-020-00933-z
Francone, M., Iafrate, F., Masci, G. M., Coco, S., Cilia, F., Manganaro, L., Panebianco, V., Andreoli, C., Colaiacomo, M. C., Zingaropoli, M. A., Ciardi, M. R., Mastroianni, C. M., Pugliese, F., Alessandri, F., Turriziani, O., Ricci, P., and Catalano, C. (2020). Chest CT score in COVID-19 patients: correlation with disease severity and short-term prognosis. European Radiology,30(12),6808–6817. https://doi.org/10.1007/s00330-020-07033-y
GÜNER, R., HASANOĞLU, İ., and AKTAŞ, F. (2020). COVID-19: Prevention and control measures in community. TURKISH JOURNAL OF MEDICAL SCIENCES, 50(SI-1), 571–577. https://doi.org/10.3906/sag-2004-146
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., and Witten, I. H. (2009). The WEKA data mining software. ACM SIGKDD Explorations Newsletter, 11(1), 10–18. https://doi.org/10.1145/1656274.1656278
Harapan, H., Itoh, N., Yufika, A., Winardi, W., Keam, S., Te, H., Megawati, D., Hayati, Z., Wagner, A. L., and Mudatsir, M. (2020). Coronavirus disease 2019 (COVID-19): A literature review. Journal of Infection and Public Health, 13(5), 667–673. https://doi.org/10.1016/j.jiph.2020.03.019
Harding, J. H., Shahbaz, M., Srinivas, and Kusiak, A. (2006). Data Mining in Manufacturing: A Review. Journal of Manufacturing Science and Engineering-Transactions of The Asme, 128(4), 969–976. https://doi.org/10.1115/1.2194554
Hinton, P., McMurray, I., and Brownlow, C. (2014). SPSS Explained. Taylor and Francis.
Hussain, S., Muhammad, L. J., Ishaq, F. S., Yakubu, A., and Mohammed, I. A. (2019). Performance Evaluation of Various Data Mining Algorithms on Road Traffic Accident Dataset. Information and Communication Technology for Intelligent Systems, 67–78. https://doi.org/10.1007/978-981-13-1742-2_7
MacIntyre, C. R., and Wang, Q. (2020). Physical distancing, face masks, and eye protection for prevention of COVID-19. The Lancet, 395(10242), 1950–1951. https://doi.org/10.1016/s0140-6736(20)31183-1
Mahammedi, A., Ramos, A., Bargalló, N., Gaskill, M., Kapur, S., Saba, L., Carrete, H., Sengupta, S., Salvador, E., Hilario, A., Revilla, Y., Sanchez, M., Perez-Nuñez, M., Bachir, S., Zhang, B., Oleaga, L., Sergio, J., Koren, L., Martin-Medina, P., … Vagal, A. (2021). Brain and Lung Imaging Correlation in Patients with COVID-19: Could the Severity of Lung Disease Reflect the Prevalence of Acute Abnormalities on Neuroimaging? A Global Multicenter Observational Study. American Journal of Neuroradiology, 42(6), 1008–1016. https://doi.org/10.3174/ajnr.a7072
Maimon, O., and Rokach, L. (2005). Data mining and knowledge discovery handbook. Choice Reviews Online, 48(10), 48–5729. https://doi.org/10.5860/choice.48-5729
McHugh, M. M. (2013). The Chi-square test of independence. Biochemia Medica, 143–149. https://doi.org/10.11613/bm.2013.018
Olivieri, D., and Scoditti, E. (2005). Impact of environmental factors on lung defences. European Respiratory Review, 14(95), 51–56. https://doi.org/10.1183/09059180.05.00009502
Rasool, J. A. A. (2018). Analysis the Relationship between Social Media and Education System in Kurdistan region of Iraq Using Chi-Square Test. Academic Journal of Nawroz University, 7(4), 133–138. https://doi.org/10.25007/ajnu.v7n4a282
Santoso, P., Fauziah, F., and Nurhayati, N. (2020). Application Of Data Mining Classification For COVID-19 Infected Status Using Algortima Naïve Method. Jurnal Mantik, 4(1), 267–275. http://iocscience.org/ejournal/index.php/mantik/article/view/740
Skopljanac, I., Ivelja, M. P., Barcot, O., Brdar, I., Dolic, K., Polasek, O., and Radic, M. (2021). Role of Lung Ultrasound in Predicting Clinical Severity and Fatality in COVID-19 Pneumonia. Journal of Personalized Medicine, 11(8), 757. https://doi.org/10.3390/jpm11080757
Talib, H. J. (2021). Predicting the Correct Procedure of COVID-19 Patients in Hospitals Using Machine Learning. M.Sc. Thesis, Applied science private university, Amman-Jordan.
Tung-Chen, Y., de Gracia, M., Díez-Tascón, A., Alonso-González, R., Agudo-Fernández, S., Parra-Gordo, M. L., Ossaba-Vélez, S., Rodríguez-Fuertes, P., and Llamas-Fuentes, R. (2020). Correlation between Chest Computed Tomography and Lung Ultrasonography in Patients with Coronavirus Disease 2019 (COVID-19). Ultrasound in Medicine and Biology, 46(11), 2918–2926. https://doi.org/10.1016/j.ultrasmedbio.2020.07.003
Vieira, J. M., Ricardo, O. M. de P., Hannas, C. M., Kanadani, T. C. M., Prata, T. dos S., and Kanadani, F. N. (2020). What do we know about COVID-19? A review article. Revista Da Associação Médica Brasileira, 66(4), 534–540. https://doi.org/10.1590/1806-9282.66.4.534
Wang, X., Che, Q., Ji, X., Meng, X., Zhang, L., Jia, R., Lyu, H., Bai, W., Tan, L., and Gao, Y. (2021). Correlation between lung infection severity and clinical laboratory indicators in patients with COVID-19: a cross-sectional study based on machine learning. BMC Infectious Diseases, 21(1). https://doi.org/10.1186/s12879-021-05839-9
Witten, I., Frank, E., and Hall, M. (2011). Data Mining: Practical Machine Learning Tools and Techniques (The Morgan Kaufmann Series in Data Management Systems) (3rd ed.). Morgan Kaufmann.
Wu, B., Wang, X., Shen, H., and Zhou, X. (2012). Feature selection based on max–min-associated indices for classification of remotely sensed imagery. International Journal of Remote Sensing, 33(17), 5492–5512. https://doi.org/10.1080/01431161.2012.663111
Yağmur, A. R., Akbal Çufalı, Ş., Aypak, A., Köksal, M., Güneş, Y. C., and Özcan, K. M. (2021). Correlation of olfactory dysfunction with lung involvement and severity of COVID-19. Irish Journal of Medical Science (1971 -), 191(4), 1843–1848. https://doi.org/10.1007/s11845-021-02732-x
Yu, F., Du, L., Ojcius, D. M., Pan, C., and Jiang, S. (2020). Measures for diagnosing and treating infections by a novel coronavirus responsible for a pneumonia outbreak originating in Wuhan, China. Microbes and Infection, 22(2), 74–79. https://doi.org/10.1016/j.micinf.2020.01.003
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Shivan S. Alomadi, Jihan A. Ahmed Rasool
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License [CC BY-NC-SA 4.0] that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work, with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online.