Optimizing Liver Disease Prediction using SMOTE Integrated Supervised Learning Model

Authors

  • P. Deepthi Department of Computer Science and Engineering, Sree Dattha Institute of Engineering and Science, Sheriguda, Hyderabad, Telangana, India Author
  • B. Gowtami Department of Computer Science and Engineering, Sree Dattha Institute of Engineering and Science, Hyderabad, Telangana, India Author
  • D. Vidyadhar Department of Computer Science and Engineering, Sree Dattha Institute of Engineering and Science, Hyderabad, Telangana, India Author
  • T. Shivanjaneya Department of Computer Science and Engineering, Sree Dattha Institute of Engineering and Science, Hyderabad, Telangana, India Author
  • V. Vinay Department of Computer Science and Engineering, Sree Dattha Institute of Engineering and Science, Hyderabad, Telangana, India Author

Keywords:

Synthetic Minority Over-sampling Technique, Machine learning¸ Data Balancing, Liver Diseases.

Abstract

 Liver disease is a major worldwide health issue that affects millions of people. Prompt and precise diagnosis is crucial for efficient disease control and improved patient results. Machine learning (ML) methods have demonstrated significant potential in forecasting a range of medical disorders, including liver illnesses. Nevertheless, the efficacy of machine learning models is greatly dependent on the calibre and volume of the training data. Regrettably, numerous datasets are afflicted with class imbalance, wherein specific classes, such as diseased and non-diseased individuals, are not evenly distributed. Addressing this imbalance is critical to better the reliability of liver disease prediction using ML models, since it might result in biased predictions and reduced model accuracy. Thus, the objective of this project is to address the issue of class imbalance by utilizing sophisticated data balancing techniques. The suggested approach includes preparing the dataset using the Synthetic Minority Over-sampling Technique (SMOTE), which generates synthetic samples for the minority class, resulting in a more balanced dataset. Furthermore, it modifies the cost function of the learning process to consider the imbalance in class distribution, hence enhancing the performance of the model. After obtaining a dataset that is evenly distributed, we proceed to train a machine learning model (namely, logistic regression, support vector classifier, and gradient boosting classifier) with the purpose of predicting liver disease. The efficacy of the proposed model is assessed on a separate test dataset, employing diverse criteria like accuracy, precision, recall, and F1-score. By efficiently addressing class imbalance via data balancing algorithms, this model is anticipated to provide significant assistance to medical professionals in the early and precise diagnosis of liver illnesses, ultimately resulting in enhanced patient care and outcomes. 

Downloads

Download data is not yet available.

References

Amin, Ruhul, Rubia Yasmin, Sabba Ruhi, Md Habibur Rahman, and Md Shamim Reza.

"Prediction of chronic liver disease patients using integrated projection based statistical feature

extraction with machine learning algorithms." Informatics in Medicine Unlocked 36 (2023):

Md, Abdul Quadir, Sanika Kulkarni, Christy Jackson Joshua, Tejas Vaichole, Senthilkumar

Mohan, and Celestine Iwendi. "Enhanced Preprocessing Approach Using Ensemble Machine

Learning Algorithms for Detecting Liver Disease." Biomedicines 11, no. 2 (2023): 581.

Gupta, Ketan, Nasmin Jiwani, Neda Afreen, and D. Divyarani. "Liver Disease Prediction using

Machine learning Classification Techniques." In 2022 IEEE 11th International Conference on

Communication Systems and Network Technologies (CSNT), pp. 221-226. IEEE, 2022.

Grissa, Dhouha, Ditlev Nytoft Rasmussen, Aleksander Krag, Søren Brunak, and Lars Juhl

Jensen. "Alcoholic liver disease: A registry view on comorbidities and disease

prediction." PLoS Computational Biology 16, no. 9 (2020): e1008244.

Dritsas, Elias, and Maria Trigka. "Supervised machine learning models for liver disease risk

prediction." Computers 12, no. 1 (2023): 19.

Kumar, Divvela Vishnu Sai, Ritik Chaurasia, Anuradha Misra, Praveen Kumar Misra, and Alex

Khang. "Heart disease and liver disease prediction using machine learning." In Data-Centric

AI Solutions and Emerging Technologies in the Healthcare Ecosystem, pp. 205-214. CRC Press,

Behera, Mandakini Priyadarshani, Archana Sarangi, Debahuti Mishra, and Shubhendu Kumar

Sarangi. "A Hybrid Machine Learning algorithm for Heart and Liver Disease Prediction Using

Modified Particle Swarm Optimization with Support Vector Machine." Procedia Computer

Science 218 (2023): 818-827.

Singh, Jagdeep, Sachin Bagga, and Ranjodh Kaur. "Software-based prediction of liver disease

with feature selection and classification techniques." Procedia Computer Science 167 (2020):

-1980.

Azam, Md Shafiul, Aishe Rahman, SM Hasan Sazzad Iqbal, and Md Toukir Ahmed. "Prediction

of liver diseases by using few machine learning based approaches." Aust. J. Eng. Innov.

Technol 2, no. 5 (2020): 85-90.

Ghazal, Taher M., Aziz Ur Rehman, Muhammad Saleem, Munir Ahmad, Shabir Ahmad, and

Faisal Mehmood. "Intelligent Model to Predict Early Liver Disease using Machine Learning Technique." In 2022 International Conference on Business Analytics for Technology and

Security (ICBATS), pp. 1-5. IEEE, 2022.

Hashem, Somaya, Mahmoud ElHefnawi, Shahira Habashy, Mohamed El-Adawy, Gamal

Esmat, Wafaa Elakel, Ashraf Omar Abdelazziz et al. "Machine learning prediction models for

diagnosing hepatocellular carcinoma with HCV-related chronic liver disease." Computer

methods and programs in biomedicine 196 (2020): 105551.

Khan, Rayyan Azam, Yigang Luo, and Fang-Xiang Wu. "Machine learning based liver disease

diagnosis: A systematic review." Neurocomputing 468 (2022): 492-509.

Spann, Ashley, Angeline Yasodhara, Justin Kang, Kymberly Watt, B. O. Wang, Anna

Goldenberg, and Mamatha Bhat. "Applying machine learning in liver disease and

transplantation: a comprehensive review." Hepatology 71, no. 3 (2020): 1093-1105.

Kuzhippallil, Maria Alex, Carolyn Joseph, and A. Kannan. "Comparative analysis of machine

learning techniques for indian liver disease patients." In 2020 6th International Conference on

Advanced Computing and Communication Systems (ICACCS), pp. 778-782. IEEE, 2020.

Ambesange, Sateesh, A. Vijayalaxmi, Rashmi Uppin, Shruthi Patil, and Vilaskumar Patil.

"Optimizing Liver disease prediction with Random Forest by various Data balancing

Techniques." In 2020 IEEE international conference on cloud computing in emerging markets

(CCEM), pp. 98-102. IEEE, 2020.

Downloads

Published

2024-04-30

How to Cite

Deepthi, P., Gowtami, B., Vidyadhar, D., Shivanjaneya, T., & Vinay, V. (2024). Optimizing Liver Disease Prediction using SMOTE Integrated Supervised Learning Model. History of Medicine, 10(2), 20-30. https://historymedjournal.com/HOM/index.php/medicine/article/view/740