Optimizing Liver Disease Prediction using SMOTE Integrated Supervised Learning Model
Keywords:
Synthetic Minority Over-sampling Technique, Machine learning¸ Data Balancing, Liver Diseases.Abstract
Liver disease is a major worldwide health issue that affects millions of people. Prompt and precise diagnosis is crucial for efficient disease control and improved patient results. Machine learning (ML) methods have demonstrated significant potential in forecasting a range of medical disorders, including liver illnesses. Nevertheless, the efficacy of machine learning models is greatly dependent on the calibre and volume of the training data. Regrettably, numerous datasets are afflicted with class imbalance, wherein specific classes, such as diseased and non-diseased individuals, are not evenly distributed. Addressing this imbalance is critical to better the reliability of liver disease prediction using ML models, since it might result in biased predictions and reduced model accuracy. Thus, the objective of this project is to address the issue of class imbalance by utilizing sophisticated data balancing techniques. The suggested approach includes preparing the dataset using the Synthetic Minority Over-sampling Technique (SMOTE), which generates synthetic samples for the minority class, resulting in a more balanced dataset. Furthermore, it modifies the cost function of the learning process to consider the imbalance in class distribution, hence enhancing the performance of the model. After obtaining a dataset that is evenly distributed, we proceed to train a machine learning model (namely, logistic regression, support vector classifier, and gradient boosting classifier) with the purpose of predicting liver disease. The efficacy of the proposed model is assessed on a separate test dataset, employing diverse criteria like accuracy, precision, recall, and F1-score. By efficiently addressing class imbalance via data balancing algorithms, this model is anticipated to provide significant assistance to medical professionals in the early and precise diagnosis of liver illnesses, ultimately resulting in enhanced patient care and outcomes.
Downloads
References
Amin, Ruhul, Rubia Yasmin, Sabba Ruhi, Md Habibur Rahman, and Md Shamim Reza.
"Prediction of chronic liver disease patients using integrated projection based statistical feature
extraction with machine learning algorithms." Informatics in Medicine Unlocked 36 (2023):
Md, Abdul Quadir, Sanika Kulkarni, Christy Jackson Joshua, Tejas Vaichole, Senthilkumar
Mohan, and Celestine Iwendi. "Enhanced Preprocessing Approach Using Ensemble Machine
Learning Algorithms for Detecting Liver Disease." Biomedicines 11, no. 2 (2023): 581.
Gupta, Ketan, Nasmin Jiwani, Neda Afreen, and D. Divyarani. "Liver Disease Prediction using
Machine learning Classification Techniques." In 2022 IEEE 11th International Conference on
Communication Systems and Network Technologies (CSNT), pp. 221-226. IEEE, 2022.
Grissa, Dhouha, Ditlev Nytoft Rasmussen, Aleksander Krag, Søren Brunak, and Lars Juhl
Jensen. "Alcoholic liver disease: A registry view on comorbidities and disease
prediction." PLoS Computational Biology 16, no. 9 (2020): e1008244.
Dritsas, Elias, and Maria Trigka. "Supervised machine learning models for liver disease risk
prediction." Computers 12, no. 1 (2023): 19.
Kumar, Divvela Vishnu Sai, Ritik Chaurasia, Anuradha Misra, Praveen Kumar Misra, and Alex
Khang. "Heart disease and liver disease prediction using machine learning." In Data-Centric
AI Solutions and Emerging Technologies in the Healthcare Ecosystem, pp. 205-214. CRC Press,
Behera, Mandakini Priyadarshani, Archana Sarangi, Debahuti Mishra, and Shubhendu Kumar
Sarangi. "A Hybrid Machine Learning algorithm for Heart and Liver Disease Prediction Using
Modified Particle Swarm Optimization with Support Vector Machine." Procedia Computer
Science 218 (2023): 818-827.
Singh, Jagdeep, Sachin Bagga, and Ranjodh Kaur. "Software-based prediction of liver disease
with feature selection and classification techniques." Procedia Computer Science 167 (2020):
-1980.
Azam, Md Shafiul, Aishe Rahman, SM Hasan Sazzad Iqbal, and Md Toukir Ahmed. "Prediction
of liver diseases by using few machine learning based approaches." Aust. J. Eng. Innov.
Technol 2, no. 5 (2020): 85-90.
Ghazal, Taher M., Aziz Ur Rehman, Muhammad Saleem, Munir Ahmad, Shabir Ahmad, and
Faisal Mehmood. "Intelligent Model to Predict Early Liver Disease using Machine Learning Technique." In 2022 International Conference on Business Analytics for Technology and
Security (ICBATS), pp. 1-5. IEEE, 2022.
Hashem, Somaya, Mahmoud ElHefnawi, Shahira Habashy, Mohamed El-Adawy, Gamal
Esmat, Wafaa Elakel, Ashraf Omar Abdelazziz et al. "Machine learning prediction models for
diagnosing hepatocellular carcinoma with HCV-related chronic liver disease." Computer
methods and programs in biomedicine 196 (2020): 105551.
Khan, Rayyan Azam, Yigang Luo, and Fang-Xiang Wu. "Machine learning based liver disease
diagnosis: A systematic review." Neurocomputing 468 (2022): 492-509.
Spann, Ashley, Angeline Yasodhara, Justin Kang, Kymberly Watt, B. O. Wang, Anna
Goldenberg, and Mamatha Bhat. "Applying machine learning in liver disease and
transplantation: a comprehensive review." Hepatology 71, no. 3 (2020): 1093-1105.
Kuzhippallil, Maria Alex, Carolyn Joseph, and A. Kannan. "Comparative analysis of machine
learning techniques for indian liver disease patients." In 2020 6th International Conference on
Advanced Computing and Communication Systems (ICACCS), pp. 778-782. IEEE, 2020.
Ambesange, Sateesh, A. Vijayalaxmi, Rashmi Uppin, Shruthi Patil, and Vilaskumar Patil.
"Optimizing Liver disease prediction with Random Forest by various Data balancing
Techniques." In 2020 IEEE international conference on cloud computing in emerging markets
(CCEM), pp. 98-102. IEEE, 2020.
Downloads
Published
Issue
Section
License
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.