Applying Advanced Computational Models to Optimise Electronic Health Records and Enhance Clinical Decision-Making
DOI:
https://doi.org/10.48047/HM.V9.I2.2023.Keywords:
Electronic Health Records (EHRs), Machine Learning in Healthcare, Clinical Decision Support Systems (CDSS), Cardiovascular Disease Prediction, Random Forest, XGBoost, LSTM, Predictive Analytics in MedicineAbstract
Electronic Health Records (EHRs) have become a cornerstone in modern healthcare, providing structured patient data that can significantly enhance clinical decision-making. However, the manual interpretation of large-scale EHR data presents challenges in efficiency and accuracy, necessitating the integration of advanced computational models. This study explores machine learning and deep learning techniques to optimise EHR-based cardiovascular disease (CVD) prediction, leveraging data-driven insights to enhance patient risk stratification. The Cardiovascular Disease Dataset from Kaggle, comprising 70,000 patient records, was utilised to train and evaluate three models: Random Forest, XGBoost, and Long Short-Term Memory (LSTM) networks. Among the three models, XGBoost demonstrated the highest predictive accuracy at 73.5%, making it the most effective model for CVD detection. LSTM exhibited the highest recall (0.81), making it well-suited for identifying high-risk patients, but it also generated a higher number of false positives, potentially leading to unnecessary medical interventions. Random Forest, a baseline model, achieved 71.2% accuracy, showing stable but slightly lower performance. These findings highlight the superiority of XGBoost in predictive accuracy, while LSTM remains useful in maximising sensitivity to disease detection. The results emphasise the potential of machine learning in automated cardiovascular risk assessment, allowing for data-driven clinical decision-making that can enhance early intervention strategies. The study's implications extend to EHR optimisation, AI integration in medical workflows, and the deployment of computational models for clinical risk management.
Downloads
References
Maredza, M., Preventing Cardiovascular Disease in Rural South Africa. 2018, University of the Witwatersrand, Johannesburg (South Africa).
Moja, L., et al., Effectiveness of computerised decision support systems linked to electronic health records: a systematic review and meta-analysis. American journal of public health, 2014. 104(12): p. e12-e22.
Shen, Y., et al., Twenty-Five Years of Evolution and Hurdles in Electronic Health Records and Interoperability in Medical Research: Comprehensive Review. Journal of Medical Internet Research, 2025. 27: p. e59024.
Powell, L., et al., Assessment of health information technology–related outpatient diagnostic delays in the US Veterans Affairs health care system: A qualitative study of aggregated root cause analysis data. JAMA Network Open, 2020. 3(6): p. e206752-e206752.
Fatima, S., Improving Healthcare Outcomes through Machine Learning: Applications and Challenges in Big Data Analytics. International Journal of Advanced Research in Engineering Technology & Science, 2024. 11.
Priyadarshini, K. et al., Integrating relational and sequential information for enhanced detection of autoimmune disorders with relational Neural Networks and Long ShortTerm Memory Networks. Biomedical Signal Processing and Control, 2024. 96: p.106495.
Rostamzadeh, N., Visual Analytics for Performing Complex Tasks with Electronic Health Records. 2021: The University of Western Ontario (Canada).
Ananthajothi, K., J. David, and A. Kavin. Cardiovascular disease prediction using patient history and real time monitoring. in 2024 2nd International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT). 2024. IEEE.
Du, Z., et al., Accurate prediction of coronary heart disease for patients with hypertension from electronic health records with big data and machine-learning methods: model development and performance evaluation. JMIR medical informatics, 2020. 8(7): p. e17257.
Ambinder, E.P., Electronic health records. Journal of oncology practice, 2005. 1(2): p.57.
Sitapati, A., et al., Integrated precision medicine: the role of electronic health records in delivering personalized treatment. Wiley Interdisciplinary Reviews: Systems Biology and Medicine, 2017. 9(3): p. e1378.
Chen, F., et al., Unmasking bias in artificial intelligence: a systematic review of bias detection and mitigation strategies in electronic health record-based models. Journal of the American Medical Informatics Association, 2024. 31(5): p. 1172-1183.
Khan, M.S., et al., Leveraging electronic health records to streamline the conduct of cardiovascular clinical trials. European heart journal, 2023. 44(21): p. 1890-1909.
Marcilly, R., et al., Usability flaws of medication-related alerting functions: a systematic qualitative review. Journal of biomedical informatics, 2015. 55: p. 260-271.
Sarwar, T., et al., The secondary use of electronic health records for data mining: data characteristics and challenges. ACM Computing Surveys (CSUR), 2022. 55(2): p. 1-40.
Javaid, M., et al., Significance of machine learning in healthcare: Features, pillars and applications. International Journal of Intelligent Networks, 2022. 3: p. 58-73.
Ismanto, E., et al., A Comparative Study of Improved Ensemble Learning Algorithms for Patient Severity Condition Classification. Journal of Electronics, Electromedical Engineering, and Medical Informatics, 2024. 6(3): p. 312-321.
Morid, M.A., O.R.L. Sheng, and J. Dunbar, Time series prediction using deep learning methods in healthcare. ACM Transactions on Management Information Systems, 2023. 14(1): p. 1-29.
Hanieh, A. and A.A.a. Mohammad, Using Machine Learning Application for Cardiovascular Disease Prediction and Diagnosis. 2023.
Shameer, K., et al., Machine learning in cardiovascular medicine: are we there yet? Heart, 2018. 104(14): p. 1156-1164.
Nandini, S., Comparative Study of Machine Learning Algorithms in Detecting Cardiovascular Diseases. arXiv preprint arXiv:2405.17059, 2024.
Naser, M.A., et al., A review of machine learning’s role in cardiovascular disease prediction: recent advances and future challenges. Algorithms, 2024. 17(2): p. 78.
Ullah, M., et al., Smart technologies used as smart tools in the management of cardiovascular disease and their future perspective. Current Problems in Cardiology, 2023. 48(11): p. 101922.
Ekundayo, F. and H. Nyavor, AI-Driven Predictive Analytics in Cardiovascular Diseases: Integrating Big Data and Machine Learning for Early Diagnosis and Risk Prediction.
Eom, J.-H., S.-C. Kim, and B.-T. Zhang, AptaCDSS-E: A classifier ensemble-based clinical decision support system for cardiovascular disease level prediction. Expert Systems with Applications, 2008. 34(4): p. 2465-2479.
Gao, X.-Y., et al., Improving the accuracy for analyzing heart diseases prediction based on the ensemble method. Complexity, 2021. 2021(1): p. 6663455.
Ren, Y., et al., A new random forest ensemble of intuitionistic fuzzy decision trees. IEEE Transactions on Fuzzy Systems, 2022. 31(5): p. 1729-1741.
Zhang, P., Y. Jia, and Y. Shang, Research and application of XGBoost in imbalanceddata. International Journal of Distributed Sensor Networks, 2022. 18(6): p.15501329221106935.
Shakkeera, L., Predictive analytics: Unveiling the potential of machine learning and deep learning. International Journal of Systematic Innovation, 2025. 9(1): p. 116-128.
Poongodi, T., et al., Deep learning techniques for electronic health record (EHR) analysis. Bio-inspired Neurocomputing, 2021: p. 73-103.
Radha, M.G., Data-driven health monitoring and lifestyle interventions: towards management of hypertension and other lifestyle diseases through data-driven modelling of physiology and behaviour. 2020.
Ghahramani, Z., Probabilistic machine learning and artificial intelligence. Nature, 2015. 521(7553): p. 452-459.
Morgan, M. and J. Shanahan, Two decades of cultivation research: An appraisal and meta-analysis, in Communication yearbook 20. 2012, Routledge. p. 1-45.
Syafrudin, M., et al., Performance analysis of IoT-based sensor, big data processing, and machine learning model for real-time monitoring system in automotive manufacturing. Sensors, 2018. 18(9): p. 2946.
Salmi, M., et al., Handling imbalanced medical datasets: review of a decade of research. Artificial Intelligence Review, 2024. 57(10): p. 273.
Li, D.-C., C.-W. Liu, and S.C. Hu, A learning method for the class imbalance problem with medical data sets. Computers in biology and medicine, 2010. 40(5): p. 509-518.
Fernández, A., et al., SMOTE for learning from imbalanced data: progress andchallenges, marking the 15-year anniversary. Journal of artificial intelligence research, 2018. 61: p. 863-905.
García-Laencina, P.J., et al., K nearest neighbours with mutual information for simultaneous classification and missing data imputation. Neurocomputing, 2009. 72(7-9): p. 1483-1493.
Mesinovic, M., P. Watkinson, and T. Zhu, Explainable AI for clinical risk prediction: a survey of concepts, methods, and modalities. arXiv preprint arXiv:2308.08407, 2023.
Albahri, A.S., et al., A systematic review of trustworthy and explainable artificial intelligence in healthcare: Assessment of quality, bias risk, and data fusion. Information Fusion, 2023. 96: p. 156-191.
Rane, N., S. Choudhary, and J. Rane, Explainable artificial intelligence (XAI) in healthcare: Interpretable models for clinical decision support. Available at SSRN 4637897, 2023.
Husby, U.E., Exploring Breast Cancer Diagnosis: A Study of SHAP and LIME in XAIDriven Medical Imaging. 2024, Norwegian University of Life Sciences.
Esmaeilzadeh, P., Challenges and strategies for wide-scale artificial intelligence (AI) deployment in healthcare practices: A perspective for healthcare organizations. Artificial Intelligence in Medicine, 2024. 151: p. 102861.
Forcier, M.B., et al., Integrating artificial intelligence into health care through data access: can the GDPR act as a beacon for policymakers? Journal of Law and the Biosciences, 2019. 6(1): p. 317-335.
Ching, T., et al., Opportunities and obstacles for deep learning in biology and medicine. Journal of the royal society interface, 2018. 15(141): p. 20170387.
Kalusivalingam, A.K., et al., Enhancing Predictive Business Analytics with Deep Learning and Ensemble Methods: A Comparative Study of LSTM Networks and Random Forest Algorithms. International Journal of AI and ML, 2020. 1(2).
Dang, Y., et al., A comparative study of non-deep learning, deep learning, and ensemble learning methods for sunspot number prediction. Applied Artificial Intelligence, 2022. 36(1): p. 2074129.
Lv, H., et al., machine learning–driven models to predict prognostic outcomes in patients hospitalized with heart failure using electronic health records: retrospective study. Journal of medical Internet research, 2021. 23(4): p. e24996.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Busayo Jegede

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.