Diabetes disease prediction using machine learning

Diabetes disease prediction using machine learning

Jobin M. Scaria, Haritha Bargav, Sharmila Joseph, Sasikumar V R, A. Baseera | June 2025

Loading

Abstract

Diabetes mellitus is a chronic disease affecting millions worldwide, and early predictions play a crucial role in preventive healthcare. This project aims to develop an efficient and user-friendly system for diabetes disease prediction using machine learning techniques. Leveraging the Random Forest Classifier from the scikit -learn library, the model was trained on the 2019 diabetes dataset after proper preprocessing with Pandas and Label Encoder. Model performance was evaluated using standard metrics like accuracy score after data splitting using train_test_split. To visualize model performance, Matplotlib was used to create comparative bar charts. The project integrates machine learning with a web-based interface to enhance accessibility and usability. The frontend is built using HTML5 and Jinja templates, dynamically rendering the welcome screen, questionnaire form, and result page. The backend is developed using Flask, a lightweight Python web framework, which manages routing, form submissions, and session handling. The trained ML model is serialized using Pickle, allowing seamless loading for real-time predictions. Additionally, FPDF is employed to generate downloadable PDF reports for users, summarizing their input, risk score, and relevant recommendations. This system not only demonstrates the practical application of ML in healthcare but also offers a complete pipeline from data input to result delivery. Future work includes expanding the dataset, integrating additional health indicators, and deploying the solution for mobile use or real clinical environments.

wpChatIcon
wpChatIcon
Open chat
Hello,
How can we help you?