This project developed a machine learning model to predict the likelihood of strokes by analyzing patient health data, including age, cholesterol levels, blood pressure, glucose levels, and smoking status. The most effective model used was a Support Vector Machine (SVM) with a radial basis function kernel, demonstrating high accuracy and precision. The project also utilized Random Forest for comparison, ultimately confirming the superiority of the SVM model.
pandas
, scikit-learn
, SMOTE
Support Vector Machine (SVM):
Random Forest:
SVM Performance:
Random Forest Performance:
A simple UI was built to gather necessary information and predict stroke probability using the model. The UI includes fields and radio buttons for input.
The project successfully developed an SVM-based model for stroke prediction, demonstrating high accuracy and potential for clinical application. Future work will focus on integrating the model into clinical workflows and exploring real-time analytics for preventive healthcare strategies.