GitHub - aakware/heart-disease-prediction: A Logistic Regression Model for Heart Disease Prediction

Heart Disease Prediction

A Logistic Regression Model trained on Kaggle's Heart Disease Dataset

Project Overview

Cardiovascular diseases are a leading cause of death worldwide. This project aims to develop a simple yet effective model to predict heart disease, helping in early detection and prevention. By analyzing key health indicators such as age, cholesterol levels, etc., this model can classify whether a patient is at risk of heart disease.

Exploration.ipynb: A Notebook dedicated to data exploration and preprocessing.
LogisticRegression.ipynb: The Notebook where the Logistic Regression is trained over the clean data.
XGBoost.ipynb: The Notebook containing Extreme Gradient Boost model trained over the clean data
requirements.txt: A file listing the Python dependencies needed to run the notebooks.
heart-disease.csv: The dataset file over which the model is trained.

How to Run

Clone this repository
Install dependencies
Run the notebooks.

Model Performance

For Logistic Regression:

              precision    recall  f1-score   support

           0       0.85      0.83      0.84        41
           1       0.86      0.88      0.87        50

    accuracy                           0.86        91
   macro avg       0.86      0.85      0.86        91
weighted avg       0.86      0.86      0.86        91

For XGB Classifier:

              precision    recall  f1-score   support

           0       0.88      0.85      0.86        41
           1       0.88      0.90      0.89        50

    accuracy                           0.88        91
   macro avg       0.88      0.88      0.88        91
weighted avg       0.88      0.88      0.88        91

Future Work

This project can be extended by exploring more advanced models to improve its accuracy. Regarding Random Forest, I could achieve Max accuracy upto 0.802, which is actually lower than that of Logistic Regression (0.86); perhaps more work need to be done to search for better data and preprocessing.

Contributing

Contributions are welcome! Feel free to make pull requests.

License

The data used in this project is licensed under MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Exploration.ipynb		Exploration.ipynb
LogisticRegression.ipynb		LogisticRegression.ipynb
README.md		README.md
XGBoost.ipynb		XGBoost.ipynb
heart-disease.csv		heart-disease.csv
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Heart Disease Prediction

Project Overview

Contents

How to Run

Model Performance

Future Work

Contributing

License

About

Uh oh!

Releases

Packages

Languages

aakware/heart-disease-prediction

Folders and files

Latest commit

History

Repository files navigation

Heart Disease Prediction

Project Overview

Contents

How to Run

Model Performance

Future Work

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages