Multi-label Neural Model for Prediction of Myocardial Infarction Complications with Resampling and Explainability

This repository is the official implementation of Multi-label Neural Model for Prediction of Myocardial Infarction Complications with Resampling and Explainability. The dataset used in this project can be accessed here.

The figure below shows the pipeline system that summarizes the whole predictive modeling process:

Data Processing

The 2 notebooks document how the dataset was processed as delineated in the paper:

Data_Preprocessing_1 includes the following section:

Remove features with significant missingness (>25%)
Feature selection

Data_Preprocessing_2 includes the following section:

Data imputation (with MICE, PMM, Mean/Mode, regression, kNN and evaluated with NRMSE)
Splitting the dataset into test/train subsets
Outcome label space reconstruction

Training

The following multi-label models were trained and the results evaluated:

Neural network (NN) (The trained NN model is saved here)
Random k-labelsets (RAKEL): RakelD, RakelO
Multi-label support vector machines (MLTSVM)
Label powerset (LP)
Majority voting (MV)
Binary relevance k-nearest neighbor (BRkNN): BRkNNa, BRkNNb
Binary relevance (BR)
Classifier chains (CC)
Multi-label k-Nearest Neighbours (MlkNN)
Multi-output classifier (MOC)
Multi-label fuzzy adaptive resonance associative map (MLARAM)
Label space partition ensemble classifier (LSPEC)

Results

Our models achieve the following performances:

Shapley Analysis

Take "lethal outcome" as an example, the beeswarm plot below provides an overview of the impact of the features on the prediction, with each dot representing the Shapley value of every feature for all samples. The pink dots of positive Shapley values indicate that the higher values of the said feature pushes the model to a positive prediction, whereas the those of negative Shapley values push the prediction in the opposite direction. The duration of arterial hypertension, time elapsed from the beginning of the attack of CHD to the hospital, and quantity of myocardial infarctions in the anamnesis were observed to be the most important features that lead to a prediction of death while the presence of an anterior myocardial infarction pushes the model to a negative prediction.

The figure below shows the average absolute of the Shapley values over the whole testing dataset for all five prediction outcomes. The duration of arterial hypertension, exertional angina pectoris in the anamnesis, and presence of an anterior/inferior infarction were observed to be the most important features for all five outcomes.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Preprocessing		Preprocessing
Results		Results
Training		Training
Pipeline.PNG		Pipeline.PNG
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-label Neural Model for Prediction of Myocardial Infarction Complications with Resampling and Explainability

Data Processing

Data_Preprocessing_1 includes the following section:

Data_Preprocessing_2 includes the following section:

Training

Results

Shapley Analysis

About

Uh oh!

Releases

Packages

Languages

Munib5/MI-MultiLabel

Folders and files

Latest commit

History

Repository files navigation

Multi-label Neural Model for Prediction of Myocardial Infarction Complications with Resampling and Explainability

Data Processing

Data_Preprocessing_1 includes the following section:

Data_Preprocessing_2 includes the following section:

Training

Results

Shapley Analysis

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages