# xAutoML-Project2
Project 2 for the 'Explainable Automated Machine Learning' (LTAT.02.023) course.
Authors:
- Marilin Moor
- Dmitri Rozgonjuk
- Jure Vito Srovin
- Lisanna Lehes
- Allan Mitt
This project uses AutoML to predict clinical outcomes from imaging and clinical variables. The imaging modality of interest is positron emission tomography (PET), and the outcome to predict is a major adverse cardiac event (MACE) with heart failure. The specific goals are (1) to find, with the TPOT framework, the best machine learning pipelines (by weighted F1-score) for models based on two datasets, and (2) to apply interpretability techniques from the SHAP framework to provide insight into the black-box models and explain the major drivers of predictions at the global, local, and group levels.
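As a minimal sketch of goal (1), a weighted-F1 TPOT search might look like the following (assuming the classic TPOT API; the synthetic data, search budget, and random seed are illustrative assumptions, not the exact settings used in this project):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from tpot import TPOTClassifier

# Synthetic stand-in for the PET + clinical feature matrix (illustrative).
X, y = make_classification(n_samples=500, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42)

# Genetic-programming search over scikit-learn pipelines,
# optimizing the weighted F1-score as in goal (1).
tpot = TPOTClassifier(
    generations=5,          # illustrative search budget
    population_size=20,
    scoring='f1_weighted',
    cv=5,
    random_state=42,
    verbosity=2,
)
tpot.fit(X_train, y_train)
print(tpot.score(X_test, y_test))   # weighted F1 on the held-out test set
tpot.export('tpot_X1.py')           # write the best pipeline to a .py file
```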
The general project workflow is presented in Figure 1.
We first selected the appropriate data. We searched for the best pipelines in two datasets: one with fewer features (the 'yellow' columns; X1) and one with additional features (the 'yellow' + 'blue' columns; X2).
After data extraction, we split the full dataset into training and test sets with an 80/20 split. The test data were held out to evaluate the goodness of the best pipeline, which was in turn found with the TPOT framework.
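A sketch of this step, assuming the data live in a pandas DataFrame with a binary MACE outcome column; all file and column names below are hypothetical placeholders:

```python
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv('data.csv')                      # hypothetical file name

# Hypothetical column groups: X1 uses only the 'yellow' columns,
# X2 adds the 'blue' columns on top of them.
yellow_cols = ['age', 'sex', 'rest_perfusion']    # placeholders
blue_cols = ['stress_perfusion', 'flow_reserve']  # placeholders

y = df['mace']                                    # outcome to predict
X1 = df[yellow_cols]
X2 = df[yellow_cols + blue_cols]

# 80/20 split; the test set is held out for evaluating the best pipeline.
X1_train, X1_test, y_train, y_test = train_test_split(
    X1, y, test_size=0.2, stratify=y, random_state=42)
```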
After the TPOT framework had produced the best pipelines, we used them (i.e., for both the X1 and X2 data) in interpretability frameworks. Model interpretability was provided at the global (feature importances, SHAP summary plot), local (SHAP force_plot), and group (local explanation methods aggregated over a given group) levels. We extended previously produced software (SHAP) to allow more customization of the interpretability views.
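A sketch of the three levels, assuming `model` is a fitted TPOT pipeline and `X_train`/`X_test` are pandas DataFrames. It uses SHAP's model-agnostic KernelExplainer, whose output for binary classifiers is a list with one array per class in classic SHAP versions (newer releases may return a single array):

```python
import numpy as np
import shap

# model: the fitted best pipeline returned by TPOT (treated as a black box).
background = shap.sample(X_train, 50)   # background sample for the explainer
explainer = shap.KernelExplainer(model.predict_proba, background)
shap_values = explainer.shap_values(X_test)   # list: one array per class

# Global level: feature importances over the whole test set.
shap.summary_plot(shap_values[1], X_test)

# Local level: explanation of a single prediction.
shap.force_plot(explainer.expected_value[1], shap_values[1][0], X_test.iloc[0])

# Group level: aggregate local explanations over a subgroup,
# e.g. mean |SHAP| among patients the model predicts as positive.
group = model.predict(X_test) == 1
group_importance = np.abs(shap_values[1][group]).mean(axis=0)
```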
Finally, the interpretability is evaluated based on the paper found here.
Repository files:
- `tpot_models/`: directory with the Python files for the TPOT models (results)
  - `tpot_X1.py`: results for Model 1 (fewer features)
  - `tpot_X2.py`: results for Model 2 (more features)
- `README.md`: the present file, containing the project meta-information
- `autosklearn_approach.ipynb`: a notebook where we initially tried to implement the auto-sklearn approach; not used in the final solution
- `interpretability.ipynb`: a notebook with the interpretability part; it requires the TPOT model files to exist
- `requirements.txt`: the Python packages to install
- `tpot_approach.ipynb`: a notebook with the TPOT implementation as the automated ML approach
Although the present project ships with precomputed solutions, it is also possible to run the models with self-defined (hyper-)parameters. To do so, first run the scripts in the `tpot_approach.ipynb` notebook, which prepares the data and runs the TPOT framework. Once the computations are done, the `.py` files with the best pipelines for the `X1` and `X2` datasets are produced. Then continue with the `interpretability.ipynb` notebook, where the models are imported and the interactive interpretability approach can be executed. We kept these approaches in separate notebooks for clarity, since in both cases running the scripts can be time-consuming.
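For reference, a TPOT-exported script defines the best model under the name `exported_pipeline`. A minimal sketch of reusing one of the exported files, assuming it has been trimmed to only the pipeline definition and that `X1_train`, `y_train`, and `X1_test` come from the data-preparation step above:

```python
# Assumes tpot_models/tpot_X1.py was trimmed so that importing it only
# defines `exported_pipeline` (TPOT's default name for the best model).
from tpot_models.tpot_X1 import exported_pipeline

exported_pipeline.fit(X1_train, y_train)          # refit on the training split
predictions = exported_pipeline.predict(X1_test)  # predictions on held-out data
```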