Code for our method for actively steering the features learned by a neural network presented in our DAGM GCPR 2023 paper "Beyond Debiasing: Actively Steering Feature Selection via Loss Regularization".
This repository provides code to use the method presented in our DAGM GCPR 2023 paper "Beyond Debiasing: Actively Steering Feature Selection via Loss Regularization". If you want to get started, take a look at our example network and the corresponding Jupyter notebook.
If you are only interested in the implementation of the feature steering part of the loss, you can find it in `feat_steering_loss(...)` of `regression_network.py`.
Our method generalizes from debiasing to the encouragement and discouragement of arbitrary features. That is, it not only aims at removing the influence of undesired features (biases), but also at increasing the influence of features that are well-established from domain knowledge.
If you use our method, please cite:

```bibtex
@inproceedings{Blunk23:FS,
  author    = {Jan Blunk and Niklas Penzel and Paul Bodesheim and Joachim Denzler},
  booktitle = {DAGM German Conference on Pattern Recognition (DAGM-GCPR)},
  title     = {Beyond Debiasing: Actively Steering Feature Selection via Loss Regularization},
  year      = {2023},
}
```
## Installation

Install with pip, Python, and PyTorch 2.0+:

```bash
git clone https://git.inf-cv.uni-jena.de/blunk/beyond-debiasing.git
cd beyond-debiasing
pip install -r requirements.txt
```

First, create an environment with pip and Python (e.g., an Anaconda environment or a Python virtual environment). We recommend installing PyTorch with CUDA support. Then you can install all remaining packages via pip as described above.
Since our method relies on loss regularization, it is very simple to add to your own networks: you only need to modify your loss function. To help with that, we provide an example network and a Jupyter notebook with example code.
You can find the implementation of the feature steering part of the loss in `feat_steering_loss(...)` of `regression_network.py`, which is where all the magic of our method takes place.
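To illustrate the general pattern of steering features through loss regularization, here is a minimal, hypothetical sketch (not the paper's exact formulation; for that, see `feat_steering_loss(...)` in `regression_network.py`): a regularization term penalizes the influence of discouraged features and rewards the influence of encouraged ones, and is added to the task loss with a weighting factor.

```python
import torch

def feat_steering_penalty(attributions, encouraged_idx, discouraged_idx):
    """Illustrative regularizer (hypothetical helper, not the repository's
    implementation): penalize attribution mass on discouraged features and
    reward it on encouraged features."""
    # Mean absolute attribution per feature over the batch: (batch, features) -> (features,)
    contrib = attributions.abs().mean(dim=0)
    return contrib[discouraged_idx].sum() - contrib[encouraged_idx].sum()

# Usage: add the weighted penalty to the ordinary task loss.
task_loss = torch.tensor(0.5)      # stand-in for e.g. an MSE loss
attr = torch.randn(8, 4)           # stand-in feature attributions, shape (batch, features)
lam = 0.1                          # regularization weight
loss = task_loss + lam * feat_steering_penalty(attr, encouraged_idx=[0], discouraged_idx=[3])
```

The key design point is that only the loss changes; the network architecture and training loop stay untouched.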
- Installation:
  - `requirements.txt`: List of required packages for installation with pip
- Feature attribution:
  - `contextual_decomposition.py`: Wrapper for contextual decomposition
  - `mixed_cmi_estimator.py`: Python port of the CMIh estimator of the conditional mutual information
- Redundant regression dataset:
  - `algebra.py`: Generation of random orthogonal matrices
  - `make_regression.py`: An adapted version of scikit-learn's `make_regression(...)`, where the coefficients are standard-uniform
  - `regression_dataset.py`: Generation of the redundant regression dataset
  - `dataset_utils.py`: Creation of torch datasets from numpy arrays
  - `tensor_utils.py`: Some helpful functions for dealing with tensors
- Example:
  - `feature_steering_example.ipynb`: Example for generating the dataset and creating and training the network, with detailed comments
  - `regression_network.py`: Neural network (PyTorch) used in the example notebook
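As a rough sketch of what the dataset-generation utilities do (this is not the repository's exact generator; the repo's adapted `make_regression(...)` and `regression_dataset.py` differ in detail), one can draw standard-uniform regression coefficients, build a linear target, and wrap the arrays as a torch dataset:

```python
import numpy as np
import torch
from torch.utils.data import TensorDataset

# Hypothetical sketch: regression data with standard-uniform coefficients,
# wrapped as a torch dataset (cf. make_regression.py and dataset_utils.py).
rng = np.random.default_rng(0)
n_samples, n_features = 128, 8
X = rng.standard_normal((n_samples, n_features))
coef = rng.uniform(0.0, 1.0, size=n_features)        # standard-uniform coefficients
y = X @ coef + 0.1 * rng.standard_normal(n_samples)  # linear target plus noise

dataset = TensorDataset(torch.from_numpy(X).float(),
                        torch.from_numpy(y).float().unsqueeze(1))
features, target = dataset[0]
```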
With `mixed_cmi_estimator.py`, this repository includes a Python implementation of the hybrid CMI estimator CMIh presented by Zan et al. The authors' original R implementation can be found here.
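For intuition on what such an estimator computes: the conditional mutual information I(X;Y|Z) measures how much information X and Y share once Z is known. A plain plug-in estimate for fully discrete samples (much simpler than CMIh, which handles mixed discrete/continuous data) can be written directly from the definition:

```python
import numpy as np
from collections import Counter

def discrete_cmi(x, y, z):
    """Plug-in estimate of I(X;Y|Z) for discrete samples, from empirical
    joint frequencies. Illustrative only -- not the CMIh estimator."""
    n = len(x)
    pxyz = Counter(zip(x, y, z))
    pxz = Counter(zip(x, z))
    pyz = Counter(zip(y, z))
    pz = Counter(z)
    cmi = 0.0
    for (xi, yi, zi), c in pxyz.items():
        # I(X;Y|Z) = sum p(x,y,z) * log[ p(x,y,z) p(z) / (p(x,z) p(y,z)) ]
        cmi += (c / n) * np.log((c / n) * (pz[zi] / n)
                                / ((pxz[(xi, zi)] / n) * (pyz[(yi, zi)] / n)))
    return cmi

# With X == Y (a fair bit) and a constant Z, I(X;Y|Z) reduces to H(X) = log 2.
x = [0, 1] * 500
cmi = discrete_cmi(x, x, [0] * 1000)
```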
This repository is released under the CC BY 4.0 license, which allows both academic and commercial use. If you need any support, please open an issue or contact Jan Blunk.