Update author info
htwangtw committed Jul 10, 2024
1 parent 6972910 commit a3184d2
Showing 1 changed file with 7 additions and 1 deletion.
8 changes: 7 additions & 1 deletion README.md
@@ -1,5 +1,10 @@
# General Class Balancer

Author: mleming
Maintainer: Hao-Ting Wang

## Description

This program finds a subset of your dataset in which the confounding factors are balanced across classes (also known as "data matching"). The confounding factors can be any combination of categorical and continuous variables.

Given a labeled dataset with any number of classes and a number of confounding factors for each datapoint, the program matches datapoints across classes so that the distribution of each confounding factor is the same in every class. The resulting subset can be used as a training set on which a deep learning model cannot rely on the confounding factors to separate the classes.
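
The matching idea described above can be illustrated with a minimal sketch. This is not the repository's implementation: it handles only a single continuous confounding factor, uses equal-width bins, and the function name `balance_by_bins` and its parameters are hypothetical, chosen for illustration. Within each bin of the confounding factor, it keeps the same number of datapoints from every class.

```python
import numpy as np


def balance_by_bins(labels, confound, n_bins=10, seed=0):
    """Hypothetical helper: return indices of a subset in which every class
    has the same number of datapoints in each bin of `confound`."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    confound = np.asarray(confound, dtype=float)
    classes = np.unique(labels)

    # Equal-width bin edges over the observed range of the confound.
    edges = np.linspace(confound.min(), confound.max(), n_bins + 1)
    # Using only the interior edges maps every value to a bin in 0..n_bins-1.
    bin_ids = np.digitize(confound, edges[1:-1])

    keep = []
    for b in range(n_bins):
        per_class = [np.flatnonzero((labels == c) & (bin_ids == b)) for c in classes]
        n = min(len(idx) for idx in per_class)
        if n == 0:
            continue  # at least one class is absent from this bin, so drop it
        # Keep the same number of randomly chosen points from every class.
        for idx in per_class:
            keep.extend(rng.choice(idx, size=n, replace=False))
    return np.sort(np.array(keep, dtype=int))


# Simulated example: two classes whose "age" distributions differ.
rng = np.random.default_rng(1)
labels = np.repeat([0, 1], 500)
age = np.concatenate([rng.normal(40, 10, 500), rng.normal(55, 10, 500)])

subset = balance_by_bins(labels, age)
gap_before = abs(age[labels == 0].mean() - age[labels == 1].mean())
gap_after = abs(age[subset][labels[subset] == 0].mean()
                - age[subset][labels[subset] == 1].mean())
print(f"mean age gap before: {gap_before:.2f}, after: {gap_after:.2f}")
```

Dropping datapoints is the price of balance: bins where one class is missing contribute nothing, so the subset can be much smaller than the original dataset when the class distributions barely overlap.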
@@ -8,8 +13,9 @@ A version of this method was originally introduced in https://arxiv.org/abs/2002

![alt text](description.png "A description of the general class balancer algorithm")

## Testing

All of the code operates on NumPy arrays. The repository provides the balancer itself and a script that simulates data from random variables. To try it, run

```
python random_example_test.py
```
