mini

mechanistic interpretability for neural interpretability

Environment set-up

With pixi (recommended)

Prerequisites:

An installed version of conda
An installed version of pixi

In the root directory, run:

pixi install --manifest-path ./pyproject.toml

This creates a conda env named 'mini'.

Other

All package dependencies are specified in 'pyproject.toml'. You can reformat them as required by your preferred Python environment or package management tool (e.g. pip, poetry, or conda used directly instead of through pixi) and install them with that tool.

Example usage pipeline

Given:

Neural data (binned spike counts, as an array of shape $[examples \times neurons]$)
Behavioral and/or environmental metadata

mini performs the following steps to find interpretable neural signatures that underlie behavioral and/or environmental features (collectively referred to as natural features):

1. Splits the neural data into train/val/test sets.
2. Trains a sparse autoencoder (SAE), using the train + val splits, to reconstruct the neural data.
3. Validates the quality of the SAE by looking at:
   1. Sparsity of SAE features
   2. Reconstruction quality of neural data
4. Ranks the SAE features by interpretability likelihood.
5. For each of the top $k$ SAE features, finds a corresponding natural feature with manual feedback.
6. Validates the SAE-natural feature pairing on the test split by:
   1. Looking at confusion matrix metrics for co-occurrences of the natural feature with the SAE feature.
   2. Showing that the neural signature defined by the SAE feature can decode the natural feature as well as the entirety of the neural data can.
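
The split and the metric-based validation steps above can be sketched in plain NumPy. This is a minimal illustration, not mini's actual API: every function name, the 70/15/15 split fractions, the random (shuffled) split, the activation threshold, and the choice of fraction of variance explained as the reconstruction metric are assumptions made for this example.

```python
import numpy as np


def split_indices(n_examples, frac_train=0.7, frac_val=0.15, seed=0):
    """Shuffle example indices and cut them into train/val/test index arrays.

    Assumption: a random split. Temporally correlated recordings may call
    for a blocked split instead.
    """
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_examples)
    n_train = int(frac_train * n_examples)
    n_val = int(frac_val * n_examples)
    return perm[:n_train], perm[n_train:n_train + n_val], perm[n_train + n_val:]


def mean_active_fraction(acts, threshold=0.0):
    """Sparsity: average fraction of SAE features above threshold per example."""
    return float((np.asarray(acts) > threshold).mean())


def fraction_variance_explained(x, x_hat):
    """Reconstruction quality: 1 - (residual sum of squares / total sum of squares)."""
    x, x_hat = np.asarray(x, dtype=float), np.asarray(x_hat, dtype=float)
    resid = np.sum((x - x_hat) ** 2)
    total = np.sum((x - x.mean(axis=0)) ** 2)
    return float(1.0 - resid / total)


def cooccurrence_metrics(feature_on, natural_on):
    """Confusion-matrix metrics for a binarized SAE feature vs. a binary natural feature."""
    feature_on = np.asarray(feature_on, dtype=bool)
    natural_on = np.asarray(natural_on, dtype=bool)
    tp = int(np.sum(feature_on & natural_on))
    fp = int(np.sum(feature_on & ~natural_on))
    fn = int(np.sum(~feature_on & natural_on))
    tn = int(np.sum(~feature_on & ~natural_on))
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"tp": tp, "fp": fp, "fn": fn, "tn": tn,
            "precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Synthetic stand-in for binned spike counts, shape [examples x neurons].
    rng = np.random.default_rng(0)
    spike_counts = rng.poisson(lam=2.0, size=(200, 30)).astype(float)

    train_idx, val_idx, test_idx = split_indices(len(spike_counts))
    print("split sizes:", len(train_idx), len(val_idx), len(test_idx))

    # Toy check of the co-occurrence metrics on hand-written binary vectors.
    print(cooccurrence_metrics([1, 1, 0, 0, 1], [1, 0, 0, 1, 1]))
```

In practice, `feature_on` would come from thresholding one SAE feature's activations on the test split, and `natural_on` from the matching behavioral or environmental label for the same examples.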
