Skip to content
Change the repository type filter

All

    Repositories list

    • Experiments in transformer knowledge and reasoning
      Jupyter Notebook
      MIT License
      12500Updated Dec 21, 2024Dec 21, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2k7.3k339101Updated Dec 20, 2024Dec 20, 2024
    • mdl

      Public
      Minimum Description Length probing for neural network representations
      Python
      MIT License
      21702Updated Dec 20, 2024Dec 20, 2024
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      1k7k6222Updated Dec 19, 2024Dec 19, 2024
    • Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      Apache License 2.0
      0000Updated Dec 19, 2024Dec 19, 2024
    • sae

      Public
      Sparse autoencoders
      Python
      MIT License
      5138552Updated Dec 18, 2024Dec 18, 2024
    • Jupyter Notebook
      Apache License 2.0
      1312661Updated Dec 18, 2024Dec 18, 2024
    • Erasing concepts from neural representations with provable guarantees
      Python
      MIT License
      1521522Updated Dec 18, 2024Dec 18, 2024
    • aria

      Public
      Python
      Apache License 2.0
      114200Updated Dec 17, 2024Dec 17, 2024
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      331871510Updated Dec 16, 2024Dec 16, 2024
    • Jupyter Notebook
      MIT License
      0400Updated Dec 14, 2024Dec 14, 2024
    • Acompanying code for our research on SAE feature overlap when trained on different seeds.
      Jupyter Notebook
      Apache License 2.0
      0000Updated Dec 13, 2024Dec 13, 2024
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      6402Updated Dec 12, 2024Dec 12, 2024
    • MIDI tokenizers and pre-processing utils.
      Python
      Apache License 2.0
      1100Updated Dec 12, 2024Dec 12, 2024
    • Jupyter Notebook
      Apache License 2.0
      21300Updated Dec 11, 2024Dec 11, 2024
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      3874180Updated Dec 10, 2024Dec 10, 2024
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1752.3k253Updated Dec 5, 2024Dec 5, 2024
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      83200Updated Dec 2, 2024Dec 2, 2024
    • Closed-form polynomial approximations to neural networks
      Python
      MIT License
      0200Updated Nov 22, 2024Nov 22, 2024
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      6.1k8700Updated Nov 19, 2024Nov 19, 2024
    • Jupyter Notebook
      54615Updated Nov 17, 2024Nov 17, 2024
    • monkfish

      Public
      Python
      MIT License
      1400Updated Nov 1, 2024Nov 1, 2024
    • Understanding how features learned by neural networks evolve throughout training
      Python
      MIT License
      13100Updated Oct 24, 2024Oct 24, 2024
    • The code used in "Balancing Label Quantity and Quality for Scalable Elicitation"
      Jupyter Notebook
      MIT License
      3300Updated Oct 22, 2024Oct 22, 2024
    • Efficiently computing & storing token n-grams from large corpora
      Rust
      MIT License
      31600Updated Oct 6, 2024Oct 6, 2024
    • Adds GaLore style projection wrappers to optax optimizers
      Python
      MIT License
      0400Updated Oct 3, 2024Oct 3, 2024
    • Equinox implementation of llama3 and llama3.1
      Python
      MIT License
      0710Updated Oct 3, 2024Oct 3, 2024
    • cupbearer

      Public
      A library for mechanistic anomaly detection
      Python
      MIT License
      10600Updated Oct 3, 2024Oct 3, 2024
    • w2s

      Public
      Python
      MIT License
      31820Updated Sep 24, 2024Sep 24, 2024
    • ccs

      Public
      Python
      MIT License
      6533Updated Sep 24, 2024Sep 24, 2024