Highlights
- Pro
Popular repositories Loading
-
interpretable-fine-tuning
interpretable-fine-tuning PublicTraining components that act on SAE latents so that it is describe what was learned during fine-tuning
Jupyter Notebook 2
-
sae_jailbreak_unlearning
sae_jailbreak_unlearning PublicInvestigating how well intervening on Sparse Autoencoder internals prevents adversaries from accessing dangerous knowledge.
Jupyter Notebook 1
-
-
Choregraphe-Library
Choregraphe-Library PublicChoregraphe was missing tons of useful methods. Seriously? No addition? What kind of """"programming language"""" doesn't have addition? This file is my attempt to put back some of the things that …
-
WeirdlyConcreteXKCD
WeirdlyConcreteXKCD PublicC++ code simulating the Weirdly Concrete problem from https://www.explainxkcd.com/wiki/index.php/2529:_Unsolved_Math_Problems. Python code can be found
C++
If the problem persists, check the GitHub status page or contact support.