Repositories
- model-vs-human
Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)
- CiteME
CiteME is a benchmark designed to test the ability of language models to find papers that are cited in scientific texts.
- sort-and-search
Code for the paper: "Efficient Lifelong Model Evaluation in an Era of Rapid Progress" [NeurIPS'24]
- frequency_determines_performance
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
- foolbox
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
- DataTypeIdentification
Code for the ICLR'24 paper: "Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models"
- robustness
Robustness and adaptation of ImageNet-scale models. Pre-release; stay tuned for updates.