- 2024-02: π€ We released the datasets introduced by GIMMICK on HuggingFace
- 2024-02: π We released the first preprint of the GIMMICK Benchmark paper.
Tested only on Debian-based machines
Note that most models are loaded in FP16 or BF16 and Flash Attention 2. Make sure your GPU supports these features_
- Install mamba
mamba env create -f environment.yml
mamba activate gimmick && pip install flash-attn --no-build-isolation
src/scripts/check_env.sh
If the last step was successful, it should print something like this:
##################### PYTHON ENV 'gimmick' INFO START #####################
Python version: 3.12.8 | packaged by conda-forge | (main, Dec 5 2024, 14:24:40) [GCC 13.3.0]
PyTorch version: 2.5.1
CUDA available: True
CUDA version: 11.8
CUDA devices: 1
Flash Attention 2 Support: True
Transformers version: 4.48.1
##################### PYTHON ENV 'gimmick' INFO END #####################
Run PYTHONPATH=src python src/gimmick/main.py models
to print the list of supported models.
- Qwen2 VL 7B π€/qwen/qwen2-vl-7b-instruct
- Qwen2 VL 72B π€/qwen/qwen2-vl-72b-instruct
- Phi 3.5 Vision π€/microsoft/phi-3.5-vision-instruct
- MiniCPM V 2.6 π€/openbmb/minicpm-v-2_6
- Llama 3.2 11B Vision π€/meta-llama/llama-3.2-11b-vision-instruct
- InternVL2.5 1B π€/opengvlab/internvl2_5-1b
- InternVL2.5 2B π€/opengvlab/internvl2_5-2b
- InternVL2.5 4B π€/opengvlab/internvl2_5-4b
- InternVL2.5 8B π€/opengvlab/internvl2_5-8b
- InternVL2.5 26B π€/opengvlab/internvl2_5-26b
- InternVL2.5 38B π€/opengvlab/internvl2_5-38b
- InternVL2.5 78B π€/opengvlab/internvl2_5-78b
- Centurio Aya π€/wuenlp/centurio_aya
- Centurio Qwen π€/wuenlp/centurio_qwen
- Qwen2.5 0.5B π€/qwen/qwen2.5-0.5b-instruct
- Qwen2.5 1.5B π€/qwen/qwen2.5-1.5b-instruct
- Qwen2.5 3B π€/qwen/qwen2.5-3b-instruct
- Qwen2.5 7B π€/qwen/qwen2.5-7b-instruct
- Qwen2.5 32B π€/qwen/qwen2.5-32b-instruct
- Qwen2.5 72B π€/qwen/qwen2.5-72b-instruct
- InternLM2.5 1.8B π€/internlm/internlm2_5-1_8b-chat
- InternLM2.5 7B π€/internlm/internlm2_5-7b-chat
- InternLM2.5 20B π€/internlm/internlm2_5-20b-chat
- Aya Expanse 8B π€/cohereforai/aya-expanse-8b
- Phi 3.5 Mini π€/microsoft/phi-3.5-mini-instruct
- GPT-4o Mini gpt-4o-mini-2024-07-18
- GPT-4o gpt-4o-2024-11-20
- Gemini Flash gemini-1.5-flash-002
- Gemini Pro gemini-1.5-pro-002
- Claude 3.5 Sonnet claude-3-5-sonnet-20241022