v0.10.0 (2024-03-26)
CI
- ci: renamed test job and workflow (#282)
ci: Added tests (6675bb8
)
Documentation
-
docs: add dataset schemas (#255)
-
docs: update AbsTaskClassification.py document schema for classification task
-
update AbsTaskBitextMining.py
-
update BornholmskBitextMining.py
-
update AbsTaskClustering.py and BlurbsClusteringP2P.py
-
update 8 files
-
update 9 files
-
update AbsTaskReranking.py
-
update BlurbsClusteringP2P.py
-
update CMTEBPairClassification.py
-
update GerDaLIRRetrieval.py
-
update 7 files
-
update AbsTaskBitextMining.py
-
update AbsTaskClassification.py (
c3ce1ac
) -
docs: Add development installation instructions (#246)
-
docs: Add development installation instructions
-
removed unused requirements file
I don't believe this is necessary, as setup.py specifies the same dependencies
-
docs: Updated make file with new dependencies
-
ci: Update ci to use make commands
This ensures that the user runs exactly what the CI expects
-
ci: Avoid specifying tests folder as it causes issues with tests
-
ci: removed unnecessary args for test ci
-
Added dev install (
0048878
)
Feature
- feat: update revision id of wikicitiesclustering task (
fb90c02
)
Fix
-
fix: dead link in readme (
ecbb776
) -
fix: Added sizes to the metadata (#276)
-
restructuring the readme
-
added mmteb
-
removed unnecessary method
-
Added docstring to metadata
-
Updated outdated examples
-
formatting documents
-
fix: Updated form to be parsed correctly
-
fix: Added sizes to the metadata
this allows for automatic metadata generation
-
Updated based on feedback
-
Apply suggestions from code review
Co-authored-by: Niklas Muennighoff <[email protected]>
-
updated based on feedback
-
Added suggestion from review
-
added correction based on review
-
reformatted empty fields to None
Co-authored-by: Niklas Muennighoff <[email protected]> (cd4a012
)
-
fix: remove debugging print statement (
d292d93
) -
fix: pass parallel_retrieval kwarg to use DenseRetrievalParallelExactSearch (
19b8f66
) -
fix: msmarco-v2 uses dev.tsv, not dev1.tsv (
6908d21
)
Refactor
-
refactor: add metadata basemodel (#260)
-
refactor: rename description to metadata dict
-
refactor: add TaskMetadata and first example
-
update 9 files
-
update TaskMetadata.py
-
update TaskMetadata.py
-
update TaskMetadata.py
-
update LICENSE, TaskMetadata.py and requirements.dev.txt
-
update 151 files
-
update 150 files
-
update 43 files and delete 1 file
-
update 106 files
-
update 45 files
-
update 6 files
-
update 14 files
-
Added model results to repo and updated CLI to create consistent folder structure. (#254)
-
Added model results to repo and updated CLI to create consistent folder structure.
-
ci: updated ci to use make install
-
Added missing pytest dependencies
-
Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
-
Restructuring the readme (#262)
-
restructuring the readme
-
removed double specification of versions and moved all setup to pyproject.toml
-
correctly use flat-layout for the package
-
build(deps): update TaskMetadata.py and pyproject.toml
-
update 221 files
-
build(deps): update pyproject.toml
-
build(deps): update pyproject.toml
-
build(deps): update pyproject.toml
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (dd5d617
)
Unknown
-
Ci-fix (#289)
-
added release pipeline
-
v1.3.0
-
ci: moved release to the correct folder (
7f56c1a
) -
v1.3.0
-
added release pipeline
-
v1.3.0 (
5e4d10e
) -
tests: speed up tests (#283)
update Makefile and test_all_abstasks.py (2155bf6
)
-
Merge branch 'main' of https://github.com/embeddings-benchmark/mteb (
c9d1a03
) -
Enable ruff ci (#279)
-
restructuring the readme
-
added mmteb
-
removed unnecessary method
-
Added docstring to metadata
-
Updated outdated examples
-
formatting documents
-
fix: Updated form to be parsed correctly
-
fix: Added sizes to the metadata
this allows for automatic metadata generation
-
Updated based on feedback
-
Apply suggestions from code review
Co-authored-by: Niklas Muennighoff <[email protected]>
-
updated based on feedback
-
Added suggestion from review
-
added correction based on review
-
reformatted empty fields to None
-
CI: Enable linter
Co-authored-by: Niklas Muennighoff <[email protected]> (a16eb07
)
-
Added MMTEB (#275)
-
restructuring the readme
-
added mmteb
-
removed unnecessary method
-
Added docstring to metadata
-
Updated outdated examples
-
formatting documents
-
fix: Updated form to be parsed correctly
-
Updated based on feedback
-
Apply suggestions from code review
Co-authored-by: Niklas Muennighoff <[email protected]>
-
updated based on feedback
-
Added suggestion from review
-
added correction based on review
Co-authored-by: Niklas Muennighoff <[email protected]> (c0dc49a
)
-
dev: add isort (#271)
-
dev: add isort
-
dev: add isort (
845099d
) -
dev: run tests on pull request towards any branch (
13f759a
) -
Merge branch 'main' of https://github.com/embeddings-benchmark/mteb (
b42abe4
) -
replaced linter with ruff (#265)
-
restructuring the readme
-
removed double specification of versions and moved all setup to pyproject.toml
-
correctly use flat-layout for the package
-
replaced linter with ruff
-
rerun tests
-
ci: Added in newer workflow
some of them are disabled as they require other issues to be solved
- Update Makefile
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (023e881
)
-
Restructuring the readme (#262)
-
restructuring the readme
-
removed double specification of versions and moved all setup to pyproject.toml
-
correctly use flat-layout for the package (
769157b
) -
restructuring the readme (
364be7f
) -
Added model results to repo and updated CLI to create consistent folder structure. (#254)
-
Added model results to repo and updated CLI to create consistent folder structure.
-
ci: updated ci to use make install
-
Added missing pytest dependencies
-
Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (8a758bc
)
-
dev: add workspace defaults in VSCode (#253)
-
dev: add black as default formatter in vscode
-
Update .vscode/settings.json
Co-authored-by: Kenneth Enevoldsen <[email protected]> (30e5b9e
)
-
Add Danish Discourse dataset (#247)
-
misc.
-
update ddisco.py
-
chore: delete ddisco.py, ddisco.test.tsv and ddisco.train.tsv
-
Update mteb/tasks/Classification/DdiscoCohesionClassification.py
Co-authored-by: Kenneth Enevoldsen <[email protected]>
- Update mteb/tasks/Classification/DdiscoCohesionClassification.py
Co-authored-by: Kenneth Enevoldsen <[email protected]>
- Update mteb/tasks/Classification/DdiscoCohesionClassification.py
Co-authored-by: Imene Kerboua <[email protected]>
- Update mteb/tasks/Classification/DdiscoCohesionClassification.py
Co-authored-by: Imene Kerboua <[email protected]>
- Update mteb/tasks/Classification/DdiscoCohesionClassification.py
Co-authored-by: Imene Kerboua <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: Imene Kerboua <[email protected]> (d46d0f5
)
-
Update structure of mteb/tasks to mteb/tasks/{type}/{language} (#245)
-
Fix structure of mteb/tasks
Fixes #243 -
fix: Added missing init files (
b1c78c1
) -
tests: do not run tests on collection (#249)
test: update test_all_abstasks.py (236614a
)
-
Update README.md with correct DRESModel location (
399edf4
) -
Fix typo (
9610378
) -
Set dev version (
716f59c
) -
Release: 1.2.0 (
9e9dca8
) -
Rmv superfluous file (
d772fed
) -
Remove duplicate & outdated code (
12bcb83
) -
Adapt scripts (
36b9234
) -
Add example (
273ff4a
) -
Simplify retrieval (#233)
-
Simplify retrieval
-
Simplify
-
Make call method
-
Add splits
-
Rmv outdated test
-
Fix name & \n
-
Add qrels
-
Add revisions
Co-authored-by: Imene Kerboua <[email protected]>
-
Add hf hub org
-
Add test
-
Add missing revision
-
Rename test
Co-authored-by: Imene Kerboua <[email protected]>
- log dres compat
Co-authored-by: Imene Kerboua <[email protected]> (c9fccbc
)
-
Fixed missing revision error on Norwegian Bitext Mining (#221)
-
Removed revision specification from Norwegian Bitext Mining task
-
Update to latest revision
Co-authored-by: Niklas Muennighoff <[email protected]> (b249c67
)
-
Remove HAGRID from french benchmark (#235)
-
add Masakhane dataset config
-
add trigram lang code for datasets that use it
-
create french script eval
-
fix French word
-
add some documentation
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
4 pair classification (#10)
-
add Opusparcus dataset
-
multilingual usage
-
use eval_split of config files
-
change eval_split according to data
Co-authored-by: Gabriel Sequeira <[email protected]>
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
Clustering with HAL S2S dataset (#11)
HAL S2S dataset creation and evaluation on clustering task.
-
adding BSARD dataset
-
add BSARD to benchmark
-
adding Hagrid dataset
-
DiaBLa and Flores Bitext Mining evaluation (#12)
-
Add DiaBLa dataset for bitext mining
-
Add DiaBLa dataset for bitext mining
-
deduplicate bitext task
-
add Flores
-
format files
-
add flores to evaluation script
-
remove prints
-
add revision
Co-authored-by: Gabriel Sequeira <[email protected]>
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
adding dataset processing for mteb
-
adding BSARD dataset
-
add BSARD to benchmark
-
adding Hagrid dataset
-
fix change on langmapping
-
reset alphabetical order
-
add revision handling
-
Clustering: Add AlloProf dataset (#17)
AlloProf dataset for clustering task
-
handling of revision
-
change split + add revision handling
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
adding dataset processing for mteb
-
adding BSARD dataset
-
add BSARD to benchmark
-
adding Hagrid dataset
-
add script to process and upload alloprof on HF
-
adding dataset processing for mteb
-
refactor a few things
-
reset alphabetical order
-
add revision handling
-
handling of revision
-
change split + add revision handling
-
use eval variable
-
alphabetic order
-
Add MLSUM dataset for clustering task (#21)
-
Use Masakhane dataset for clustering task (#23)
-
16 add datasets to readmemd (#18)
-
run task table
-
run task table
-
Add MLSUM dataset for clustering task (#21)
-
Use Masakhane dataset for clustering task (#23)
-
run task table
-
refresh readme
-
refresh readme
-
run task table
-
refresh readme
Co-authored-by: Gabriel Sequeira <[email protected]>
Co-authored-by: Marion Schaeffer <[email protected]>
- load only test split (#25)
Co-authored-by: Gabriel Sequeira <[email protected]>
- Update mteb/tasks/BitextMining/DiaBLaBitextMining.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Clustering/HALClusteringS2S.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- renaming masakhane (#28)
Co-authored-by: Gabriel Sequeira <[email protected]>
-
Syntec dataset addition (#26)
-
add script to process & load to HF
-
add script to enable download of data from HF
-
add syntec dataset files to gitignore
-
add syntecretrieval
-
add syntec retrieval
-
build dataloading script
-
remove datasets
-
correct typo
Co-authored-by: Sequeira Gabriel <[email protected]>
-
30 add syntec reranking (#31)
-
change name to specify retrieval
-
add reranking tasks
-
create script to upload dataset for reranking task
-
create reranking task
-
add reranking tasks
-
add model name in description
-
SummEval translated to french (#32)
-
7 sts (#33)
-
take into account multilingual tasks
-
add stsbenchmark multilingual dataset
-
add STS tasks
-
take into account multilingual tasks
-
add stsbenchmark multilingual dataset
-
add STS tasks
-
add comma
-
Adding sick fr dataset to sts tasks (#34)
-
Adding sick fr dataset to sts tasks
-
modifying dataset in load function to have the right column names
-
Fix alloprof dataset (#36)
-
change revision to use
-
remove duplicate data
-
change main metric because dataset is hard (#37)
-
Fix alloprof dataset (#40)
-
change revision to use
-
remove duplicate data
-
change revision
-
handle queries train test split
-
change dataset creation method
-
change revision
-
handle queries train test split
-
change dataset creation method
-
Fix DiaBLa by inheriting CrossLingual class (#42)
-
Fix DiaBLa by inheriting CrossLingual class
-
remove remaining print
-
Fix DiaBLa integration
-
Update mteb/tasks/BitextMining/FloresBitextMining.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Classification/MasakhaNEWSClassification.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
-
Update README.md
-
Update mteb/tasks/BitextMining/FloresBitextMining.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/evaluation/MTEB.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/abstasks/AbsTaskPairClassification.py
Co-authored-by: Imene Kerboua <[email protected]>
-
Update README.md
-
Update scripts/data/syntec/create_data_reranking.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/data/alloprof/create_data_reranking.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/run_mteb_french.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/run_mteb_french.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/evaluation/MTEB.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/evaluation/MTEB.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Retrieval/HagridRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Clustering/MLSUMClusteringP2P.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Clustering/MLSUMClusteringS2S.py
Co-authored-by: Niklas Muennighoff <[email protected]>
-
Update mteb/tasks/Clustering/MasakhaNEWSClusteringP2P.py
-
Update mteb/tasks/Clustering/MasakhaNEWSClusteringS2S.py
-
Update mteb/tasks/STS/SickFrSTS.py
-
Inherit OpusparcusPC init from MultilingualTask
-
remove unnecessary init
-
Remove train split from evaluation on MasakhaNEWSClassification (#52)
remove train split from evaluation
-
put script on HF dataset repos (#56)
-
put script on HF dataset repos
-
remove scripts
-
49 fix dictionary in syntecretrieval (#54)
-
add trust remote code arg
-
leave corpus as dict
-
remove trust remote code
-
add Tatoeba & BUCC BitextMining tasks (#57)
add bucc and tatoeba bitextmining tasks
-
46 add other languages to masakhanewsclusterings2s and p2p (#58)
-
add other language to clustering tasks
-
fix main score and S2S task
-
update run fr benchmark script
-
Update run_mteb_french.py
-
Update AbsTaskClustering.py
-
remove train and validation splits
-
remove Hagrid (#60)
Co-authored-by: Gabriel Sequeira <[email protected]>
Co-authored-by: Marion Schaeffer <[email protected]>
Co-authored-by: [email protected] <[email protected]>
Co-authored-by: Sequeira Gabriel <[email protected]>
Co-authored-by: Imene Kerboua <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: wissam-sib <[email protected]>
Co-authored-by: Wissam Siblini <[email protected]> (d01d053
)
-
Restore TRECCOVID import (
9f8e897
) -
Extend MTEB with French datasets (#218)
-
add Masakhane dataset config
-
add trigram lang code for datasets that use it
-
create french script eval
-
fix French word
-
add some documentation
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
4 pair classification (#10)
-
add Opusparcus dataset
-
multilingual usage
-
use eval_split of config files
-
change eval_split according to data
Co-authored-by: Gabriel Sequeira <[email protected]>
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
Clustering with HAL S2S dataset (#11)
HAL S2S dataset creation and evaluation on clustering task.
-
adding BSARD dataset
-
add BSARD to benchmark
-
adding Hagrid dataset
-
DiaBLa and Flores Bitext Mining evaluation (#12)
-
Add DiaBLa dataset for bitext mining
-
Add DiaBLa dataset for bitext mining
-
deduplicate bitext task
-
add Flores
-
format files
-
add flores to evaluation script
-
remove prints
-
add revision
Co-authored-by: Gabriel Sequeira <[email protected]>
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
adding dataset processing for mteb
-
adding BSARD dataset
-
add BSARD to benchmark
-
adding Hagrid dataset
-
fix change on langmapping
-
reset alphabetical order
-
add revision handling
-
Clustering: Add AlloProf dataset (#17)
AlloProf dataset for clustering task
-
handling of revision
-
change split + add revision handling
-
add script to process and upload alloprof on HF
-
build script for HF
-
adding dataset processing for mteb
-
refactor a few things
-
remove whitespaces
-
adding dataset processing for mteb
-
adding BSARD dataset
-
add BSARD to benchmark
-
adding Hagrid dataset
-
add script to process and upload alloprof on HF
-
adding dataset processing for mteb
-
refactor a few things
-
reset alphabetical order
-
add revision handling
-
handling of revision
-
change split + add revision handling
-
use eval variable
-
alphabetic order
-
Add MLSUM dataset for clustering task (#21)
-
Use Masakhane dataset for clustering task (#23)
-
16 add datasets to readmemd (#18)
-
run task table
-
run task table
-
Add MLSUM dataset for clustering task (#21)
-
Use Masakhane dataset for clustering task (#23)
-
run task table
-
refresh readme
-
refresh readme
-
run task table
-
refresh readme
Co-authored-by: Gabriel Sequeira <[email protected]>
Co-authored-by: Marion Schaeffer <[email protected]>
- load only test split (#25)
Co-authored-by: Gabriel Sequeira <[email protected]>
- Update mteb/tasks/BitextMining/DiaBLaBitextMining.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Clustering/HALClusteringS2S.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- renaming masakhane (#28)
Co-authored-by: Gabriel Sequeira <[email protected]>
-
Syntec dataset addition (#26)
-
add script to process & load to HF
-
add script to enable download of data from HF
-
add syntec dataset files to gitignore
-
add syntecretrieval
-
add syntec retrieval
-
build dataloading script
-
remove datasets
-
correct typo
Co-authored-by: Sequeira Gabriel <[email protected]>
-
30 add syntec reranking (#31)
-
change name to specify retrieval
-
add reranking tasks
-
create script to upload dataset for reranking task
-
create reranking task
-
add reranking tasks
-
add model name in description
-
SummEval translated to french (#32)
-
7 sts (#33)
-
take into account multilingual tasks
-
add stsbenchmark multilingual dataset
-
add STS tasks
-
take into account multilingual tasks
-
add stsbenchmark multilingual dataset
-
add STS tasks
-
add comma
-
Adding sick fr dataset to sts tasks (#34)
-
Adding sick fr dataset to sts tasks
-
modifying dataset in load function to have the right column names
-
Fix alloprof dataset (#36)
-
change revision to use
-
remove duplicate data
-
change main metric because dataset is hard (#37)
-
Fix alloprof dataset (#40)
-
change revision to use
-
remove duplicate data
-
change revision
-
handle queries train test split
-
change dataset creation method
-
change revision
-
handle queries train test split
-
change dataset creation method
-
Fix DiaBLa by inheriting CrossLingual class (#42)
-
Fix DiaBLa by inheriting CrossLingual class
-
remove remaining print
-
Fix DiaBLa integration
-
Update mteb/tasks/BitextMining/FloresBitextMining.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Classification/MasakhaNEWSClassification.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update README.md
Co-authored-by: Niklas Muennighoff <[email protected]>
-
Update README.md
-
Update mteb/tasks/BitextMining/FloresBitextMining.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/evaluation/MTEB.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/abstasks/AbsTaskPairClassification.py
Co-authored-by: Imene Kerboua <[email protected]>
-
Update README.md
-
Update scripts/data/syntec/create_data_reranking.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/data/alloprof/create_data_reranking.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/run_mteb_french.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/run_mteb_french.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/evaluation/MTEB.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/evaluation/MTEB.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Retrieval/HagridRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Clustering/MLSUMClusteringP2P.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/Clustering/MLSUMClusteringS2S.py
Co-authored-by: Niklas Muennighoff <[email protected]>
-
Update mteb/tasks/Clustering/MasakhaNEWSClusteringP2P.py
-
Update mteb/tasks/Clustering/MasakhaNEWSClusteringS2S.py
-
Update mteb/tasks/STS/SickFrSTS.py
-
Inherit OpusparcusPC init from MultilingualTask
-
remove unnecessary init
-
Remove train split from evaluation on MasakhaNEWSClassification (#52)
remove train split from evaluation
-
put script on HF dataset repos (#56)
-
put script on HF dataset repos
-
remove scripts
-
49 fix dictionary in syntecretrieval (#54)
-
add trust remote code arg
-
leave corpus as dict
-
remove trust remote code
-
add Tatoeba & BUCC BitextMining tasks (#57)
add bucc and tatoeba bitextmining tasks
-
46 add other languages to masakhanewsclusterings2s and p2p (#58)
-
add other language to clustering tasks
-
fix main score and S2S task
-
update run fr benchmark script
-
Update run_mteb_french.py
-
Update AbsTaskClustering.py
-
remove train and validation splits
Co-authored-by: Gabriel Sequeira <[email protected]>
Co-authored-by: Marion Schaeffer <[email protected]>
Co-authored-by: [email protected] <[email protected]>
Co-authored-by: Imene Kerboua <[email protected]>
Co-authored-by: mciancone <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: wissam-sib <[email protected]>
Co-authored-by: Wissam Siblini <[email protected]> (3d8b8ec
)
-
dev (
c16eddc
) -
Dev (
08c7317
) -
Add tasks for Spanish Embedding Evaluation (#227)
-
feat: add xmarket es dataset
-
refactor: use multilingual dataset
-
fix: update revision id
-
refactor: add constant for language
-
feat: add two clustering datasets
Signed-off-by: jupyterjazz <[email protected]>
- feat: import classes
Signed-off-by: jupyterjazz <[email protected]>
- refactor: flores dataset
Signed-off-by: jupyterjazz <[email protected]>
-
feat: add miracl reranking task for spanish
-
feat: use hf repo with all reranking langs
-
feat: update revision hash
-
refactor: use description for language
-
feat: add stses task
-
fix: get scores from label column
-
refactor: add revision to data loading
-
Added spanish passage retrieval
-
feat: mintaka and xpqa retrieval tasks
Signed-off-by: jupyterjazz <[email protected]>
- feat: import classes
Signed-off-by: jupyterjazz <[email protected]>
-
fix: typo in data loading
-
fix: id
Signed-off-by: jupyterjazz <[email protected]>
- refactor: try out multilingual task
Signed-off-by: jupyterjazz <[email protected]>
- refactor: multilingual task import
Signed-off-by: jupyterjazz <[email protected]>
- refactor: cmon man
Signed-off-by: jupyterjazz <[email protected]>
- refactor: go back to monolingual tasks
Signed-off-by: jupyterjazz <[email protected]>
- refactor: remove unused import
Signed-off-by: jupyterjazz <[email protected]>
- refactor: loading logic
Signed-off-by: jupyterjazz <[email protected]>
-
feat: add miracl as retrieval task
-
fix: nested corpus
-
refactor: get lang from description
-
Update mteb/tasks/Retrieval/MIRACLRetrieval.py
Co-authored-by: Michael Günther <[email protected]>
-
feat: allow multilingual reranking tasks
-
feat: make miraclreranking multilingual
-
refactor: rename miraclretrieval
Co-authored-by: Niklas Muennighoff <[email protected]>
-
style: add missing eof empty line
-
feat: make xmarket retrieval task multilingual
-
refactor: rename xmarket
-
refactor: turn spanish tasks multilingual (#11)
-
refactor: make xpqa retrieval multilingual
-
fix: formatting of xpqa dataset
-
refactor: make mintaka into multilingual task
-
refactor: make miracl retrieval multilingual
-
feat: add revision ids for hf datasets
-
refactor: remove patool
-
Update mteb/tasks/Reranking/__init__.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update mteb/tasks/STS/__init__.py
Co-authored-by: Niklas Muennighoff <[email protected]>
Signed-off-by: jupyterjazz <[email protected]>
Co-authored-by: guenthermi <[email protected]>
Co-authored-by: jupyterjazz <[email protected]>
Co-authored-by: Markus Krimmel <[email protected]>
Co-authored-by: Michael Günther <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (52d5c9f
)
-
Release: 1.1.2 (
def3c91
) -
Add task list (#228)
-
Add task list
-
Update mteb/__init__.py
-
Update README.md (
10bf6f8
) -
Update BeIRPLTask.py (#225)
-
Update BeIRPLTask.py
-
Update BeIRPLTask.py (
a8922c1
) -
Allow multiple languages (
2cc222e
) -
Add Korean Text Search Tasks to MTEB (#210)
-
add Ko-miracl, Ko-StrategyQA, Ko-mrtydi tasks
-
Update mteb/abstasks/AbsTaskRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]>
-
Update AbsTaskRetrieval.py
-
Update mteb/abstasks/AbsTaskRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]>
- Update scripts/run_mteb_korean.py
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (dadf2da
)
-
Add MultiLongDocRetrieval task to MTEB. (#224)
-
Update AbsTaskRetrieval.py.
-
Add Retrieval Task:
MultiLongDocRetrieval
-
Update AbsTaskRetrieval.py and
MLDR
task -
Update reference of MLDR (
2f65179
) -
Fix name (
2989f76
) -
only save top-k (#209)
-
Update AbsTaskRetrieval.py
-
Add json import; rename kwarg
-
Pass OF
-
Update mteb/abstasks/AbsTaskRetrieval.py
-
Update AbsTaskRetrieval.py
-
Update AbsTaskRetrieval.py
-
Update mteb/abstasks/AbsTaskRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]> (f58888d
)
-
Add tasks for German Embedding Evaluation (#214)
-
chore: solve merge conflict
-
fix: gerdalir dataset
-
fix: lang from en to de
-
chore: solve merge conflict
-
chore: add ir datasets to requirements
-
refactor: limit queries to 10k
-
refactor: update description of task with limit
-
revert style changes
-
feat: add german stsbenchmarksts task
-
feat: update revision id
-
refactor: update revision id after changes in scores
-
add XMarket dataset
-
add xmarket to init file
-
feat: add revision id
-
add paws x dataset
-
Add ir_datasets as dependency
-
add GermanDPR dataset
-
fix loading
-
Update mteb/tasks/Retrieval/GermanDPRRetrieval.py
Co-authored-by: Saba Sturua <[email protected]>
-
feat: add miracl reranking task for german
-
refactor: cleanup task
-
prevent duplicate pos docs
-
fix: use test split in MIRACL (#13)
Fixes mismatch between description and HuggingFace dataset
-
refactor: remove WikiCLIR
-
fix: double import; xmarket name
-
add German tasks to run_mteb_german script
-
update revisions and style
-
update MIRACL to work with latest version
-
revert adding ir_datasets
-
support multilingual pair classification
-
remove print statement
-
Apply suggestions from code review
Co-authored-by: Niklas Muennighoff <[email protected]>
-
fix monolingual pair classification
-
remove lang for monolingual tasks
Co-authored-by: Isabelle Mohr <[email protected]>
Co-authored-by: Markus Krimmel <[email protected]>
Co-authored-by: Saba Sturua <[email protected]>
Co-authored-by: Markus Krimmel <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (9aba9ee
)
-
Simplify (
1cd07db
) -
Refer to other works (
8f28bcb
) -
Update mteb/tasks/Retrieval/GermanQuADRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]> (09a9cb0
)
-
clean up (
51c40fd
) -
WIP: implement requested changes (
58baad2
) -
remove code for writing JSONL dataset (
d23eac3
) -
add docstring, remove local qrels (
af7ee50
) -
fix query id in qrel dataset, ready to merge (
33c9dd4
) -
WIP: use HF dataset instead of local JSONL (
db3fea1
) -
rename BeIRDETask (
e56cf86
) -
Update scripts/run_mteb_german.py
Co-authored-by: Niklas Muennighoff <[email protected]> (4b18a7e
)
- Update mteb/tasks/Retrieval/GermanRetrieval.py
Co-authored-by: Niklas Muennighoff <[email protected]> (3fef61a
)
-
add reference to GermanQuAD (
ae268e0
) -
fix results folder path (
dc7fc01
) -
copy from local (
9c0880d
) -
Update mteb/abstasks/AbsTaskRetrieval.py (
be1fcc1
) -
Pass OF (
b0e6316
) -
Add json import; rename kwarg (
d39c21c
) -
Update AbsTaskRetrieval.py (
4eb8e02
) -
Added Norwegian Bokmål-Nynorsk bitext mining task (
c3fb742
) -
Add STS revisions (
38277ae
) -
Add RTR revisions (
8da9487
) -
Add RRK revisions (
2011cd8
) -
Add PCLF revisions (
9b6f4b9
) -
Add CLST revisions (
da73236
) -
Add CLF revisions (
fd91a9c
) -
Update Revision (
6b0fae5
) -
Fix SweFAQ linkage (
2341c48
) -
Fix SummEval linkage (
7252322
) -
Fix Dalaj linkage (
fb9ccd8
) -
Fix medrxiv mislinkage (
620defc
) -
Fix stripping (
02e84b2
) -
add datasets for long document evaluation
Co-authored-by: Isabelle Mohr <[email protected]> (88beb46
)
-
Do not enforce rich import (
aa11fe7
) -
fix RerankingEvaluator's compute_metrics_individual (
fd7bfac
) -
Fix SummEval import (
859d38e
) -
Increment version (
4d75ddf
) -
Release: 1.1.1 (
d3aaf4f
) -
Merge branch 'main' into fixconversion (
d292258
) -
Fix eval_lang (
7836148
) -
Simplify code snippets (
d434f52
) -
Simplify wording (
3adb0b5
) -
Clarify multi-gpu usage (
5a2da23
) -
Fix splits (
93f6f85
) -
Improve Cust Model explanation (
52c1fd8
) -
Add bs to Clustering test (
4df0d2e
) -
Rely on auto-conversion to tensor in score function (
d8512f7
) -
Rely on standard encode kwargs only (
4c1660e
) -
Improve Cust Model explanation (
23d758f
) -
Add bs to Clustering test (
6e0c0d2
) -
Rely on auto-conversion to tensor in score function (
7ec4c57
) -
Rely on standard encode kwargs only (
2fad0f9
) -
Update README.md (
d9aa70f
) -
Update README.md (
2211f83
) -
Simplify assertion (
f7fcbc1
) -
Default to false (
d64f6c7
) -
Add multi gpu eval to readme (#140)
update readme (1b1c9d3
)
-
Support Multi-node Evaluation (#132)
-
styling
-
USE_HF_DATASETS
-
Support DRPES
-
we use beir.datasets.data_loader_hf in case of non dist
-
distributed fixes
-
update run command
-
cleanup
-
.
-
sugg
-
ruff (
0dd82a9
) -
Add Chinese tasks (C-MTEB) (#134)
-
add C_MTEB
-
add C_MTEB
-
rename MMarcoReranking
-
rename MMarcoReranking
-
Update mteb/tasks/Retrieval/CMTEBRetrieval.py
-
Update README.md
-
Allow custom encode functions
Co-authored-by: shitao <[email protected]>
Co-authored-by: Nouamane Tazi <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (071974a
)
-
Add Polish tasks (PL-MTEB) (#137)
-
Add Polish tasks (PL-MTEB)
-
Add Polish datasets to README
-
Add newline
Co-authored-by: rposwiata <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (2779344
)
-
Add BEIR-PL datasets to MTEB (#121)
-
Add BEIR-PL benchmark
-
Update README with BEIR-PL datasets
-
Update names
-
Add tasks to init to be visible during evaluation
Co-authored-by: Konrad Wojtasik <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (5972c02
)
-
Replaced prints with logging (#133)
-
Make sure that main score is added to bitext mining tasks
-
Added scandinavian languages: da, no, sv
-
merge upstream main
-
fix: Replaced prints with logging statements
-
chore: removed accidental commits (
d7ca378
) -
add logging (
6412a6a
) -
Merge pull request #131 from embeddings-benchmark/nouamane/quick-fixes
Code cleanup (4fb97d0
)
-
. (
3ebb039
) -
add eval_splits arg (
c407c4b
) -
quick fixes (
6c5a3fa
) -
clean MTEB tasks (
b276f1d
) -
clean args (
9365755
) -
styling (
dd02b48
) -
black (
652d07c
) -
Set dev version (
bf98c2c
) -
Release: 1.1.0 (
80d0344
) -
Bump version ID and update PyPI (#128)
Bump version ID and update PyPI after adding additional tasks. (4a4b54b
)
-
Fix typo (
33a3140
) -
Sort imports (
ab2eef8
) -
Sort imports (
3432374
) -
Raise error first (
0b1bfd2
) -
Added support for Scandinavian Languages (#124)
-
Make sure that main score is added to bitext mining tasks
-
Added scandinavian languages: da, no, sv
-
Updated readme with scandinavian tasks
-
Changes n samples for the nordic lang CLF
-
Added scandinavian models to init
-
Added error logs to gitignore
-
fix import error
-
fix dataset columns
-
rename dataset columns
-
remove swefaq
-
fix: Added functionality to raise error
-
fix: Updated names
-
fix: Removed no as a language
-
Added missing data transformation
-
Fix spelling error (
acb0f59
) -
Install beir (
c50b8ab
) -
Update README.md (
29ffedf
) -
ruff (
6a58b5d
) -
Update README.md (
5825536
) -
fix revision hash for TenKGnadClusteringP2P dataset
Co-authored-by: Niklas Muennighoff <[email protected]> (eb622f8
)
- change dataset order for BlurbsClustering in README
Co-authored-by: Niklas Muennighoff <[email protected]> (f6e49ba
)
- change dataset order for TenKGnadClustering in README
Co-authored-by: Niklas Muennighoff <[email protected]> (2a2c47f
)
-
fix descriptions for German clustering datasets (
30a966c
) -
add German clustering tasks to README (
62457e3
) -
update reference & category for TenKGnad datasets (
2174a47
) -
add German clustering tasks (
ab469be
) -
Allow abs path (
b56528c
) -
Add @property annotation to description method of AbsTask (
98b0443
) -
fix typo (
37a986b
) -
fix extend lang pairs (
865dffc
) -
Fix clustering eval, black, isort (
bc43665
) -
Add 'auto' to sklearn clustering, add test, fix warning (
15ce352
) -
Update MSMARCORetrieval.py (
d913f56
) -
Revert to old split (
1f3ff6e
) -
Add wheel instruction (
62fad9b
) -
Dev version (
d988e48
) -
Release: 1.0.2 (
e189bae
) -
Add comment
Co-authored-by: Nouamane Tazi <[email protected]> (3e72ee8
)
-
Fix naming (
33f2db9
) -
Cleaner logging & tqdm usage (
542d871
) -
Add kwargs (
e0b801d
) -
Produce embeddings in one go (
e88bcf2
) -
Fix naming (
6c62f18
) -
Make inputs always List[str] & call in one (
bdeeedf
) -
Fix SummEval description (
0c2b1be
) -
fix SummEval description
Unless I'm missing something, I think the SummEval description is incorrect---the dataset consists of summaries of news articles, not biomedical abstracts. (1ccc068
)
-
Clarify script for running all of MTEB English (
9f72434
) -
Update run_mteb_english.py (
6ff57d3
) -
Update run_mteb_english.py (
7803eea
) -
Point to English benchmarking script (
57f3371
) -
Example script for benchmarking all of MTEB English (
77e6b22
) -
Clarify MSMARCO split (
bbeada8
) -
Allow re-merging (
b0ce501
) -
Set dataset name; Sort imports (
2a5a661
) -
Standardize CQA merging script (
5d5a2fb
) -
Update merge_cqadupstack.py (
b0304c1
) -
Update README.md (
8c60c22
) -
Update README.md (
6255449
) -
Remove validation split (
875a98e
) -
Remove validation set (
b3f9585
) -
Update ClassificationEvaluator.py (
93b89b6
) -
Set dev version (
8a0d6b1
) -
Release: 1.0.1 (
b9f423b
) -
Delete mteb_diagram.png (
76dc363
) -
Deactivate beir (
b263157
) -
Update BeIRTask.py (
37b7b79
) -
Remove validation (
6922840
) -
Fix typo (
7247233
) -
Add files via upload (
9d2bb67
) -
Increment version & use abslink (
a792a65
) -
Release: 1.0.0 (
9c544a4
) -
Add paper (
b73457a
) -
Fix formatting (
c523d16
) -
print -> logging (
4f3a559
) -
Do not ignore data scripts (
891b455
) -
Reorganize scripts (
e157bb0
) -
Add release instructions & dev suffix to version (
164b9ae
) -
Release: 0.9.1 (
5c438cc
)