1.2.0 Spanish & French, Simpler Retrieval
Updates
- 🇪🇸 New Spanish datasets thanks to @violenil & team
- 🇫🇷 New French datasets thanks to @GabrielSequeira & team + there's a new French Overall leaderboard tab thanks to their massive benchmarking
- Retrieval is now much simpler and standardized to align with other tasks. You can inspect all Retrieval datasets on the hub, adding new Retrieval datasets is much easier, and there are fewer dependencies, which makes installing MTEB easier. Although this change is backward-compatible, it significantly changes how MTEB works, so we incremented the minor version for this release (1.1.2 -> 1.2.0).
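
For readers unfamiliar with the versioning convention mentioned above, here is a minimal sketch of what a minor-version increment means for a `major.minor.patch` string. The helper name `bump_minor` is hypothetical and is not part of MTEB; it only illustrates how 1.1.2 becomes 1.2.0.

```python
def bump_minor(version: str) -> str:
    """Return the next minor version, resetting the patch number to 0.

    Hypothetical helper for illustration only; not part of MTEB.
    Assumes a plain "major.minor.patch" string with integer parts.
    """
    major, minor, _patch = (int(part) for part in version.split("."))
    return f"{major}.{minor + 1}.0"


# The bump described in this release: 1.1.2 -> 1.2.0
print(bump_minor("1.1.2"))
```

A minor bump keeps the major number unchanged, which matches the note that the retrieval change is backward-compatible.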
What's Changed
- Add tasks for Spanish Embedding Evaluation by @violenil in #227
- Extend MTEB with French datasets by @GabrielSequeira in #218
- Remove HAGRID from french benchmark by @MathieuCiancone in #235
- Fixed missing revision error on Norwegian Bitext Mining by @x-tabdeveloping in #221
- Simplify retrieval by @Muennighoff in #233
New Contributors
- @GabrielSequeira made their first contribution in #218
- @MathieuCiancone made their first contribution in #235
Full Changelog: 1.1.2...1.2.0