1.1.0 New languages, default cluster setting & default error raising

@Muennighoff released this 31 Jul 09:21
· 1735 commits to main since this release

Updates

  • 🇩🇰🇳🇴🇸🇪 New Danish, Norwegian and Swedish BitextMining & Classification tasks: AngryTweetsClassification, BornholmBitextMining, DKHateClassification, DalajClassification, LccSentimentClassification, NordicLangClassification, NorwegianParliament, ScalaDaClassification, ScalaNbClassification & ScalaSvClassification, thanks to @KennethEnevoldsen
  • 🇩🇪 New German Clustering tasks: BlurbsClusteringP2P, BlurbsClusteringS2S, TenKGnadClusteringP2P & TenKGnadClusteringS2S, thanks to @slvnwhrl
  • ❉ The cluster initialization setting (n_init in sklearn's k-means) changes from 3 to the sklearn-recommended default of auto. This leads to small changes in clustering scores going forward, so this release is not backwards-compatible. See here for a discussion. Thanks to @stephantul for this change.
  • ❌ Errors are now raised directly by default. Previously, they were only written to a .txt file. The new behavior can be deactivated by passing a kwarg at evaluation. Thanks to @KennethEnevoldsen for introducing this change.
  • 💻 Code cleanups thanks to @stephantul, @izhx and @permutohedra
  • 📈 The leaderboard has also improved a lot, with new task-based rankings, better caching and many new models
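
The change in error handling described above can be illustrated with a minimal, self-contained sketch. This is not the library's code: the function run_tasks, the keyword raise_error and the file name errors.txt are all hypothetical names chosen for illustration, since the notes only say that "a kwarg" switches the behavior off. The sketch raises on the first failing task by default and, when the flag is turned off, appends the traceback to a text file and continues.

```python
import os
import tempfile
import traceback


def run_tasks(tasks, raise_error=True, error_log="errors.txt"):
    """Run (name, callable) pairs and return {name: result}.

    Hypothetical helper, not the library's API.
    raise_error=True  -> re-raise the first exception (new default).
    raise_error=False -> append the traceback to error_log and continue
                         (the previous behavior).
    """
    results = {}
    for name, task in tasks:
        try:
            results[name] = task()
        except Exception:
            if raise_error:
                raise
            # Old-style handling: record the failure and move on.
            with open(error_log, "a", encoding="utf-8") as fh:
                fh.write(f"{name}\n{traceback.format_exc()}\n")
    return results


def ok_task():
    return 1.0


def failing_task():
    raise ValueError("broken task")


# Illustrative task list: one task succeeds, one fails.
TASKS = [("ok", ok_task), ("broken", failing_task)]

if __name__ == "__main__":
    log_path = os.path.join(tempfile.mkdtemp(), "errors.txt")
    # Opt out of raising: the failure is logged and the run completes.
    print(run_tasks(TASKS, raise_error=False, error_log=log_path))
    # Default: the ValueError from the failing task propagates.
    try:
        run_tasks(TASKS)
    except ValueError as exc:
        print("raised:", exc)
```

Raising by default surfaces a broken task immediately instead of leaving it in a log file that may go unread. Opting out is mainly useful for long runs over many tasks where partial results are still wanted.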

Full Changelog: 1.0.1...1.1.0