From f56e3956429e07e33ab39001eb60f64fab77bf8e Mon Sep 17 00:00:00 2001 From: gmftbyGMFTBY Date: Fri, 23 Feb 2024 19:06:41 +0800 Subject: [PATCH] update notes about overall scores in objective evaluation --- docs/leaderboard_objective.html | 1 + 1 file changed, 1 insertion(+) diff --git a/docs/leaderboard_objective.html b/docs/leaderboard_objective.html index 2c3b351..ef70ddc 100644 --- a/docs/leaderboard_objective.html +++ b/docs/leaderboard_objective.html @@ -229,6 +229,7 @@

📝 Notes

  1. Models labeled with 🌍 are API-Based models, while others are open-sourced.
  2. Some models are not optimized for correction and comparison critique dimensions, like Auto-J-13B and UltraCM-13B. Their scores are not recorded, and the overall scores are the average of other dimensions.
  3. +
  4. The details about how to compute the overall scores can be found in Section 4.3 in our paper.