Merge pull request #6 from arthbohra/ab/agent-arena

prettier on extended arena
lmarena · Oct 9, 2024 · 89e0c14 · 89e0c14
2 parents 640cf64 + 78505bf
commit 89e0c14
Show file tree

Hide file tree

Showing 2 changed files with 3 additions and 3 deletions.
diff --git a/_posts/2024-09-30-extended-arena.md b/_posts/2024-09-30-extended-arena.md
@@ -64,7 +64,7 @@ arrive.
 The Extended Online Arena Score amounts to running online logistic regression on the same feature set.
 The algorithm is as follows:
 
-$$ \theta^{(t+1)} = \theta^{(t)} - \eta \nabla \ell(\sigma(X_t^\top \theta^{(t)}), Y_t) - \lambda \nabla \|\theta^{(t)}\|_p,$$
+$$ \theta^{(t+1)} = \theta^{(t)} - \eta \nabla \ell(\sigma(X_t^\top \theta^{(t)}), Y_t) - \lambda \nabla \|\theta^{(t)}\|\_p,$$
 
 where $$\eta > 0$$ is the learning rate, and $$\nabla \|\cdot\|_p$$ is any valid subgradient of the $$\ell_p$$ norm.
 The benefit, and drawback, of the online score is that it never converges.