Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
1 parent bd3b4d2, commit 9202d36
Showing 10 changed files with 45 additions and 36 deletions.
25 changes: 0 additions & 25 deletions
_posts/papers-of-the-month/2024-09/2024-09-27-title-tbd.md
This file was deleted.
34 changes: 34 additions & 0 deletions
_posts/papers-of-the-month/2024-09/2024-09-30-proper-conditioning.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,34 @@ | ||
--- | ||
title: "September Papers: Proper Conditioning" | ||
header: | ||
teaser: /assets/images/posts/2024-09/potm/twitter_card.png | ||
image: /assets/images/posts/2024-09/potm/twitter_card.png | ||
og_image: /assets/images/posts/2024-09/potm/twitter_card.png | ||
|
||
date: 2024-09-30T01:00:00-00:00 | ||
potm_year: 2024 | ||
potm_month: 9 | ||
|
||
layout: paper-summaries-layout | ||
category: "papers-of-the-month" | ||
toc: true | ||
toc_sticky: true | ||
toc_label: "Papers" | ||
toc_icon: "book" | ||
author.twitter: "GCResearchTeam" | ||
--- | ||
|
||
We're pleased to share four papers from different domains: LLM self-correction, FP8 training, generative crystals and optimisation. They are united, somewhat tenuously, by the importance of _proper conditioning_: | ||
|
||
1. DeepMind researchers explain how _conditioning on the wrong distribution_ during supervised fine-tuning for self-correction is harmful but can be overcome using RL. | ||
2. A novel Smooth-SwiGLU activation _"conditions" the numerics_ by inserting a scaling factor in just the right place, preventing late-training instability in FP8. | ||
3. The GenMS architecture generates crystal structures for materials, _conditioning on high-level textual and low-level structural information_ for high-quality generation. | ||
4. SOAP is an evolution of Shampoo, with conditioners in the name and _preconditioners forming the eigenbasis_ for optimisation. | ||
|
||
You can be the judge of how tenuous the connection is, but I'd encourage you to check out the summaries first, or in spite of it. | ||
|
||
_I hope you enjoy these as much as we did. Tell us we're wrong; tell us we're right [@GCResearchTeam](https://x.com/GCResearchTeam)._ | ||
|
||
--- | ||
|
||
{% include paper-summaries.md %} |
File renamed without changes
File renamed without changes
File renamed without changes