Jack47/awesome-openai-o1-related

Awesome OpenAI o1 related Papers, Projects

An index of papers, articles, and other material related to OpenAI o1.

Papers

O1 Replication Journey: A Strategic Progress Report

Dataset

  1. MuSR: Testing the Limits of Chain-of-Thought with MultiStep Soft Reasoning. Why it's listed: the author found that on the MuSR dataset, o1 performs worse than Claude-3.5 Sonnet with CoT.
  2. peiyi9979/Math-Shepherd: the dataset used to train the PRM in Math-Shepherd.
  3. PRM800K: A Process Supervision Dataset: the dataset from Let's Verify Step by Step.

Source code

  1. rStar: Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
  2. Train PRM with hard estimation in OpenRLHF: essentially a framework implementation for reproducing Math-Shepherd.
  3. Open O1: A Model Matching Proprietary Power with Open-Source Innovation
  4. LLM Reasoners: includes a visualization tool and notebooks for stepping through its examples.

Other similar aggregated resources:

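  1. Awesome-LLM-Strawberry. Why it's listed: its author is also the author of OpenRLHF.
  2. Self Correction LLM Papers. Why it's listed: it systematically covers Training-Time Correction, Generation-Time Correction, and Post-hoc Correction, and the repo was already created a year ago.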
