add claude3 benchmarks (#11685)
* cr

* lint

---------

Co-authored-by: Andrei Fajardo <[email protected]>
jerryjliu and nerdai authored Mar 9, 2024
1 parent 890377e commit bee508b
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions docs/module_guides/models/llms.md
@@ -94,6 +94,8 @@ If you have ways to improve the setup for existing notebooks, contributions to c
| [gpt-3.5-turbo](https://colab.research.google.com/drive/1vvdcf7VYNQA67NOxBHCyQvgb2Pu7iY_5?usp=sharing) (openai) ||||||| |
| [gpt-3.5-turbo-instruct](https://colab.research.google.com/drive/1Ne-VmMNYGOKUeECvkjurdKqMDpfqJQHE?usp=sharing) (openai) |||||| ⚠️ | Tool usage in data-agents seems flakey. |
| [gpt-4](https://colab.research.google.com/drive/1QUNyCVt8q5G32XHNztGw4YJ2EmEkeUe8?usp=sharing) (openai) ||||||| |
| [claude-3 opus](https://colab.research.google.com/drive/1xeFgAmSLpY_9w7bcGPvIcE8UuFSI3xjF?usp=sharing) || ⚠️ ||||| |
| [claude-3 sonnet](https://colab.research.google.com/drive/1xeFgAmSLpY_9w7bcGPvIcE8UuFSI3xjF?usp=sharing) |||||| ⚠️ | Prone to hallucinating tool inputs. |
| [claude-2](https://colab.research.google.com/drive/1IuHRN67MYOaLx2_AgJ9gWVtlK7bIvS1f?usp=sharing) (anthropic) |||||| ⚠️ | Prone to hallucinating tool inputs. |
| [claude-instant-1.2](https://colab.research.google.com/drive/1ahq-2kXwCVCA_3xyC5UMWHyfAcjoG8Gp?usp=sharing) (anthropic) |||||| ⚠️ | Prone to hallucinating tool inputs. |
