feat: `ChatGPTGenerator` #5692

ZanSara · 2023-08-30T15:47:51Z

Related Issues

related to OpenAI LLM Generators for Haystack 2.0 #5623
based on feat: generators (2.0) #5690

Proposed Changes:

Add ChatGPTGenerator according to the LLM Proposal
Add unit tests
Add end to end tests

How did you test it?

Local tests run
CI
e2e tests

Notes for the reviewer

n/a

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

…to generators-module

vblagoje

On the first pass I tried to find obvious and easy-to-find errors. Looks solid; left some early feedback.

haystack/preview/components/generators/openai/_helpers.py

haystack/preview/components/generators/openai/chatgpt.py

ZanSara · 2023-08-31T14:02:45Z

haystack/preview/components/generators/openai/chatgpt.py

+        headers = {"Authorization": f"Bearer {api_key}", "Content-Type": "application/json"}
+        if openai_organization:
+            headers["OpenAI-Organization"] = openai_organization
+        url = f"{api_base_url}/chat/completions"


@vblagoje We're using the chat completion endpoint for ChatGPT: https://platform.openai.com/docs/api-reference/chat/create

…to chatgpt-generator

coveralls · 2023-08-31T15:48:04Z

Pull Request Test Coverage Report for Build 6039895083

0 of 0 changed or added relevant lines in 0 files are covered.
3 unchanged lines in 1 file lost coverage.
Overall coverage increased (+0.3%) to 48.903%

Files with Coverage Reduction	New Missed Lines	%
preview/components/generators/openai/_helpers.py	3	96.59%

Totals
Change from base Build 6039423187:	0.3%
Covered Lines:	11749
Relevant Lines:	24025

💛 - Coveralls

dfokina · 2023-09-04T10:14:15Z

haystack/preview/components/generators/openai/chatgpt.py

+        :param top_p: An alternative to sampling with temperature, called nucleus sampling, where the model
+            considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens
+            comprising the top 10% probability mass are considered.
+        :param n: How many completions to generate for each prompt.


Can we elaborate on what this means?

dfokina · 2023-09-04T10:15:48Z

haystack/preview/components/generators/openai/chatgpt.py

+            considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens
+            comprising the top 10% probability mass are considered.
+        :param n: How many completions to generate for each prompt.
+        :param stop: One or more sequences where the API will stop generating further tokens.


Does this define some specific phrase that tells the API to stop generating tokens?

It's the contrary: it's a keyword that tells us that we can discard all the LLM output that comes after it. It used to be quite important as ChatGPT would keep generating after the stopword if not instructed correctly. I'm not sure it's still an issue, but the parameter is still present.

I changed the wording to try make it more clear, it was quite ambiguous indeed

dfokina · 2023-09-04T10:17:27Z

haystack/preview/components/generators/openai/chatgpt.py

+        :param api_base_url: The OpenAI API Base url, defaults to `https://api.openai.com/v1`.
+        :param openai_organization: The OpenAI organization ID.
+
+        See OpenAI documentation](https://platform.openai.com/docs/api-reference/chat) for more details.


Suggested change

See OpenAI documentation](https://platform.openai.com/docs/api-reference/chat) for more details.

See OpenAI [documentation](https://platform.openai.com/docs/api-reference/chat) for more details.

ZanSara · 2023-09-04T13:19:15Z

This PR has become really big, so I'll split it into two smaller ones. I'll make sure to mark you two as reviewers of those PRs as well.

ZanSara added 5 commits August 30, 2023 16:06

add generators module

0fc2bac

add tests for module helper

7f6325c

add chatgpt generator

47b6799

add init and serialization tests

4e8fcb3

test component

cbf7701

github-actions bot added the topic:tests label Aug 30, 2023

ZanSara changed the base branch from main to generators-module August 30, 2023 15:48

github-actions bot added the type:documentation Improvements on the docs label Aug 30, 2023

ZanSara added 4 commits August 30, 2023 17:49

reno

419f615

Merge branch 'main' into generators-module

49ff654

Merge branch 'generators-module' into chatgpt-generator

4edeb8e

reno

08e9c62

ZanSara mentioned this pull request Aug 30, 2023

feat: generators (2.0) #5690

Merged

more tests

a984e67

ZanSara mentioned this pull request Aug 30, 2023

feat: GPT4Generator #5694

Closed

ZanSara marked this pull request as ready for review August 30, 2023 18:38

ZanSara requested review from a team as code owners August 30, 2023 18:38

ZanSara requested review from dfokina and vblagoje and removed request for a team August 30, 2023 18:38

ZanSara marked this pull request as draft August 30, 2023 18:39

ZanSara added 7 commits August 31, 2023 10:45

add another test

612876a

Merge branch 'generators-module' of github.com:deepset-ai/haystack in…

ec8e14a

…to generators-module

Merge branch 'generators-module' into chatgpt-generator

366b0ff

chat token limit

e9c3de7

move into openai

725fabe

Merge branch 'generators-module' into chatgpt-generator

4d4f9d4

fix test

c3bef8f

ZanSara added 4 commits August 31, 2023 12:16

improve tests

c1a7696

Merge branch 'generators-module' into chatgpt-generator

246ca63

add e2e test and small fixes

ec809e4

linting

5d946f8

ZanSara marked this pull request as ready for review August 31, 2023 11:02

ZanSara added the 2.x Related to Haystack v2.0 label Aug 31, 2023

Add ChatGPTGenerator example

aa9ce33

vblagoje requested changes Aug 31, 2023

View reviewed changes

ZanSara commented Aug 31, 2023

View reviewed changes

ZanSara requested a review from vblagoje August 31, 2023 14:06

ZanSara added 3 commits August 31, 2023 16:15

review feedback

9310057

Merge branch 'chatgpt-generator' of github.com:deepset-ai/haystack in…

7c36db1

…to chatgpt-generator

support for metadata

b2e421d

Base automatically changed from generators-module to main August 31, 2023 15:33

Merge branch 'main' into chatgpt-generator

6d81d79

ZanSara added 8 commits August 31, 2023 18:19

mypy

2895697

mypy

1538d61

extract backend from generator and make it accept chats

02cd61f

fix tests

84332c6

mypy

329b54d

query->complete

5ee2aac

mypy

429a3ae

Merge branch 'main' into chatgpt-generator

c0b237d

dfokina reviewed Sep 4, 2023

View reviewed changes

ZanSara closed this Sep 4, 2023

masci deleted the chatgpt-generator branch December 5, 2023 08:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: `ChatGPTGenerator` #5692

feat: `ChatGPTGenerator` #5692

ZanSara commented Aug 30, 2023 •

edited

Loading

vblagoje left a comment •

edited

Loading

ZanSara Aug 31, 2023 •

edited

Loading

coveralls commented Aug 31, 2023 •

edited

Loading

dfokina Sep 4, 2023

dfokina Sep 4, 2023

ZanSara Sep 4, 2023

ZanSara Sep 4, 2023

dfokina Sep 4, 2023

ZanSara commented Sep 4, 2023

	See OpenAI documentation](https://platform.openai.com/docs/api-reference/chat) for more details.
	See OpenAI [documentation](https://platform.openai.com/docs/api-reference/chat) for more details.

feat: ChatGPTGenerator #5692

feat: ChatGPTGenerator #5692

Conversation

ZanSara commented Aug 30, 2023 • edited Loading

Related Issues

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

vblagoje left a comment • edited Loading

Choose a reason for hiding this comment

ZanSara Aug 31, 2023 • edited Loading

Choose a reason for hiding this comment

coveralls commented Aug 31, 2023 • edited Loading

Pull Request Test Coverage Report for Build 6039895083

💛 - Coveralls

dfokina Sep 4, 2023

Choose a reason for hiding this comment

dfokina Sep 4, 2023

Choose a reason for hiding this comment

ZanSara Sep 4, 2023

Choose a reason for hiding this comment

ZanSara Sep 4, 2023

Choose a reason for hiding this comment

dfokina Sep 4, 2023

Choose a reason for hiding this comment

ZanSara commented Sep 4, 2023

feat: `ChatGPTGenerator` #5692

feat: `ChatGPTGenerator` #5692

ZanSara commented Aug 30, 2023 •

edited

Loading

vblagoje left a comment •

edited

Loading

ZanSara Aug 31, 2023 •

edited

Loading

coveralls commented Aug 31, 2023 •

edited

Loading