⚙️ Fix Integration Test for TGI #124

baptistecolle · 2024-12-02T15:04:50Z

What does this PR do?

Fixes broken TGI integration tests and adds Gemma model test

Changes

Fixed broken GPT-2 integration test suite
Added integration tests for Gemma model

Notes

This PR is currently not integrated with CI due to some issue with Docker-in-Docker (DinD) on the GitHub runners

tengomucho

Nice, few nits, but overall it seems fine to me.
For GPT2, I wonder if it would be useful to pass the JETSTREAM_PT_DISABLED=1 env var, otherwise it will try to use jetstream first, and go to torch xla as fallback.

tengomucho · 2024-12-02T16:55:46Z

text-generation-inference/integration-tests/conftest.py


 class LauncherHandle:
    def __init__(self, port: int):
-        self.client = AsyncClient(f"http://localhost:{port}")
+        self.client = AsyncClient(f"http://localhost:{port}", timeout=600)
+        self.logger = logging.getLogger(self.__class__.__name__)


note we use loguru elsewhere for logging

Refactored with loguru

tengomucho · 2024-12-02T17:01:14Z

text-generation-inference/integration-tests/conftest.py

+            "LOG_LEVEL": "info,text_generation_router,text_generation_launcher=debug",
+            "MAX_BATCH_SIZE": "4",
+            "HF_HUB_ENABLE_HF_TRANSFER": "0",
+            "JETSTREAM_PT": "1",


you can remove this one now

text-generation-inference/integration-tests/test_gemma.py

tengomucho · 2024-12-02T17:07:02Z

text-generation-inference/integration-tests/test_gemma.py

+import Levenshtein
+import pytest
+
+MODEL_ID = "google/gemma-2b-it"


wouldn't it be possible to have a single test file with different parameters per model instead of having a new file?

fix(tests): fix broken GPT2 integration test

bdd422d

baptistecolle marked this pull request as ready for review December 2, 2024 15:32

baptistecolle requested a review from tengomucho December 2, 2024 15:32

tengomucho reviewed Dec 2, 2024

View reviewed changes

feat(tests): add Gemma integration test

9e63ff2

baptistecolle force-pushed the fix-integration-test-tgi-without-ci branch from 7dc97f2 to 9e63ff2 Compare December 2, 2024 18:30

refactor(logging): migrate to Loguru

c822676

baptistecolle force-pushed the fix-integration-test-tgi-without-ci branch 2 times, most recently from a7243e8 to c5eaa0a Compare December 4, 2024 10:02

fix(tests): fix broken connection to docker container

1af9edc

baptistecolle force-pushed the fix-integration-test-tgi-without-ci branch from c5eaa0a to 1af9edc Compare December 4, 2024 10:03

refractor(tests): make run arguments to model config

3a5a7f0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

⚙️ Fix Integration Test for TGI #124

⚙️ Fix Integration Test for TGI #124

baptistecolle commented Dec 2, 2024 •

edited

Loading

tengomucho left a comment

tengomucho Dec 2, 2024

baptistecolle Dec 4, 2024

tengomucho Dec 2, 2024

tengomucho Dec 2, 2024

baptistecolle Dec 4, 2024

⚙️ Fix Integration Test for TGI #124

Are you sure you want to change the base?

⚙️ Fix Integration Test for TGI #124

Conversation

baptistecolle commented Dec 2, 2024 • edited Loading

What does this PR do?

Changes

Notes

tengomucho left a comment

Choose a reason for hiding this comment

tengomucho Dec 2, 2024

Choose a reason for hiding this comment

baptistecolle Dec 4, 2024

Choose a reason for hiding this comment

tengomucho Dec 2, 2024

Choose a reason for hiding this comment

tengomucho Dec 2, 2024

Choose a reason for hiding this comment

baptistecolle Dec 4, 2024

Choose a reason for hiding this comment

baptistecolle commented Dec 2, 2024 •

edited

Loading