Goodput initial implementation #32

AndyDai-nv · 2024-08-07T18:48:20Z

Migrate goodput dev branch to this new repo.

dyastremsky

Great start, Andy! Left a few comments. If you have any questions, let me know.

genai-perf/docs/goodput_tutorial.md

genai-perf/genai_perf/metrics/llm_metrics.py

genai-perf/genai_perf/metrics/statistics.py

genai-perf/genai_perf/parser.py

genai-perf/tests/test_llm_metrics.py

dyastremsky

Excellent work, Andy! This POC looks good. I added some small comments as I reviewed to update once the core development is done.

What are the next steps? We can also chat offline. I think it'd be great to get unit testing up for this.

Once you finish doing this work for LLMs, we can look at creating a class for non-LLMs (specifically embeddings/rankings). With the way you coded this, I don't see anything special about the llm_goodput_reporter that couldn't be used for non-LLM models. You may want to try using it with embeddings/rankings and seeing if it works.

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

genai-perf/genai_perf/parser.py

genai-perf/docs/goodput_tutorial.md

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

genai-perf/genai_perf/goodput_reporter/goodput_reporter.py

nv-hwoo · 2024-08-09T17:07:57Z

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

+                   for val, slo in zip(request_metric_values, target_metric_values)
+            ):
+                good_req_count += 1
+        self._good_req_count = good_req_count


I think we should avoid setting return values that will be used later as an attribute within the method. This is very confusing from the reader's perspective because the "count" is hiding inside the method, and it's being used under compute_goodput.

Let's return the counts

def count_good_reqs(self) -> int: ... return good_req_count

and same goes for other methods as well.

This is what is called a side effect.
In general, we want to avoid those if they are not necessary.
I agree with @nv-hwoo here.

nv-hwoo · 2024-08-09T17:11:03Z

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

+            if all(val < slo 
+                   for val, slo in zip(request_metric_values, target_metric_values)
+            ):


Could we unroll this? It hurts readability and I don't think there's much gain in fitting this all into the if conditional block.

+1. Readability and Maintainability are the top priority. We optimize only if we find a bottleneck impacting performance.

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

debermudez

Lots of good work in this PR.
Lets some comments but great job.

debermudez · 2024-08-09T20:23:39Z

genai-perf/genai_perf/parser.py

+    Parse and check goodput args
+    """
+    '''
+    if args.goodput:


This comment is functionally a duplication of the code below.
Comments should steer towards information that is not in the code base. The code should be readable enough alongside a meaningful method name, to avoid this level of detail.
It also has the downside of being able to quickly grow stale and become misleading.

debermudez · 2024-08-09T20:28:06Z

genai-perf/genai_perf/parser.py

@@ -733,6 +778,7 @@ def _parse_profile_args(subparsers) -> argparse.ArgumentParser:
    _add_profile_args(profile)
    _add_output_args(profile)
    _add_other_args(profile)
+    _add_goodput_args(profile)


do we foresee more arguments that are going to fall under the goodput general topic?
I am unclear on whether we need to add a new group for this one argument.

Maybe more options for users to see exactly what metrics contribute most to the low goodput? Something like this. Knowing a goodput number might not be enough information, right?
Just my random thoughts: )

genai-perf/genai_perf/parser.py

genai-perf/tests/test_llm_metrics.py

debermudez · 2024-08-09T20:59:53Z

genai-perf/docs/goodput_tutorial.md

+
+docker run -it --net=host --gpus=1 nvcr.io/nvidia/tritonserver:${RELEASE}-py3-sdk
+
+# Run GenAI-Perf in the container:


I would remove any options that are just stating the default values here. Less code to maintain.

debermudez · 2024-08-09T21:05:51Z

genai-perf/genai_perf/goodput_reporter/goodput_reporter.py

+
+    @abstractmethod
+    def compute_goodput(self) -> None:
+        """Compute the goodput. To be implemented by subclasses."""


same as above

debermudez · 2024-08-09T21:06:13Z

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

+
+
+class LLMGoodputReporter(GoodputReporter):
+    """A subclass to report goodput for language models."""


We know its a subclass from the line above.

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

debermudez · 2024-08-09T21:08:01Z

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

+            if all(val < slo 
+                   for val, slo in zip(request_metric_values, target_metric_values)
+            ):


+1. Readability and Maintainability are the top priority. We optimize only if we find a bottleneck impacting performance.

debermudez · 2024-08-09T21:11:29Z

genai-perf/genai_perf/goodput_reporter/llm_goodput_reporter.py

+                   for val, slo in zip(request_metric_values, target_metric_values)
+            ):
+                good_req_count += 1
+        self._good_req_count = good_req_count


This is what is called a side effect.
In general, we want to avoid those if they are not necessary.
I agree with @nv-hwoo here.

…embeddings usages

genai-perf/genai_perf/profile_data_parser/profile_data_parser.py

genai-perf/tests/test_llm_profile_data_parser.py

dyastremsky

Two small comments. Will finish the review soon.

genai-perf/genai_perf/goodput_calculator/goodput_calculator.py

dyastremsky

One more comment.

This looks pretty close to being ready to merge. Great work, Andy!

genai-perf/tests/test_llm_profile_data_parser.py

dyastremsky

Fantastic work, Andy! Looks good to me. 🚀

Please make sure to confirm the last CI pipeline passed all tests and that running with goodput constraints manually still works for LLM and non-LLM models (since there is no CI test for it yet). Assuming that is all still good, feel free to merge.

GenAI-Perf goodput support implementation

AndyDai-nv added 2 commits August 7, 2024 11:44

Migrate goodput dev branch to this repo

4b132a4

Fix wrong urls

22eaaea

AndyDai-nv requested review from matthewkotila, dyastremsky and nv-hwoo August 7, 2024 18:48

AndyDai-nv temporarily deployed to GITLAB August 7, 2024 18:48 — with GitHub Actions Inactive

dyastremsky reviewed Aug 7, 2024

View reviewed changes

AndyDai-nv added 2 commits August 8, 2024 16:29

New design PoC for goodput

8f5f189

Modified comments.

5d01561

AndyDai-nv temporarily deployed to GITLAB August 8, 2024 23:32 — with GitHub Actions Inactive

dyastremsky reviewed Aug 9, 2024

View reviewed changes

nv-hwoo reviewed Aug 9, 2024

View reviewed changes

debermudez reviewed Aug 9, 2024

View reviewed changes

AndyDai-nv temporarily deployed to GITLAB August 12, 2024 19:36 — with GitHub Actions Inactive

Refactor and enhance code to support goodput options in both LLM and …

5f46340

…embeddings usages

AndyDai-nv temporarily deployed to GITLAB August 13, 2024 00:24 — with GitHub Actions Inactive

github-advanced-security bot found potential problems Aug 13, 2024

View reviewed changes

genai-perf/genai_perf/profile_data_parser/profile_data_parser.py Fixed Show fixed Hide fixed

Add goodput example demos for LLM and Embeddings

baa1d6d

AndyDai-nv temporarily deployed to GITLAB August 13, 2024 00:48 — with GitHub Actions Inactive

AndyDai-nv temporarily deployed to GITLAB August 13, 2024 00:49 — with GitHub Actions Inactive

Add example demos for rankings goodput

bd48def

AndyDai-nv temporarily deployed to GITLAB August 13, 2024 04:00 — with GitHub Actions Inactive

AndyDai-nv added 2 commits August 13, 2024 10:58

Add unit tests.

442b77b

Add VLM goodput examples.

d154afb

AndyDai-nv temporarily deployed to GITLAB August 13, 2024 18:25 — with GitHub Actions Inactive

AndyDai-nv temporarily deployed to GITLAB August 22, 2024 22:36 — with GitHub Actions Inactive

github-advanced-security bot found potential problems Aug 22, 2024

View reviewed changes

genai-perf/tests/test_llm_profile_data_parser.py Fixed Show fixed Hide fixed

genai-perf/tests/test_llm_profile_data_parser.py Fixed Show fixed Hide fixed

Fix pre-commit errors

c34f9db

AndyDai-nv temporarily deployed to GITLAB August 22, 2024 22:44 — with GitHub Actions Inactive

Fix pre-commit errors

afa2784

AndyDai-nv temporarily deployed to GITLAB August 22, 2024 22:47 — with GitHub Actions Inactive

Deleted file merged from main

a3ab155

AndyDai-nv temporarily deployed to GITLAB August 22, 2024 23:05 — with GitHub Actions Inactive

AndyDai-nv temporarily deployed to GITLAB August 22, 2024 23:06 — with GitHub Actions Inactive

dyastremsky reviewed Aug 23, 2024

View reviewed changes

genai-perf/genai_perf/goodput_calculator/goodput_calculator.py Outdated Show resolved Hide resolved

genai-perf/genai_perf/goodput_calculator/goodput_calculator.py Outdated Show resolved Hide resolved

dyastremsky reviewed Aug 23, 2024

View reviewed changes

genai-perf/tests/test_llm_profile_data_parser.py Outdated Show resolved Hide resolved

AndyDai-nv added 2 commits August 23, 2024 14:38

Merge remote-tracking branch 'origin/main' into andy-goodput-dev

274a0f2

Iterated

f3c509f

AndyDai-nv temporarily deployed to GITLAB August 23, 2024 21:57 — with GitHub Actions Inactive

AndyDai-nv temporarily deployed to GITLAB August 23, 2024 21:58 — with GitHub Actions Inactive

github-advanced-security bot found potential problems Aug 23, 2024

View reviewed changes

genai-perf/tests/test_llm_profile_data_parser.py Fixed Show fixed Hide fixed

Fix CodeQL warning

9eaaa81

AndyDai-nv temporarily deployed to GITLAB August 23, 2024 22:01 — with GitHub Actions Inactive

Fix unit tests conflicts

994b5b4

AndyDai-nv temporarily deployed to GITLAB August 23, 2024 22:13 — with GitHub Actions Inactive

dyastremsky approved these changes Aug 23, 2024

View reviewed changes

AndyDai-nv merged commit 68968e3 into main Aug 23, 2024
8 checks passed

AndyDai-nv deleted the andy-goodput-dev branch August 23, 2024 23:25

lkomali pushed a commit that referenced this pull request Aug 27, 2024

Support Goodput metric (#32)

0565603

GenAI-Perf goodput support implementation

nv-hwoo mentioned this pull request Sep 3, 2024

GenAI-Perf goodput support triton-inference-server/client#772

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Goodput initial implementation #32

Goodput initial implementation #32

AndyDai-nv commented Aug 7, 2024

dyastremsky left a comment

dyastremsky left a comment •

edited

Loading

nv-hwoo Aug 9, 2024

debermudez Aug 9, 2024

AndyDai-nv Aug 20, 2024

nv-hwoo Aug 9, 2024

debermudez Aug 9, 2024

AndyDai-nv Aug 20, 2024

debermudez left a comment

debermudez Aug 9, 2024

AndyDai-nv Aug 20, 2024

debermudez Aug 9, 2024

AndyDai-nv Aug 15, 2024 •

edited

Loading

debermudez Aug 9, 2024

AndyDai-nv Aug 20, 2024

debermudez Aug 9, 2024

debermudez Aug 9, 2024

debermudez Aug 9, 2024

debermudez Aug 9, 2024

dyastremsky left a comment

dyastremsky left a comment

dyastremsky left a comment •

edited

Loading


		docker run -it --net=host --gpus=1 nvcr.io/nvidia/tritonserver:${RELEASE}-py3-sdk

		# Run GenAI-Perf in the container:



		class LLMGoodputReporter(GoodputReporter):
		"""A subclass to report goodput for language models."""

Goodput initial implementation #32

Goodput initial implementation #32

Conversation

AndyDai-nv commented Aug 7, 2024

dyastremsky left a comment

Choose a reason for hiding this comment

dyastremsky left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

debermudez left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AndyDai-nv Aug 15, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dyastremsky left a comment

Choose a reason for hiding this comment

dyastremsky left a comment

Choose a reason for hiding this comment

dyastremsky left a comment • edited Loading

Choose a reason for hiding this comment

dyastremsky left a comment •

edited

Loading

AndyDai-nv Aug 15, 2024 •

edited

Loading

dyastremsky left a comment •

edited

Loading