Lm eval documentation #538

BSynRedhat · 2024-11-13T15:04:47Z

Added the LM-Eval upstream content as four modules (including the assembly) and updated the TrustyAI book to include the LMEval assembly. Conditionalized the bias tutorial and explainability assembly so that they appear only upstream.

eturner24 · 2024-11-15T15:11:05Z

lmeval-evaluation-job.adoc

+[role='_abstract']
+LM-Eval service defines a new Custom Resource Definition (CRD) called `LMEvalJob`. An `LMEvalJob` object represents an evaluation job. `LMEvalJob` objects are monitored by the TrustyAI Kubernetes operator.
+
+ Therefore, to run an evaluation job, you first need to create an `LMEvalJob` object with the following information: `model`, `model arguments`, `task`, and `secret`. 


Suggested change

-Therefore, to run an evaluation job, you first need to create an `LMEvalJob` object with the following information: `model`, `model arguments`, `task`, and `secret`.
+To run an evaluation job, you create an `LMEvalJob` object with the following information: `model`, `model arguments`, `task`, and `secret`.

eturner24 · 2024-11-15T15:12:23Z

lmeval-evaluation-job.adoc

+
+ Therefore, to run an evaluation job, you first need to create an `LMEvalJob` object with the following information: `model`, `model arguments`, `task`, and `secret`. 
+
+Once the `LMEvalJob` is created, the LM-Eval service will run the evaluation job and update the status and results to the `LMEvalJob` object when the information is available.


Suggested change

-Once the `LMEvalJob` is created, the LM-Eval service will run the evaluation job and update the status and results to the `LMEvalJob` object when the information is available.
+After the `LMEvalJob` is created, the LM-Eval service runs the evaluation job. The status and results of the `LMEvalJob` object update when the information is available.

Using "once" to mean "after" can be confusing to non-English speakers - using "after" is more precise.
Reference: https://www.ibm.com/docs/en/ibm-style?topic=word-usage#o
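
For orientation, the object that the abstract describes might look like the following minimal sketch. The `apiVersion`, the `hf` model type, the `taskList.taskRecipes` layout, and the card and template names are assumptions drawn from upstream TrustyAI examples rather than from this PR, and `evaljob-sample` is a placeholder name.

```yaml
apiVersion: trustyai.opendatahub.io/v1alpha1  # assumed API group and version
kind: LMEvalJob
metadata:
  name: evaljob-sample              # placeholder job name
spec:
  model: hf                         # model type (assumed: Hugging Face)
  modelArgs:                        # model arguments
  - name: pretrained
    value: google/flan-t5-base      # model referenced in the sample under review
  taskList:                         # task to evaluate
    taskRecipes:
    - card:
        name: "cards.wnli"          # assumed example card
      template: "templates.classification.multi_class.relation.default"
```

After an object like this is created, the status and results are written back to the same `LMEvalJob` object, as the reworded sentence states.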

eturner24 · 2024-11-15T15:16:57Z

lmeval-evaluation-job.adoc

+
+[NOTE]
+--
+It is not recommended to deploy the TrustyAI custom resource (CR) in any namespace that contains non-tabular models. Such models are unsupported by TrustyAI, and may cause errors within the TrustyAI service.


Suggested change

-It is not recommended to deploy the TrustyAI custom resource (CR) in any namespace that contains non-tabular models. Such models are unsupported by TrustyAI, and may cause errors within the TrustyAI service.
+TrustyAI does not support non-tabular models. Deploying the TrustyAI custom resource (CR) in a namespace that contains non-tabular models can cause errors within the TrustyAI service.

Rewriting to avoid passive voice. It is also better to state the recommendation rather than saying "it is not recommended".
References:

https://www.ibm.com/docs/en/ibm-style?topic=information-claims-recommendations

https://www.ibm.com/docs/en/ibm-style?topic=grammar-verbs#voice

eturner24 · 2024-11-15T15:21:56Z

lmeval-evaluation-job.adoc

+
+.Sample LMEvalJob object 
+
+Below is an example of an `LMEvalJob` object. 


Suggested change

-Below is an example of an `LMEvalJob` object.
+The sample `LMEvalJob` object contains the following features:

Avoid use of "below"

Do not use to indicate a relative location in a document, as in “the information below”.
From: https://www.ibm.com/docs/en/ibm-style?topic=word-usage#b

I wonder if this list might also be better as callouts in the code.

eturner24 · 2024-11-15T15:22:21Z

lmeval-evaluation-job.adoc

+
+Below is an example of an `LMEvalJob` object. 
+
+* It uses the `google/flan-t5-base` model from link:https://huggingface.co/google/flan-t5-base[Hugging Face]. 


Suggested change

-* It uses the `google/flan-t5-base` model from link:https://huggingface.co/google/flan-t5-base[Hugging Face].
+* The `google/flan-t5-base` model from link:https://huggingface.co/google/flan-t5-base[Hugging Face].

eturner24 · 2024-11-15T16:11:25Z

lmeval-evaluation-job.adoc

+** `env`: Specify environment variables. It uses the `EnvVar` data structure of kubernetes.
+** `volumeMounts`: Mount the volumes into the lm-eval container
+** `resources`: Specify the resources for the lm-eval container.
+* `volumes`: Specify the volume information for the lm-eval and other containers. It uses the `Volume`  data structure of kubernetes.


Suggested change

-* `volumes`: Specify the volume information for the lm-eval and other containers. It uses the `Volume` data structure of kubernetes.
+* `volumes`: Specifies the volume information for the `lm-eval` and other containers. This parameter uses the `Volume` data structure of Kubernetes.

eturner24 · 2024-11-15T16:11:47Z

lmeval-evaluation-job.adoc

+** `volumeMounts`: Mount the volumes into the lm-eval container
+** `resources`: Specify the resources for the lm-eval container.
+* `volumes`: Specify the volume information for the lm-eval and other containers. It uses the `Volume`  data structure of kubernetes.
+* `sideCars`: A list of containers that run along with the lm-eval container. It uses the `Container` data structure of kubernetes.


Suggested change

-* `sideCars`: A list of containers that run along with the lm-eval container. It uses the `Container` data structure of kubernetes.
+* `sideCars`: A list of containers that run along with the `lm-eval` container. It uses the `Container` data structure of Kubernetes.
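
To make the nesting of these fields easier to review, here is a rough sketch of how they might sit in the spec. The parent keys `pod` and `container` are assumptions inferred from the bullet levels in the diff, and every name, path, image, and value is hypothetical.

```yaml
spec:
  pod:                              # assumed parent key
    container:                      # assumed key for the lm-eval container settings
      env:                          # Kubernetes EnvVar entries
      - name: HF_TOKEN              # hypothetical variable name
        valueFrom:
          secretKeyRef:
            name: my-secret         # hypothetical Secret
            key: hf-token
      volumeMounts:                 # mounts volumes into the lm-eval container
      - name: model-cache           # must match a name under volumes
        mountPath: /cache
      resources:                    # resources for the lm-eval container
        limits:
          cpu: "1"
          memory: 4Gi
    volumes:                        # Kubernetes Volume entries
    - name: model-cache
      emptyDir: {}
    sideCars:                       # Kubernetes Container entries
    - name: helper                  # hypothetical sidecar
      image: registry.example.com/helper:latest
```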

eturner24 · 2024-11-15T16:12:38Z

lmeval-evaluation-job.adoc

+
+
+| `outputs`
+| This section defines custom output locations for the evaluation results storage. At the moment only Persistent Volume Claims (PVC) are supported.


Suggested change

-| This section defines custom output locations for the evaluation results storage. At the moment only Persistent Volume Claims (PVC) are supported.
+| This parameter defines a custom output location to store the evaluation results. Only Persistent Volume Claims (PVC) are supported.

Not sure if parameter is the right word

eturner24 · 2024-11-15T16:14:12Z

lmeval-evaluation-job.adoc

+| This section defines custom output locations for the evaluation results storage. At the moment only Persistent Volume Claims (PVC) are supported.
+
+| `outputs.pvcManaged`
+| Create an operator-managed PVC to store this job's results. The PVC will be named `<job-name>-pvc` and will be owned by the `LMEvalJob`. After job completion, the PVC will still be available, but it will be deleted upon deleting the `LMEvalJob`. Supports the following fields:


Suggested change

-| Create an operator-managed PVC to store this job's results. The PVC will be named `<job-name>-pvc` and will be owned by the `LMEvalJob`. After job completion, the PVC will still be available, but it will be deleted upon deleting the `LMEvalJob`. Supports the following fields:
+| Creates an operator-managed PVC to store this job's results. The PVC is named `<job-name>-pvc` and is owned by the `LMEvalJob`. After the job finishes, the PVC is still available, but it is deleted with the `LMEvalJob`. Supports the following fields:

Can we find a different wording than "is owned by" - in addition to being passive voice, it anthropomorphizes the job. More info: https://www.ibm.com/docs/en/ibm-style?topic=grammar-anthropomorphism
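
For reference while rewording, a job that requests a managed PVC might be declared roughly as follows. This is only a sketch: the `size` field and its value are assumptions, because the supported fields are not visible in this hunk.

```yaml
spec:
  outputs:
    pvcManaged:
      size: 5Gi                     # assumed field and value
```

For a job named `evaljob-sample` (a placeholder), the operator would create a PVC named `evaljob-sample-pvc`, following the `<job-name>-pvc` pattern in the description.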

eturner24 · 2024-11-15T16:15:14Z

lmeval-evaluation-job.adoc

+= LM-Eval evaluation job
+
+[role='_abstract']
+LM-Eval service defines a new Custom Resource Definition (CRD) called `LMEvalJob`. An `LMEvalJob` object represents an evaluation job. `LMEvalJob` objects are monitored by the TrustyAI Kubernetes operator.


Where is the LM-Eval service? RHOAI or OpenShift? Should it be code formatted?

BSynRedhat added 3 commits November 13, 2024 14:49

LMEval Documentation x4 files

eaad41d

Edited TrustyAI section to include new LMEval assembly

171f651

Conditionalising Explainability and Bias tutorial to be upstream only

1c59e13

BSynRedhat self-assigned this Nov 13, 2024

BSynRedhat marked this pull request as draft November 13, 2024 15:05

aduquett mentioned this pull request Nov 13, 2024

Trusty ai 2.15 #539

Closed

BSynRedhat marked this pull request as ready for review November 14, 2024 18:03

eturner24 requested changes Nov 15, 2024
