update to outlines010 #1092
Conversation
Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-1092/
CodSpeed Performance Report
Merging #1092 will improve performances by ×4.1.

Benchmarks breakdown
Looks good as it's the approach I had started for LlamaCpp.
- Did you check whether it works for LlamaCpp? I had tested it before the vacations and needed to update `logits_processor=LogitsProcessorList([self._logits_processor]) if self.structured_output else None` in llamacpp.py.
- This is the issue I had encountered with Llama models; I guess it should be solved by the previous PR, right? RuntimeError when using generate.json() on llama 3.2 with llamaccp dottxt-ai/outlines#1261
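To illustrate the fix mentioned above, here is a minimal, self-contained sketch of why wrapping a single processor in a list-like container matters. The `LogitsProcessorList` class below is a stand-in mimicking llama-cpp-python's container (a list of callables applied in order to the logits); `ban_first_token` is a toy processor invented for this example, not distilabel code.

```python
from typing import List


class LogitsProcessorList(list):
    """Minimal stand-in for llama_cpp.LogitsProcessorList:
    applies each contained processor in sequence to the logits."""

    def __call__(self, input_ids: List[int], scores: List[float]) -> List[float]:
        for processor in self:
            scores = processor(input_ids, scores)
        return scores


def ban_first_token(input_ids: List[int], scores: List[float]) -> List[float]:
    # Toy processor: mask out the token with id 0.
    scores = list(scores)
    scores[0] = float("-inf")
    return scores


# The fix from the comment: wrap the single structured-output
# processor in a list-like container rather than passing the bare
# callable, so the backend can iterate over processors uniformly.
processor = LogitsProcessorList([ban_first_token])
out = processor([1, 2], [0.5, 0.3, 0.2])
```

The real code does the same thing with `LogitsProcessorList([self._logits_processor])`, guarded by `if self.structured_output else None`.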
for more information, see https://pre-commit.ci
…hub.com/argilla-io/distilabel into feat/1081-feature-update-to-outlines010
This looks good functionally, but I found it difficult to follow. With community maintainability in mind, I think you could localise the outlines version logic in one place.
Looks good! I left some comments. Have you generated a dataset with the three integrations to check that it works?
…ct; delete unnecessary .DS_Store files from unit tests
- Introduced a helper function to check whether the `outlines` package is installed and which version it is.
- Updated the logic in `_get_logits_processor` to use the new version check, simplifying processor selection based on the outlines version.
- Adjusted the handling of tokenizers in `_get_tokenizer_from_model` to streamline the integration with different frameworks.
- Modified `prepare_guided_output` to differentiate processing based on the outlines version, ensuring compatibility with both pre-0.1.0 and post-0.1.0 versions of the outlines package.
- Replaced the `_set_logits_processor` method with direct assignment of `_logits_processor` using `_prepare_structured_output`.
- Simplified the logic for setting the logits processor in both the `load` and generation methods, enhancing code clarity and maintainability.
…sLLM
- Updated the import statement for outlines to use the new helper function `_outlines_version_below_0_1_0`.
- Simplified the logic for setting the `_logits_processor` based on the outlines version check, enhancing code clarity and maintainability.
- Renamed the helper function from `_outlines_version_below_0_1_0` to `_is_outlines_version_below_0_1_0` for clarity.
- Updated all references to the renamed function across the codebase, ensuring consistent usage in the `TransformersLLM` class and related functions.
- Enhanced code readability and maintainability by standardizing function naming conventions.
…on outlines version
- Introduced a version check for outlines in both LlamaCppLLM and TransformersLLM to determine the processor return type.
- Updated `prepare_guided_output` to handle processor initialization differently for outlines versions below and above 0.1.0.
- Enhanced tokenizer handling in `_get_tokenizer_from_model` to support multiple frameworks, ensuring compatibility and improved functionality.
…ransformersLLM
- Updated return types of `_prepare_structured_output` methods to reflect changes in processor handling.
- Changed the return type in LlamaCppLLM from `Union["LogitsProcessorList", None]` to `Union["LogitsProcessorList", "LogitsProcessor"]`.
- Modified MlxLLM and TransformersLLM to return `Union[List[Callable], Callable]` instead of `Union[Callable, None]`, ensuring consistency across implementations.
- Enhanced code clarity and maintainability by standardizing output handling in structured output preparation.
- Added support for the 'mlx' framework in the outlines processing logic.
- Updated the `prepare_guided_output` function to utilize `TransformerTokenizer` for the 'mlx' framework.
- Modified the `_get_logits_processor` and `_get_tokenizer_from_model` functions to include 'mlx' as a valid framework option, ensuring consistent handling across different frameworks.
- Improved code clarity and maintainability by standardizing framework handling in the structured output preparation process.
LGTM
- Simplified return types in LlamaCppLLM and MlxLLM by removing version checks and directly returning the processor.
- Enhanced code clarity and maintainability by standardizing the output structure across both classes.
- Updated `prepare_guided_output` usage to ensure consistent handling of structured outputs.
- Removed the `structured_output` attribute and related processing logic from MlxLLM to simplify the class structure.
- Updated the `load` and generation methods to eliminate references to structured output, enhancing clarity and maintainability.
- Adjusted imports and type hints in `outlines.py` to reflect the removal of 'mlx' framework support, streamlining the framework handling.
- Improved code readability by cleaning up unnecessary complexity in structured output preparation.
- Changed the assignment of `_logits_processor` to always use a list, ensuring consistent handling across different outlines versions.
- Removed the version check for outlines in the `load` method, simplifying the logic and enhancing maintainability.
- Updated the return type in the structured output preparation to directly return the processor, improving code clarity.
- Updated type hints for the `llm` parameter in the `_get_tokenizer_from_model` and `prepare_guided_output` functions to use `_vLLM` instead of `LLM`, enhancing code readability.
- Adjusted imports to reflect the new alias for `LLM`, streamlining the code structure.
- Updated type hint imports to include `# noqa` comments, enhancing code readability and maintaining consistency with type checking.
- No functional changes were made; this commit focuses on code structure and clarity.
- Updated the return statement in the `prepare_guided_output` function to use `model or tokenizer` instead of `llm`, improving clarity and consistency in processor assignment.
- This change enhances the function's flexibility in handling different input types while maintaining existing functionality.
- Removed the upper version limit for the `transformers` package, allowing for updates beyond version 4.47.0.
mlx-lm integration #995: did not include this because it was relatively complex to add at this stage.