`OutlinesLogitsProcessor` Benchmarks #979

lapp0 · 2024-06-16T22:13:57Z

What behavior of the library made you think about the improvement?

In #956 I noticed some performance issues relating to a large number of allowed_tokens. We should profile how long it takes for a logits processor to augment logits:

The core performance issue with outlines.generate.regex(model, ".{200}") is the need to convert a large list (~150,000 integers) into a tensor in the logits processor:
    allowed_tokens = self.fsm.get_next_instruction(self._fsm_state).tokens
    allowed_tokens = torch.tensor(allowed_tokens, device=logits.device)
To mitigate, we can create a separate issue to ensure the FSM index uses tensors of token IDs, not lists. This will result in self.fsm.get_next_instruction(self._fsm_state).tokens being a tensor of token IDs.
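To make the cost concrete, here is a minimal sketch that times the same masking step with a Python list versus a prebuilt tensor of token IDs. It does not use the real FSM or `OutlinesLogitsProcessor`: `mask_logits`, `allowed_list`, and `allowed_tensor` are hypothetical stand-ins, and the vocabulary size is assumed to match the ~150,000 figure quoted above. Actual timings will depend on hardware and device.

```python
import timeit

import torch

VOCAB_SIZE = 150_000  # assumed size, chosen to match the ~150,000 figure above

# Hypothetical stand-ins for what the FSM might return:
# a plain Python list today, a cached tensor after the proposed change.
allowed_list = list(range(VOCAB_SIZE))
allowed_tensor = torch.tensor(allowed_list)


def mask_logits(logits, allowed_tokens):
    """Return logits with every token outside `allowed_tokens` set to -inf."""
    if not isinstance(allowed_tokens, torch.Tensor):
        # This list-to-tensor conversion is the step the issue identifies as slow.
        allowed_tokens = torch.tensor(allowed_tokens, device=logits.device)
    mask = torch.full_like(logits, float("-inf"))
    mask[allowed_tokens] = 0
    return logits + mask


logits = torch.randn(VOCAB_SIZE)
runs = 5
t_list = timeit.timeit(lambda: mask_logits(logits, allowed_list), number=runs)
t_tensor = timeit.timeit(lambda: mask_logits(logits, allowed_tensor), number=runs)
print(f"list input:   {t_list / runs * 1e3:.2f} ms per call")
print(f"tensor input: {t_tensor / runs * 1e3:.2f} ms per call")
```

Both paths produce the same masked logits; only the per-call conversion differs, which is why storing tensors in the FSM index should remove that overhead.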

How would you like it to behave?

As part of our ASV benchmark test suite, we should benchmark the performance of OutlinesLogitsProcessor to ensure there aren't performance regressions, and track performance improvements.
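A rough sketch of what such a benchmark could look like follows. ASV discovers methods prefixed with `time_`, runs `setup` before each one, and sweeps over `params`. The class name, the parameter values, and the masking logic are assumptions for illustration; this sketch mimics the list-to-tensor step rather than calling the real `OutlinesLogitsProcessor`, which an actual benchmark in the suite would do.

```python
import torch


class LogitsProcessorMaskBenchmark:
    """Hypothetical ASV benchmark class; not part of the Outlines test suite."""

    # ASV runs each time_* method once per parameter value.
    params = [1_000, 150_000]
    param_names = ["num_allowed_tokens"]

    def setup(self, num_allowed_tokens):
        vocab_size = 150_000  # assumed vocabulary size
        self.logits = torch.randn(vocab_size)
        # Stand-in for the list of token IDs returned by the FSM.
        self.allowed_tokens = list(range(num_allowed_tokens))

    def _apply_mask(self):
        # Mimics the conversion and masking done inside a logits processor.
        allowed = torch.tensor(self.allowed_tokens, device=self.logits.device)
        mask = torch.full_like(self.logits, float("-inf"))
        mask[allowed] = 0
        return self.logits + mask

    def time_mask_from_list(self, num_allowed_tokens):
        self._apply_mask()
```

Parameterizing over the number of allowed tokens makes a regression in the large-list case visible separately from the common small-list case.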

lapp0 · 2024-07-13T21:49:02Z

Resolved by #1013

lapp0 added the enhancement label Jun 16, 2024

lapp0 changed the title ~~LogitsProcessor Benchmarks~~ OutlinesLogitsProcessor Benchmarks Jun 16, 2024

lapp0 added the tests and optimization labels Jun 19, 2024

lapp0 closed this as completed Jul 13, 2024
