
Implement extend for cached_test_function_ir #4159

Merged
tybug merged 8 commits into HypothesisWorks:master from explain-ir on Nov 9, 2024

Conversation

@tybug (Member) commented Nov 8, 2024

Implement extend for cached_test_function_ir, and use ir serialization for a notion of size.

Previous description:

Closes #3864.

This is really two PRs:

  • implement extend: int = 0 for cached_test_function_ir
  • migrate explain to the ir

In implementing extend for the ir, we have to choose a notion of "size" for the ir. I've chosen len(nodes) for now. We'll probably want to use something more intelligent in the future, such that 1k booleans is smaller than 1k strings each with 10 characters.
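For concreteness, the two notions of size discussed here could look roughly like this (a sketch only; the function names and the node attribute `value` are illustrative, not the PR's exact API):

    # Size as node count: one 1000-character string costs the same as one boolean.
    def ir_size_by_nodes(nodes):
        return len(nodes)

    # A "more intelligent" size might weight each node by how much data it carries,
    # so that 1k booleans really is smaller than 1k ten-character strings.
    def ir_size_weighted(nodes):
        total = 0
        for node in nodes:
            value = node.value
            total += 1 + (len(value) if isinstance(value, (str, bytes)) else 0)
        return total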

The Inquisitor migration was a relatively straightforward lifting of blocks to nodes. I haven't thought too carefully about whether the transformation is correct (for some explain-phase-specific reason) beyond "tests pass", so that may be worth a closer look in review.

Likely easiest to review by commit (except there is an "oops, all fixes" grab bag commit at the end).

@Zac-HD (Member) commented Nov 8, 2024

  • I'm not convinced the notion of ir_size makes sense; we seem to be conflating shrink ordering with number of nodes - I'd suggest separating these, and just calling len() directly for the latter.
  • I'm concerned that replacing start:end with random values will be rather inefficient for the explain mode; can we measure this, or maybe implement an "if you see this magic value (None?), generate randomly until the corresponding .stop_example() call" feature?

otherwise looks awesome! Might be easiest to split out the explain mode changes to a separate PR; we could probably merge the rest today if so.

@tybug (Member, Author) commented Nov 8, 2024

  • Agreed these are two distinct problems. I guess what I was trying to get at is that - in the future where BUFFER_SIZE_IR replaces BUFFER_SIZE - we would be limiting the maximum size of examples to n nodes instead of n bytes. But a single node can be almost arbitrarily large, whereas a byte can't. So if we interpret "ir size" as the number of nodes, we would allow consumers to generate e.g. a million characters in a single string without overrunning. That was the motivation behind defining a separate notion of size for overruns (which would be distinct from shrink ordering).

  • Hmm, is the new code not equivalent to the old? I didn't read the explain algorithm in detail, but it seemed to already be replacing start:end with a random buffer:

    buf_attempt_fixed = bytearray(buffer)
    buf_attempt_fixed[start:end] = [
        self.random.randint(0, 255) for _ in range(end - start)
    ]
    result = self.engine.cached_test_function(
        buf_attempt_fixed, extend=BUFFER_SIZE - len(buf_attempt_fixed)
    )

and now we're replacing start:end with the same number of random nodes, redrawn with each node's type and kwargs (see the sketch below). end - start is smaller on the ir, though, because it counts nodes rather than bytes.
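The node-level analogue of the quoted snippet would be roughly the following (a sketch under assumptions: random_ir_node is a hypothetical helper that draws a fresh value for a given ir_type and kwargs, BUFFER_SIZE_IR is the prospective constant mentioned above, and the exact cached_test_function_ir call may differ):

    # Sketch: instead of splicing random bytes into buffer[start:end], redraw
    # the nodes in nodes[start:end] with fresh random values of the same
    # ir_type and kwargs, then rerun the cached test function.
    nodes_attempt = list(nodes)
    nodes_attempt[start:end] = [
        random_ir_node(node.ir_type, node.kwargs)  # hypothetical helper
        for node in nodes[start:end]
    ]
    result = self.engine.cached_test_function_ir(
        nodes_attempt,
        extend=BUFFER_SIZE_IR - len(nodes_attempt),  # budget analogous to the buffer version
    )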

@Zac-HD (Member) commented Nov 8, 2024

Right, yeah, I think "n bytes when serialized" might be the easiest way to get a nice notion of IR size for overruns.
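In code, the serialized-size idea amounts to something like this (a sketch, assuming ir_to_bytes - added in this PR - consumes the nodes' values, and reusing the prospective BUFFER_SIZE_IR name from the discussion above):

    # Sketch: measure an ir prefix by its serialized byte length, so a huge
    # string node costs more than a boolean node.
    def ir_size(nodes):
        return len(ir_to_bytes(node.value for node in nodes))

    # An overrun check then compares against a byte budget rather than a node count:
    def would_overrun(nodes, budget=BUFFER_SIZE_IR):
        return ir_size(nodes) > budget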

It was, yep. Even with type-matching though it's much less likely that we can successfully 'slip' between different valid node types if there's some variation there... maybe we just don't worry about that for now though, and leave better-generation on the todo list.

@tybug (Member, Author) commented Nov 8, 2024

ah yup, serialized size is probably good enough for now (and maybe ever).

I think that after #4086 the 'slipping' is roughly fine - we don't throw away misalignments anymore. It's not as efficient in clock cycles (due to ir -> buffer -> ir), but I think we accept the hit and just try to move off bytes asap.

@tybug (Member, Author) commented Nov 9, 2024

OK, I've scoped down this PR. I think we're reaching a critical juncture here where it's important to get the ir semantics right, but there are spurious potential problems caused by the interaction of the ir and buffer semantics - such as disagreements on when something is an overrun. Hopefully we (I) can blaze through the changes and minimize impact.

(to be clear, I think I've avoided any consumer-facing problems, but it does make me nervous.)

@tybug changed the title from "Migrate explain phase to the typed choice sequence" to "Implement extend for cached_test_function_ir" on Nov 9, 2024
@@ -671,3 +673,75 @@ def move(self, src: bytes, dest: bytes, value: bytes) -> None:

    def delete(self, key: bytes, value: bytes) -> None:
        raise RuntimeError(self._read_only_message)


def ir_to_bytes(ir: Iterable[IRType], /) -> bytes:
@tybug (Member, Author) commented on this line, Nov 9, 2024

This implementation is lifted from your branch, with two fixes (both sketched below):

  • surrogatepass instead of surrogateescape (I don't recall my justification at the time, but if you run your test case for long enough you will get an error with surrogateescape)
  • correct interpretation of negative ints? I think this is also caught by the test case if run for long enough
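For illustration, the two fixes correspond to encoding choices along these lines (a hedged sketch of the idea, not the actual ir_to_bytes implementation):

    # Strings: Hypothesis can generate lone surrogates, which "surrogateescape"
    # will eventually fail to encode; "surrogatepass" round-trips any surrogate.
    def _encode_str(value: str) -> bytes:
        return value.encode("utf-8", errors="surrogatepass")

    # Integers: encode with signed=True so negative values round-trip instead
    # of raising OverflowError or being misread as large positive numbers.
    def _encode_int(value: int) -> bytes:
        n = (value.bit_length() + 8) // 8  # byte width, with room for the sign bit
        return value.to_bytes(n, "big", signed=True)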

@Zac-HD (Member) left a comment
Looks good - onwards!

@tybug merged commit 80942bc into HypothesisWorks:master on Nov 9, 2024
48 checks passed
@tybug deleted the explain-ir branch on November 9, 2024 at 16:00