Test harness for lemmas #60

bbyalcinkaya · 2025-01-31T14:34:15Z

This PR adds a test harness for testing lemmas. Tests are written in .k or .md files containing claims that target specific lemmas, with comments above the claims indicating which lemmas are being tested.

Additionally, the spec file proving functionality is now exposed as a CLI command: komet prove-raw. This command enables proving of K claims from a file and includes options for generating bug reports and saving proofs to specified directories.

usage: komet prove-raw [-h] [--proof-dir PROOF_DIR] [--bug-report BUG_REPORT] [--label LABEL] CLAIM_FILE

positional arguments:
  CLAIM_FILE            path to claim file

options:
  -h, --help            show this help message and exit
  --proof-dir PROOF_DIR
                        Output directory for proofs
  --bug-report BUG_REPORT
                        Bug report directory for proofs
  --label LABEL         Label of the K claim in the file

src/tests/lemmas/specs/int-bitwise-spec.k

ehildenb · 2025-02-03T20:59:45Z

I think we can actually avoid using the runLemma => doneLemma functionality altogether. Does parse_modules, when called on modules that have claims like this:

claim ( (I <<Int 32) |Int 4) modInt 256 => ( (I <<Int 32) |Int 4) &Int 255

Produce this directly, or automatically wrap it in <k> cell and configuration? If it doesn't aoutomatically include the k cell, we probably can use the logic here (https://github.com/runtimeverification/evm-semantics/blob/43fce3055f5f94606fb3952028e37bf01e409169/kevm-pyk/src/kevm_pyk/__main__.py#L283), and the EqualityProof.from_claim to build a proof that will be discharged just by calling the simplifier, rather than the reachability prover. Does that work?

Co-authored-by: Everett Hildenbrandt <[email protected]>

bbyalcinkaya · 2025-02-05T14:34:41Z

I think we can actually avoid using the runLemma => doneLemma functionality altogether. Does parse_modules, when called on modules that have claims like this:
claim ( (I <<Int 32) |Int 4) modInt 256 => ( (I <<Int 32) |Int 4) &Int 255
Produce this directly, or automatically wrap it in <k> cell and configuration? If it doesn't aoutomatically include the k cell, we probably can use the logic here (https://github.com/runtimeverification/evm-semantics/blob/43fce3055f5f94606fb3952028e37bf01e409169/kevm-pyk/src/kevm_pyk/__main__.py#L283), and the EqualityProof.from_claim to build a proof that will be discharged just by calling the simplifier, rather than the reachability prover. Does that work?

@ehildenb

Attempting to prove this directly with the APRProver gives a sort error (something like "Int is not a subsort of GeneratedTopCell") . So I implemented the logic in the link you provided. My only concern is this warning:

Building an EqualityProof that has known soundness issues: See https://github.com/runtimeverification/haskell-backend/issues/3605.

Is it something we should worry about?

ehildenb · 2025-02-06T14:40:06Z

I think, given that we're using universal binding for the equality proof now (https://github.com/runtimeverification/k/blob/master/pyk/src/pyk/proof/implies.py#L173), we should be OK w.r.t. that warning. Can you test it by breaking one of the proofs manually, and make sure it fails appropriately?

bbyalcinkaya and others added 6 commits January 31, 2025 16:57

implement the harness

26b1b4a

add lemma tests

075ecd2

Set Version: 0.1.51

3649bee

add CI job for lemma tests

1746e7d

format

fbb640e

update soroban sdk

49646ac

bbyalcinkaya marked this pull request as ready for review February 3, 2025 17:26

bbyalcinkaya requested review from tothtamas28 and ehildenb February 3, 2025 17:26

ehildenb reviewed Feb 3, 2025

View reviewed changes

src/tests/lemmas/specs/int-bitwise-spec.k Outdated Show resolved Hide resolved

bbyalcinkaya and others added 4 commits February 4, 2025 12:23

Fix typo

2aac982

Co-authored-by: Everett Hildenbrandt <[email protected]>

Merge branch 'master' into lemmas-harness

38c36ba

Set Version: 0.1.55

c1b5b03

add EqualityProof support

ea82bf4

bbyalcinkaya requested a review from ehildenb February 5, 2025 16:01

ehildenb approved these changes Feb 6, 2025

View reviewed changes

bbyalcinkaya added the automerge label Feb 6, 2025

automergerpr-permission-manager bot merged commit cbfb8fc into master Feb 6, 2025
4 checks passed

automergerpr-permission-manager bot deleted the lemmas-harness branch February 6, 2025 15:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test harness for lemmas #60

Test harness for lemmas #60

bbyalcinkaya commented Jan 31, 2025 •

edited

Loading

ehildenb commented Feb 3, 2025

bbyalcinkaya commented Feb 5, 2025

ehildenb commented Feb 6, 2025

Test harness for lemmas #60

Test harness for lemmas #60

Conversation

bbyalcinkaya commented Jan 31, 2025 • edited Loading

ehildenb commented Feb 3, 2025

bbyalcinkaya commented Feb 5, 2025

ehildenb commented Feb 6, 2025

bbyalcinkaya commented Jan 31, 2025 •

edited

Loading