TacTok

The TacTok automated Coq proof script synthesis tool

The TacTok technique and its evaluation are described in TacTok: Semantics-Aware Proof Synthesis by Emily First, Yuriy Brun, and Arjun Guha, in Proceedings of the ACM on Programming Languages (PACMPL) Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA) issue, vol. 4, November 2020, pp. 149:1-149:26, doi: 10.1145/3428299.

We have made available a replication package for the results in the paper. It is a VirtualBox VM: here. This includes the result files, scripts for processing the results, pre-trained models, directions and code for running the pre-trained models, and training examples used in the paper. The VM does not support training models.

The following are the directions for installation and use of TacTok.

1. Installation

TacTok operates within the CoqGym learning environment and so modifies their code. The following are the dependencies and directions to install CoqGym:

Dependencies

Building Coq, SerAPI, CoqHammer, and the CoqGym Coq Projects

Create an OPAM switch for OCaml 4.07.1+flambda: opam switch create 4.07.1+flambda && eval $(opam env)
Upgrade the installed OPAM packages (optional): opam upgrade && eval $(opam env)
Clone the repository
Install Coq, SerAPI and CoqHammer: cd TacTok && source install.sh
Build the Coq projects (can take a while): cd coq_projects && make && cd ..
Create and activate the conda environment: conda env create -f coq_gym.yml && conda activate coq_gym

2. Extracting proofs from Coq projects

For any Coq project that compiles in Coq 8.9.1 that you want to use (and may not be in the CoqGym dataset), the following are the steps to extract the proofs from code:

Copy the project into the coq_projects directory.
For each *.meta file in the project, run python check_proofs.py --file /path/to/*.meta
This generates a *.json file in ./data/ corresponding to each *.meta file. The proofs field of the JSON object is a list containing the proof names.
For each *.meta file and each proof, run:
python extract_proof.py --file /path/to/*.meta --proof $PROOF_NAME
python extract_synthetic_proofs.py --file /path/to/*.meta --proof $PROOF_NAME
Finally, run python postprocess.py

3. Using the CoqGym benchmark dataset

Download the CoqGym dataset

To download the CoqGym dataset, please refer to the CoqGym repo for the latest instructions.

Training Examples (proof steps)

Proofs steps used in the paper are found in processed.tar.gz, which can be downloaded from the replication package link provided above. This should be copied into TacTok/
To extract new proofs, run python extract_proof_steps.py.
To generate new proof steps that have prev_tokens field, run python process_proof_steps.py. This will generate processed/proof_steps.

4. Training TacTok

To train, for example, the Tok model on all the proof steps, run python main.py --no_validation --exp_id tok

Model checkpoints will be saved in TacTok/runs/tok/checkpoints/. See options.py for command line options.

5. Evaluation

Now, you can evaluate a model you trained on the test set. For example, the Tok model that you trained can be run with python evaluate.py ours tok-results --path /path/to/tok_model/*.pth. If you used additional options in training, specify those same options for evaluating.

WeigthedGreedy and WeightedRandom don't require any training. To evaluate these two methods:

Run WeightedGreedy python evaluate.py greedy greedy-results
Run WeightedRandom python evaluate.py weighed random-results These commands run the models for the entire specified test set. You can specify file names, proof names, project/file indices, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
TacTok		TacTok
coq-serapi		coq-serapi
coq		coq
coq_projects		coq_projects
stats		stats
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
check_proofs.py		check_proofs.py
coq_gym.yml		coq_gym.yml
eval_env.py		eval_env.py
extract_proof.py		extract_proof.py
extract_synthetic_proofs.py		extract_synthetic_proofs.py
gallina.py		gallina.py
install.sh		install.sh
postprocess.py		postprocess.py
proof_tree.py		proof_tree.py
re_patterns.py		re_patterns.py
serapi.py		serapi.py
unzip_data.py		unzip_data.py
utils.py		utils.py
vernac_types.py		vernac_types.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TacTok

1. Installation

Dependencies

Building Coq, SerAPI, CoqHammer, and the CoqGym Coq Projects

2. Extracting proofs from Coq projects

3. Using the CoqGym benchmark dataset

Download the CoqGym dataset

Training Examples (proof steps)

4. Training TacTok

5. Evaluation

About

Releases

Packages

Contributors 3

Languages

License

LASER-UMASS/TacTok

Folders and files

Latest commit

History

Repository files navigation

TacTok

1. Installation

Dependencies

Building Coq, SerAPI, CoqHammer, and the CoqGym Coq Projects

2. Extracting proofs from Coq projects

3. Using the CoqGym benchmark dataset

Download the CoqGym dataset

Training Examples (proof steps)

4. Training TacTok

5. Evaluation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages