Could you please provide the code for calculating the unit test metric for function completion on the RepoEval dataset? #8

Open
wyt2000 opened this issue Jul 1, 2024 · 1 comment

wyt2000 commented Jul 1, 2024

Thank you for your work. I would like to reproduce the experiments on the RepoEval dataset, but the repository only contains the code for calculating EM and ES. Could you please provide the code for calculating the UT metric, so that the function-level code generation experiments can be reproduced? I would greatly appreciate it.

Contributor

xiaowu0162 commented Jul 1, 2024

Hi @wyt2000,

Thank you for your interest. For each repository, we created an environment and installed the repository from source. Then, to test each case, we substituted the model's function completion into the source code and ran the unit tests with pytest.

Although we haven't tested it, similar functionality appears to be implemented in CodeRAG-Bench (https://github.com/code-rag-bench/code-rag-bench?tab=readme-ov-file#repoeval-function). You may want to give it a try.

Best,
Di
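
For readers who want a starting point, the procedure described in the reply above (substitute the completion into the source file, then run pytest) could be sketched roughly as follows. This is not the authors' script. The function names `splice_completion` and `run_case`, the 1-indexed inclusive line range, the timeout, and the choice to count a case as passing only when pytest exits with code 0 are all assumptions made for illustration. It also assumes the repository is already installed in the Python environment that runs the script.

```python
import subprocess
import sys
from pathlib import Path


def splice_completion(source: str, start: int, end: int, completion: str) -> str:
    """Replace 1-indexed lines start..end (inclusive) of `source` with `completion`.

    Hypothetical helper: how the target span is identified is an assumption.
    """
    lines = source.splitlines(keepends=True)
    if not completion.endswith("\n"):
        completion += "\n"
    return "".join(lines[: start - 1]) + completion + "".join(lines[end:])


def run_case(repo_dir: str, rel_path: str, start: int, end: int,
             completion: str, test_target: str, timeout: int = 300) -> bool:
    """Patch one file with the model completion, run pytest, restore the file.

    Returns True only if pytest exits with code 0 (assumed pass criterion).
    """
    target = Path(repo_dir) / rel_path
    original = target.read_text()
    try:
        target.write_text(splice_completion(original, start, end, completion))
        proc = subprocess.run(
            [sys.executable, "-m", "pytest", test_target, "-x", "-q"],
            cwd=repo_dir, capture_output=True, text=True, timeout=timeout,
        )
        return proc.returncode == 0
    except subprocess.TimeoutExpired:
        # Treat a hanging completion as a failed case.
        return False
    finally:
        # Always restore the original source so cases do not affect each other.
        target.write_text(original)
```

Restoring the file in the `finally` block keeps each test case independent. Running every repository in its own environment, as the reply describes, would still need to be set up separately.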
