Skip to content

glGarg/PerfBench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

PerfBench: Performance Issue Benchmark for Software Engineering Agents

PerfBench is a benchmark dataset for evaluating software engineering agents on C# performance optimization tasks. This repository contains 82 real-world performance issues extracted from GitHub, along with evaluation tools and Docker environments for testing.

Dataset Overview

The benchmark consists of 81 instances from real GitHub repositories, each containing:

  • A specific performance issue described in the problem statement
  • The repository state before the performance fix
  • Reference solution (the actual commit that fixed the performance issue)

Example Structure

Each instance id follows the pattern: {owner}_{repo}__{issue_id}__{dotnet_version}

E.g.:

  • aarnott_nerdbank.messagepack__155__9.0 - Async serialization optimization
  • angusjohnson_clipper2__35__6.0 - Class to struct conversion
  • etc.

Citation

If you use PerfBench in your research, please cite:

@article{garg2025perfbenchagentsresolverealworld,
      title={PerfBench: Can Agents Resolve Real-World Performance Bugs?}, 
      author={Spandan Garg and Roshanak Zilouchian Moghaddam and Neel Sundaresan},
      year={2025},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2509.24091}, 
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published