InferAI

InferAI is an experimental tool that uses LLMs to automatically generate patches for security vulnerabilities identified by the Infer Static Analyzer. This project was developed as part of the Ingegneria dei Sistemi Distribuiti course at the University of Catania (UNICT), with the goal of bridging the gap between vulnerability detection and resolution.

The project is intended as a conceptual tool to explore automation in vulnerability patching. While it provides insights into the potential of such tools, InferAI is not designed for production use or highly complex projects.

Code Structure and Workflow

The software is divided into three components that work asynchronously:

Frontend:
- A basic web interface that enables users to start jobs and view results.
Infer-Worker:
- A backend service responsible for running the Infer Static Analyzer, parsing the code, and enriching it.
Query-Worker:
- A backend service that queries the LLM to generate patch suggestions based on the output of Infer.

The three components communicate asynchronously through message queues provided by RabbitMQ.
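
As a rough illustration of what might travel over these queues, the sketch below serializes a job (an ID, a repository link, and a code entry point) to JSON bytes and decodes it again on the consumer side. The `AnalyzeJob` class, its field names, and the example URL are assumptions made for this sketch; the actual message schema used by InferAI is not documented here.

```python
import json
import uuid
from dataclasses import asdict, dataclass


@dataclass
class AnalyzeJob:
    # Hypothetical field names; InferAI's real schema may differ.
    job_id: str
    repository: str
    entrypoint: str


def encode_job(job: AnalyzeJob) -> bytes:
    """Serialize a job into the bytes a queue publisher would send."""
    return json.dumps(asdict(job)).encode("utf-8")


def decode_job(body: bytes) -> AnalyzeJob:
    """Rebuild the job from a message body on the consumer side."""
    return AnalyzeJob(**json.loads(body.decode("utf-8")))


# Example round trip with placeholder values.
job = AnalyzeJob(
    job_id=str(uuid.uuid4()),
    repository="https://example.com/some/repo.git",
    entrypoint="main.c",
)
body = encode_job(job)
```

Keeping the payload as plain JSON means any worker can consume it regardless of the client library used to talk to RabbitMQ.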

The workflow is as follows:

Job Creation: A user initiates a job through the frontend. The job is defined by an ID, the repository link to analyze, and the code entry point. This information is sent to the Infer-worker via the analyze-jobs queue.
Code Analysis: Infer-worker processes the job by analyzing the code with the Infer Static Analyzer. Vulnerabilities are grouped by function, and the code is enriched with contextual comments for the LLM (see the Example section). Source code manipulation is done with the tree-sitter library. Each vulnerable function is sent to the Query-worker through the query-jobs queue.
Code Fix Generation: Query-worker processes incoming messages and handles rate limit errors. The LLM response contains a patched version of the vulnerable function, which is forwarded to the patch-jobs queue, handled by the Infer-worker.
Patch Generation: Infer-worker compares the patched function generated by the LLM with the original function to create an applicable patch file. Infer Static Analyzer is run again to verify the quality of the generated code, though retry handling is not currently implemented.
Result Retrieval: Users can download the processed results through the frontend.
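
The rate limit handling mentioned in the code fix generation step could be sketched as a retry loop with exponential backoff. This is one common approach, not necessarily the one InferAI uses; `RateLimitError`, `query_with_backoff`, and the `send` callable are hypothetical names standing in for whatever the LLM client actually provides.

```python
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for the rate limit error raised by an LLM client."""


def query_with_backoff(send, prompt, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call send(prompt), retrying on RateLimitError with doubling delays.

    `send` is any callable that returns the LLM response or raises
    RateLimitError. `sleep` is injectable so the loop can be tested
    without actually waiting.
    """
    for attempt in range(max_attempts):
        try:
            return send(prompt)
        except RateLimitError:
            # Give up after the last allowed attempt.
            if attempt == max_attempts - 1:
                raise
            # Wait base_delay, then 2x, 4x, ... before retrying.
            sleep(base_delay * 2 ** attempt)
```

With the defaults, a request that keeps hitting the limit is retried after 1, 2, 4, and 8 seconds before the error is propagated to the caller.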

Note

This setup is a proof of concept and has areas for improvement. For instance:
- The shared storage between the frontend and Infer-worker could be replaced with cloud storage.
- A database could be introduced to track users and their jobs more effectively. Currently, the frontend manages user sessions and analyzed files with a simple JWT.

Installation

To set up and run InferAI, follow these steps:

git clone https://github.com/v0lp3/InferAI.git

Set the required tokens:

cd InferAI
mkdir secrets
cd secrets
echo "YOUR_GROQ_TOKEN" > groq_token.txt
echo "RANDOM_PASSWORD" > rabbitmq.txt
echo "RANDOM_PASSWORD2" > flask.txt # optional

Run with Docker Compose:

docker compose up

Example

Note

Different executions can produce different results.

Given the following code as input:

#include <string.h>

int main () {

        char* test = malloc(-1);
        strcpy(test, "yoooooooo");

}

Infer will produce the following report:

main.c:6: error: Buffer Overrun L1
  Offset: 9 Size: [0, -1].
  4.
  5.         char* test = malloc(-1);
  6.         strcpy(test, "yoooooooo");
             ^
  7.
  8. }

main.c:5: error: Inferbo Alloc Is Big
  Length: 18446744073709551615.
  3. int main () {
  4.
  5.         char* test = malloc(-1);
                          ^
  6.         strcpy(test, "yoooooooo");
  7.

main.c:6: error: Null Dereference
  pointer `test` last assigned on line 5 could be null and is dereferenced by call to `strcpy()` at line 6, column 9.
  4.
  5.         char* test = malloc(-1);
  6.         strcpy(test, "yoooooooo");
             ^
  7.
  8. }


Found 3 issues
                  Issue Type(ISSUED_TYPE_ID): #
          Null Dereference(NULL_DEREFERENCE): 1
  Inferbo Alloc Is Big(INFERBO_ALLOC_IS_BIG): 1
        Buffer Overrun L1(BUFFER_OVERRUN_L1): 1

InferAI processes the report from the Infer Static Analyzer and enriches the original code by adding contextual comments directly above the lines where vulnerabilities were detected. For instance:

#include <string.h>

int main () {
        // [Unsafe] INFERBO_ALLOC_IS_BIG: Length: 18446744073709551615.
        char* test = malloc(-1);
        // [Unsafe] NULL_DEREFERENCE: pointer `test` last assigned on line 5 could be null and is dereferenced by call to `strcpy()` at line 6, column 9.
        // [Unsafe] BUFFER_OVERRUN_L1: Offset: 9 Size: [0, -1].
        strcpy(test, "yoooooooo");

}
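
The enrichment shown above can be approximated with a few lines of Python. This is a simplified sketch: InferAI uses tree-sitter for source manipulation, while this version works on plain line numbers. The `enrich` function is hypothetical, and the issue dictionaries are assumed to carry `line`, `bug_type`, and `qualifier` keys like the entries in Infer's JSON report.

```python
from collections import defaultdict


def enrich(source: str, issues: list) -> str:
    """Insert a '// [Unsafe] ...' comment above every reported line."""
    # Group issues by the 1-based line number they refer to.
    by_line = defaultdict(list)
    for issue in issues:
        by_line[issue["line"]].append(issue)

    out = []
    for number, text in enumerate(source.splitlines(), start=1):
        # Reuse the indentation of the flagged line for the comment.
        indent = text[: len(text) - len(text.lstrip())]
        for issue in by_line.get(number, []):
            out.append(f"{indent}// [Unsafe] {issue['bug_type']}: {issue['qualifier']}")
        out.append(text)
    return "\n".join(out)
```

Unlike the example output above, this sketch keeps every original line, including blank ones, and only adds comment lines.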

The enriched code is then passed to the LLM, which uses the additional context to generate a patch.

--- test3/main.c
+++ test3/main.c
@@ -2,7 +2,11 @@
 
 int main () {
 
-        char* test = malloc(-1);
-        strcpy(test, "yoooooooo");
+        char* test = NULL;
+        if(test = malloc(256)) {
+                strcpy(test, "yoooooooo");
+                free(test);
+        }
 
 }
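
A unified diff like the one above can be produced by comparing the original and patched sources line by line. The sketch below uses `difflib.unified_diff` from the Python standard library; whether InferAI builds its patch files this way is an assumption, and `make_patch` is a hypothetical helper name.

```python
import difflib


def make_patch(original: str, patched: str, path: str) -> str:
    """Return a unified diff that turns `original` into `patched`."""
    diff = difflib.unified_diff(
        original.splitlines(keepends=True),
        patched.splitlines(keepends=True),
        fromfile=path,
        tofile=path,
    )
    # unified_diff yields lines that already end in newlines.
    return "".join(diff)
```

If the two inputs are identical, the result is an empty string, which a caller could treat as "the LLM changed nothing".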

Benchmarks

You can find the benchmarks here.

Author

Andrea Maugeri
