DD3431-PhD-Task

Repository for the PhD assignment of course DD3431 at KTH.

GOAL & Overview

This PhD project aims to apply the fundamental learning of the DD3431 Machine Learning course on the PhD topic of the author.

In this case, the project will be directly related to smart contract security.

Using the dataset smartbugs-curated and the recent-published exploit for the dataset sb-heist, this experiment will compare the performance of two models trained with different configurations of the same dataset for vulnerable line identification.

Mix Model (MM):

Dataset: smartbugs-curated
Labels: smartbugs-curated

Pure Model (PM):

Dataset: smartbugs-curated
Labels: smartbugs-curated x sb-heists (exploits)

Methodology

Each model will aim to identify vulnerable lines of code.

MM: will train using the labeles from the smartgbugs-curated dataset.

PM: will train on lines of code that are label as vulnerable if they are reported as vulnerable in the smartbugs-curated dataset and the contract has been listed as exploitable by sb-heists.

Evaluation

Both MM and PM will be test with each others dataset to assess the performance of the model when trained with pure True Positive vulnerabilities.

Results

Accuracy

Dataset	MM			PM
Dataset	Accuracy	Precision	Recall	Accuracy	Precision	Recall
Raw smartbugs-curated	0.887	0.84	0.954	0.426	0.44	0.659
Exploitable smartbugs-curated	0.42	0.518	0.518	0.844	0.833	0.925

Reproduction

Installation

Clone the repository:

git clone https://github.com/yourusername/DD3431-PhD-Task.git

Change to the repository directory:
```
cd DD3431-PhD-Task
```
Install the required dependencies using Poetry:
```
poetry install
```

Acknowledgements

Thanks to @vivi365 for her key advise on ML for vulnerability detection.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
dataset		dataset
notebooks		notebooks
.gitignore		.gitignore
.gitmodules		.gitmodules
DD3431_PhD_Report_Sofia_Bobadilla.pdf		DD3431_PhD_Report_Sofia_Bobadilla.pdf
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DD3431-PhD-Task

GOAL & Overview

Methodology

Evaluation

Results

Reproduction

Installation

Acknowledgements

About

Releases

Packages

Languages

License

sofiabobadilla/DD3431-PhD-Task

Folders and files

Latest commit

History

Repository files navigation

DD3431-PhD-Task

GOAL & Overview

Methodology

Evaluation

Results

Reproduction

Installation

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages