The artifact contains the source code and Python/shell scripts. The scripts generate the throughput results in Table 3 of our paper. We also provide our raw experimental data and the scripts to reproduce other figures in our paper.
We developed and tested our work in two NVIDIA GPUs (Quadro P6000 and Tesla V100). We expect our code can run on GPUs with the compute capability no less than 5.0.
Our work requires CUDA 9.2 SDK. Our work uses cmake later than 3.10 for compilation. We use Python 3 for our scripts.
We test our work on Ubuntu 18.04. We expect it can work with mainstream Linux distributions.
- matplotlib
- numpy
- pandas
- scipy
- xlrd
If you use conda
, simply run
conda create --name gpunfa_env python=3.7 matplotlib numpy scipy pandas xlrd
conda activate gpunfa_env
to install all the required Python packages.
All data sets are from public available benchmark suites. We convert the automata files of them to ANML format. To facilitate this step, we provide the data set that is ready to use. Our scripts will automatically unzip these datasets.
If the benchmark could not be downloaded due to GitHub's quota, try to download it here.
The setup.sh
will automatically download the datasets, build the executables, and set up environmental variables.
git clone https://github.com/bigwater/gpunfa-artifact.git
cd gpunfa-artifact
source setup.sh
Our project contains three executables, infant
, ppopp12
, and obat
. They are added to your PATH variable after the setup. The former two excutables are our implementations of iNFAnt and NFA-CG, respectively. The obat
contains our schemes. They share the same options and arguments settings.
obat -i [input_stream_file] -a [anml_file] -g [algorithm] ...
The algorithm could be:
- obat2: the scheme NewTran
- obat_MC: NewTran + matchset compression
- hotstart_ea_no_MC2: Hotstart
- hotstart_ea: Howstart + matchset compression
ppopp12 -i [input_stream_file] -a [anml_file] -g [algorithm] ...
The algorithm could be:
- ppopp12: Our implementation of NFA-CG.
infant -i [input_stream_file] -a [anml_file] -g [algorithm] ...
The algorithm could be:
- infant: Our implementation of iNFAnt.
Other options. You can check how to use other options by showing the help using obat -?
or obat -h
.
To get Table 3 in our paper, simply run the following commands.
cd gpunfa-artifact
./run_experiments_get_table3.sh
The entire set requires several hours to finish. It uses the same configuration as the paper and generates Table 3 automatically. It will generate a csv file abs_throughput.csv
containing the information of Table 3.
cd ${GPUNFA_ROOT}
./gen_figures.sh
The script will take around 20 min to finish.
Please refer to this paper for other details.
@inproceedings{gpunfa-asplos2020,
author = {Liu, Hongyuan and Pai, Sreepathi and Jog, Adwait},
title = {{Why GPUs are Slow at Executing NFAs and How to Make them Faster}},
booktitle = {Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)},
year = {2020}
}