Building Deepbench with Accel-Sim #96

ishitachaturvedi · 2022-03-01T20:01:44Z

Hi,

I am trying to build DeepBench with Accel-Sim and failing.
Can you share the Deepbench makefile you use to build the benchmark?
Thank you!

JRPan · 2022-03-01T20:03:04Z

Have you checked the gpu-app-collection repo?

rodhuega · 2022-03-01T20:04:43Z

Do you have installed the right cudnn version for your cuda version and set it to the PATH?

ishitachaturvedi · 2022-03-01T20:05:00Z

Not yet. I will look into it.
Can you please keep the issue open till I set it up? Thanks!

ishitachaturvedi · 2022-03-01T20:05:28Z

What cudnn version is required for deepbench?

rodhuega · 2022-03-01T20:07:28Z

I think that it doesn't matter. The only think you need is to have a right installation of cuda with cudnn and all the things in the PATH. I had problems compiling deepbench and after that I was able to compile it and execute it.

ishitachaturvedi · 2022-03-01T20:08:33Z

Which libraries need to be there in PATH?
Have you had any success setting up Gunrock?

rodhuega · 2022-03-01T20:09:53Z

Cuda and cudnn. I don't know what is ginecólogo

ishitachaturvedi · 2022-03-01T20:51:43Z

Do any libraries need to be linked as static in the makefile?

JRPan · 2022-03-02T00:03:40Z

why not just use the one we provided?

ishitachaturvedi · 2022-03-02T00:17:21Z

I used the deepbench from gpu-app-collection using release-accelwattch branch.
An ldd on conv_bench gives-
linux-vdso.so.1 (0x00007fff5ad12000)
libcurand.so.10 => /usr/local/cuda-11/lib64/libcurand.so.10 (0x000014d023f0b000)
libcudnn.so.8 => /scratch/gpfs/ishitac/cudnn-lib/lib64/libcudnn.so.8 (0x000014d023ce3000)
libcudart.so.11.0 => /usr/local/cuda-11/lib64/libcudart.so.11.0 (0x000014d023a3f000)
libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x000014d0236b6000)
libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x000014d02349e000)
libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x000014d0230ad000)
librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x000014d022ea5000)
libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x000014d022c86000)
libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x000014d022a82000)
libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x000014d0226e4000)
/lib64/ld-linux-x86-64.so.2 (0x000014d029e06000)

Should it not be pointing to accel-wattch libcudart?

JRPan · 2022-03-02T00:22:28Z

This looks fine. The setup_environment.sh will set LD_LIBRARY_PATH

ishitachaturvedi · 2022-03-02T00:32:10Z

I built conv_bench, but while I do see the executable, if I do ./conv_bench it gives Segmentation fault (core dumped)

My bashrc looks like this-
export PTX_SIM_MODE_FUNC=0
export PTX_SIM_MODE_DETAIL=1
export CUDA_INSTALL_PATH=/usr/local/cuda-11
export PATH=$CUDA_INSTALL_PATH/bin:$PATH
source /scratch/gpfs/ishitac/gpgpusim-codes/accel-sim-framework/gpu-simulator/setup_environment.sh

export CPATH=$CPATH:/scratch/gpfs/ishitac/local/include
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/scratch/gpfs/ishitac/local/lib
export CUDNN_PATH=/scratch/gpfs/ishitac/cudnn-lib/
export LD_LIBRARY_PATH=$CUDA_INSTALL_PATH/lib64:$CUDA_INSTALL_PATH/lib:/scratch/gpfs/ishitac/cudnn-lib/lib64

ishitachaturvedi · 2022-03-02T00:59:45Z

It is not invoking accel-sim but going to nvidia-cuda to run

JRPan · 2022-03-02T03:23:33Z

can you try manually source the setup_environment.sh?
I'm not sure about this but I think the later export in your bashrc overwrites what setup_environment.sh did.
export LD_LIBRARY_PATH=$CUDA_INSTALL_PATH/lib64:$CUDA_INSTALL_PATH/lib:/scratch/gpfs/ishitac/cudnn-lib/lib64

ishitachaturvedi · 2022-03-02T03:30:18Z

I manually sourced it and changed the line to export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$CUDA_INSTALL_PATH/lib64:$CUDA_INSTALL_PATH/lib:/scratch/gpfs/ishitac/cudnn-lib/lib64
I did make on source again, but it does not link to accel-sim in libcuda, it links to the nvcc library somehow

ishitachaturvedi · 2022-03-02T04:24:44Z

Interestingly I have this problem for cuda 11. When I go to CUDA 10.1 it links libcudart.so to the correct libcuda for accelsim. My cuda 10.1 does not have cublas so I am still sorting out the issue for Deepbench, but other benchmarks which dont require cublas are linking to accelsim now. Any ideas for why it is not working with cuda 11?

ishitachaturvedi · 2022-03-02T18:36:26Z

I managed to fix the linker error.
Now gpgpusim is linked with the binaries.
However I get the following error-
terminate called after throwing an instance of 'std::runtime_error'
what(): CUDNN failure: CUDNN_STATUS_NOT_INITIALIZED in cudnn_helper.h at line: 33
Any way to get around this? Thank you

BrianQian1999 · 2023-02-15T14:56:46Z

Have you fixed that CUDNN_STATUS_NOT_INITIALIZED issue? I was stuck in that as well.

JRPan · 2023-02-16T18:20:31Z

Are you running PTX mode or trace mode? Deepbench is supported in trace mode in Accel-Sim

BrianQian1999 · 2023-02-17T00:47:00Z

I was trying to deploy some workloads implemented in cuDNN (Deepbench, cuDNN_sample, etc.) in PTX mode. I replaced the libcudnn.so to the corresponding static library to enable PTX mode (as it does in the original GPGPU-sim). However, the same SEGF would occur in both Accel-sim and GPGPU-sim on my machine (CUDA11.0 + cuDNN8.5 + GCC9).

JRPan · 2023-02-17T00:52:22Z

Again, Deepbench is supported in trace mode in Accel-Sim. We have not looked into how to run PTX mode with cuDNN. Feel free to download the traces for Deepbench and try out the trace mode.

BrianQian1999 · 2023-02-17T01:12:28Z

Thanks for the feedback. It seems that it would not be possible to run cuDNN in PTX mode after CUDA-8.

JRPan · 2023-02-17T01:13:38Z

*Not my paper. But thanks.

Yes, that was done with gpgpu-sim. Not Accel-sim. Here is some discussion on DeepBench and a response from the Author of Accel-Sim, Mahmoud.
gpgpu-sim/gpgpu-sim_distribution#212

Someone made it work a while ago. But since Accel-Sim went out, so we focused on Accel-Sim instead.

Good luck hacking!

JRPan closed this as completed Mar 2, 2022

JRPan reopened this Mar 2, 2022

JRPan closed this as completed Apr 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Building Deepbench with Accel-Sim #96

Building Deepbench with Accel-Sim #96

ishitachaturvedi commented Mar 1, 2022

JRPan commented Mar 1, 2022

rodhuega commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

rodhuega commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

rodhuega commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

JRPan commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022 •

edited

Loading

JRPan commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

JRPan commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

BrianQian1999 commented Feb 15, 2023

JRPan commented Feb 16, 2023

BrianQian1999 commented Feb 17, 2023

JRPan commented Feb 17, 2023

BrianQian1999 commented Feb 17, 2023

JRPan commented Feb 17, 2023

Building Deepbench with Accel-Sim #96

Building Deepbench with Accel-Sim #96

Comments

ishitachaturvedi commented Mar 1, 2022

JRPan commented Mar 1, 2022

rodhuega commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

rodhuega commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

rodhuega commented Mar 1, 2022

ishitachaturvedi commented Mar 1, 2022

JRPan commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022 • edited Loading

JRPan commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

JRPan commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

ishitachaturvedi commented Mar 2, 2022

BrianQian1999 commented Feb 15, 2023

JRPan commented Feb 16, 2023

BrianQian1999 commented Feb 17, 2023

JRPan commented Feb 17, 2023

BrianQian1999 commented Feb 17, 2023

JRPan commented Feb 17, 2023

ishitachaturvedi commented Mar 2, 2022 •

edited

Loading