Skip to content

jkonvicka/Nvidia-CUDA-course

Repository files navigation

Nvidia-CUDA-course

Fundamentals of Accelerated Computing with CUDA C/C++

💡CUDA cheatsheet

NVIDIA DLI Certificate

ce7b1370c4484875811a1a47160a6ea6

Final assessment kernel execution time: 0.495862 sec [max. 1.3 sec]

Course

  1. Accelerating Applications with CUDA C/C++
  2. Unified Memory
  3. Streaming and Visual Profiling

Objectives

By the time you complete this lab, you will be able to:

* Write, compile, and run C/C++ programs that both call CPU functions and launch GPU kernels.
* Control parallel thread hierarchy using execution configuration.
* Refactor serial loops to execute their iterations in parallel on a GPU.
* Allocate and free memory available to both CPUs and GPUs.
* Handle errors generated by CUDA code.
* Accelerate CPU-only applications.

About

Fundamentals of Accelerated Computing with CUDA C/C++

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published