cuda.device_empty function #66

smazouz42 · 2024-07-19T00:57:56Z

This pull request addresses issue #57 by adding a new feature to 'cuda' device_empty that allows you to allocate memory on the GPU

This PR aims to make the C code compilable using nvcc. The cuda language was added as well as a CudaCodePrinter. Changes to stdlib: Wrapped expressions using complex types in an `ifndef __NVCC__` to avoid processing them with the nvcc compiler --------- Co-authored-by: Mouad Elalj, EmilyBourne

This pull request fixes #48, by implementing a tiny wrapper for CUDA and a wrapper for non-CUDA functionalities only with external 'C'. **Commit Summary** - Implemented new header printer for CUDA. - Added CUDA wrapper assignment - Instead of wrapping all local headers, wrap only C functions with extern 'C' --------- Co-authored-by: EmilyBourne <[email protected]> Co-authored-by: bauom <[email protected]>

This pull request addresses issue #28 by implementing a new feature in Pyccel that allows users to define custom GPU kernels. The syntax for creating these kernels is inspired by Numba. and I also need to fix issue #45 for testing purposes **Commit Summary** - Introduced KernelCall class - Added cuda printer methods _print_KernelCall and _print_FunctionDef to generate the corresponding CUDA representation for both kernel calls and definitions - Added IndexedFunctionCall represents an indexed function call - Added CUDA module and cuda.synchronize() - Fixing a bug that I found in the header: it does not import the necessary header for the used function --------- Co-authored-by: EmilyBourne <[email protected]> Co-authored-by: bauom <[email protected]> Co-authored-by: Emily Bourne <[email protected]>

…nctions, and refining CUDA type handling

This PR aims to make the C code compilable using nvcc. The cuda language was added as well as a CudaCodePrinter. Changes to stdlib: Wrapped expressions using complex types in an `ifndef __NVCC__` to avoid processing them with the nvcc compiler --------- Co-authored-by: Mouad Elalj, EmilyBourne

This pull request fixes #48, by implementing a tiny wrapper for CUDA and a wrapper for non-CUDA functionalities only with external 'C'. **Commit Summary** - Implemented new header printer for CUDA. - Added CUDA wrapper assignment - Instead of wrapping all local headers, wrap only C functions with extern 'C' --------- Co-authored-by: EmilyBourne <[email protected]> Co-authored-by: bauom <[email protected]>

This pull request addresses issue #28 by implementing a new feature in Pyccel that allows users to define custom GPU kernels. The syntax for creating these kernels is inspired by Numba. and I also need to fix issue #45 for testing purposes **Commit Summary** - Introduced KernelCall class - Added cuda printer methods _print_KernelCall and _print_FunctionDef to generate the corresponding CUDA representation for both kernel calls and definitions - Added IndexedFunctionCall represents an indexed function call - Added CUDA module and cuda.synchronize() - Fixing a bug that I found in the header: it does not import the necessary header for the used function --------- Co-authored-by: EmilyBourne <[email protected]> Co-authored-by: bauom <[email protected]> Co-authored-by: Emily Bourne <[email protected]>

This pull request addresses issue #59 by adding more CUDA-specific keywords to enhance the checking of variable/function names and prevent name clashes --------- Co-authored-by: EmilyBourne <[email protected]> Co-authored-by: bauom <[email protected]>

This pull request addresses issue #41 by implementing a new feature in Pyccel that allows users to define a custom device **Commit Summary** - Adding handler for custom device and its code generation. - Adding test --------- Co-authored-by: EmilyBourne <[email protected]>

smazouz42 · 2024-07-24T15:58:16Z

Here is your checklist. Please tick items off when you have completed them or determined that they are not necessary for this pull request:

Write a clear PR description
Add tests to check your code works as expected
Update documentation if necessary
Update Changelog
Ensure any relevant issues are linked
Ensure new tests are passing

… issue_58

EmilyBourne and others added 30 commits June 27, 2024 08:10

Trigger tests on push to devel or main branch

c7a6638

Add cuda workflow to test cuda developments on CI

821a1c5

Trigger tests on push to devel or main branch

092b557

Begin implementation of CUDA arrays: adding cudaempty and cudafull fu…

80f905b

…nctions, and refining CUDA type handling

work in progress

7e8cf9e

work in progress

2dbcfae

work in progress

f3911d5

work in progress

37289f9

work in progress

ba66b48

work in progress

406a88b

work in progress

3afad1b

work in progress

190c5a2

cleaning up my PR

eeeb249

cleaning up my PR

de0f5ab

cleaning up my PR

d6ba6ad

work in progress

8286a89

work in progress

96c3f29

work in progress

b414d62

Trigger tests on push to devel or main branch

7c93416

Add cuda workflow to test cuda developments on CI

f8ec722

Trigger tests on push to devel or main branch

cc3a93e

work in progress

a28c724

smazouz42 added 24 commits July 25, 2024 10:56

fix doc string of host_empty

eea028a

Merge branch 'issue_68' of https://github.com/pyccel/pyccel-cuda into…

aed013a

… issue_58

Make sure tests are running successfully

c5a508c

merge with issue_56

71c85c2

refactoring the code

52ebe93

refactoring the code

ec738b3

refactoring the code

aa76f91

move CudaThreadIndexing to pyccel/cuda

528099f

cleaning upmy PR

f1f63ef

add final new line

1aa26b1

add final new line

ea1beb7

Make sure tests are passing

0f076a0

refactoring the code

57f977e

adding missing import to device test

26fcdc0

adding missing import to kernel

9f58f02

refactoring the code

c45c615

refactoring the code

572cdd8

update doc

d969ebb

update doc

2b3085f

update doc

34d801f

update doc

e9436a9

Merge branch 'issue_68' of https://github.com/pyccel/pyccel-cuda into…

803ef8c

… issue_58

work in progress

ac936de

work in progress

e3db4c0

EmilyBourne force-pushed the devel branch from 8eef19d to 12d98b6 Compare July 26, 2024 12:09

EmilyBourne force-pushed the devel branch 2 times, most recently from 81b9970 to 5f7e3e2 Compare September 3, 2024 13:43

EmilyBourne force-pushed the devel branch from 5f7e3e2 to bb18b0a Compare September 25, 2024 15:40

EmilyBourne force-pushed the devel branch from bb18b0a to de362d3 Compare November 8, 2024 13:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda.device_empty function #66

cuda.device_empty function #66

smazouz42 commented Jul 19, 2024

smazouz42 commented Jul 24, 2024 •

edited by pyccel-bot bot

Loading

cuda.device_empty function #66

Are you sure you want to change the base?

cuda.device_empty function #66

Conversation

smazouz42 commented Jul 19, 2024

smazouz42 commented Jul 24, 2024 • edited by pyccel-bot bot Loading

smazouz42 commented Jul 24, 2024 •

edited by pyccel-bot bot

Loading