🐉
In dragons I trust!
-
NVIDIA
- Portland, United States
Pinned Loading
-
llvm/llvm-project
llvm/llvm-project PublicThe LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
-
spack/spack
spack/spack PublicA flexible package manager that supports multiple versions, configurations, platforms, and compilers.
-
omni-compiler/xcodeml-tools
omni-compiler/xcodeml-tools Publicfrontend and backend for C and Fortran
-
omni-compiler/omni-compiler
omni-compiler/omni-compiler PublicOmni Compiler for C and Fortran programs with XcalableMP and OpenACC directives
-
claw-project/claw-compiler
claw-project/claw-compiler PublicCLAW Compiler for Performance Portability
-
omni-compiler/XcodeML-Specification
omni-compiler/XcodeML-Specification PublicSpecification of XcodeML
1,030 contributions in the last year
Day of Week | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | April Apr | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Loading
Contribution activity
April 2025
Created 34 commits in 1 repository
Created a pull request in llvm/llvm-project that received 9 comments
[mlir][NVVM] Add ops for vote all and any sync
Add operations for nvvm.vote.all.sync
and nvvm.vote.any.sync
intrinsics similar to nvvm.vote.ballot.sync
.
+87
−40
lines changed
•
9
comments
Opened 33 other pull requests in 1 repository
llvm/llvm-project
1
closed
31
merged
1
open
-
Revert "[flang][cuda] Update stream type for cuf kernel op"
This contribution was made on Apr 22
-
[flang][cuda] Update stream type for cuf kernel op
This contribution was made on Apr 21
-
[flang][openacc] Make async clause behavior homogenous
This contribution was made on Apr 21
-
[flang][openacc] Make num_gangs, num_workers and vector_length behavior homogenous with parallel
This contribution was made on Apr 18
-
[flang][cuda] Only convert launch from CUDA Fortran kernels
This contribution was made on Apr 17
-
[flang][cuda] Carry over the stream information to kernel launch
This contribution was made on Apr 17
-
[flang][cuda] Update cuf.kernel_launch stream and conversion
This contribution was made on Apr 17
-
[flang][cuda] Introduce stream cast op
This contribution was made on Apr 16
-
[flang][rt] Attempt to support some complex pow on the device
This contribution was made on Apr 15
-
[flang][openacc] Allow if clause on atomic directives
This contribution was made on Apr 11
-
[flang][openacc] Allow if_present multiple times on host_data and update
This contribution was made on Apr 11
-
[flang][openacc] Allow finalize clause on exit data more than once
This contribution was made on Apr 11
-
[flang][openacc] Allow multiple device_type clauses on init and shutdown
This contribution was made on Apr 11
-
[flang][cuda] Use the provided stream in kernel launch
This contribution was made on Apr 10
-
[flang][cuda] Pass stream information to kernel launch functions
This contribution was made on Apr 10
-
[flang][cuda] Propagate stream information to gpu.launch_func op
This contribution was made on Apr 10
-
[flang][cuda] Update stream operand type for cuf.kernel_launch op
This contribution was made on Apr 10
-
[flang][cuda] Use the aysncId in device allocation
This contribution was made on Apr 9
-
[flang][cuda] Add asyncId to allocate entry point
This contribution was made on Apr 8
-
[flang][cuda] Relax compatibility rules when host,device procedure is involved
This contribution was made on Apr 8
-
[flang][cuda] Avoid triggering host array error in host device proc
This contribution was made on Apr 8
-
[flang][cuda] Implicitly load cudadevice in host,device and grid_global procedures
This contribution was made on Apr 8
-
[flang][cuda] Add on_device interface to cudadevice
This contribution was made on Apr 7
-
[flang][cuda] Add async id to allocators
This contribution was made on Apr 7
-
[flang][cuda] Register managed variables with double descriptor
This contribution was made on Apr 4
- Some pull requests not shown.
Reviewed 39 pull requests in 1 repository
llvm/llvm-project
25 pull requests
-
[mlir][acc] Use consistent name for device_num operand
This contribution was made on Apr 22
-
[flang][acc] Generate acc.bounds operation from FIR shape
This contribution was made on Apr 22
-
[flang][acc] Update stride calculation to include inner-dimensions
This contribution was made on Apr 21
-
[OpenACC][CIR] Lower 'num_workers' for parallel/kernels
This contribution was made on Apr 21
-
[OpenACC] Switch Clang to use the Flang 'appertainment' rules for cla…
This contribution was made on Apr 18
-
Reland [flang] Generalized simplification of HLFIR reduction ops. (#136071)
This contribution was made on Apr 18
-
[flang][cuda] Only convert launch from CUDA Fortran kernels
This contribution was made on Apr 18
-
[flang][cuda] Carry over the stream information to kernel launch
This contribution was made on Apr 18
-
[flang][OpenACC] Fix crash due to truncated scope source range
This contribution was made on Apr 17
-
[flang][acc] Avoid implicitly privatizing IVs already privatized
This contribution was made on Apr 17
-
[flang][cuda] Update cuf.kernel_launch stream and conversion
This contribution was made on Apr 17
-
[flang] Fixed out-of-bounds access in SimplifyIntrinsics.
This contribution was made on Apr 17
-
[flang][cuda] Introduce stream cast op
This contribution was made on Apr 17
-
[flang][CUDA] Add error & warning for device argument first dimension…
This contribution was made on Apr 17
-
[flang][openacc] Allow open acc routines from other modules.
This contribution was made on Apr 16
-
[flang] Improve OpenACC SELF clause parser
This contribution was made on Apr 16
-
[flang][rt] Attempt to support some complex pow on the device
This contribution was made on Apr 15
-
[OpenACC][CIR] Implement 'self' lowering on compute constructs
This contribution was made on Apr 15
-
[flang][OpenACC] use correct type when create private box init recipe
This contribution was made on Apr 15
-
[flang][OpenMP] Prescanning bug with !$ fixed form line continuation
This contribution was made on Apr 11
-
[flang] Handle volatility in lowering and codegen
This contribution was made on Apr 11
-
[flang-rt] Declare DeviceTrap static inline.
This contribution was made on Apr 11
-
[flang][NFC] Split CG dialect and the passes.
This contribution was made on Apr 10
-
[OpenACC][CIR] Implement 'device_type' clause lowering for 'init'/'sh…
This contribution was made on Apr 10
-
[flang] Don't insert spaces in -E output after line continuation
This contribution was made on Apr 9
- Some pull request reviews not shown.