Unlocking GPU Intrinsics in HLSL
Nvidia
NOVEMBER 21, 2023
For example, a shader can use warp shuffle instructions to exchange data between threads in a warp without going through shared memory, which is especially valuable in pixel shaders where there is no shared memory. Or a shader can perform atomic additions on half-precision floating-point numbers in global memory.
Let's personalize your content