this post was submitted on 24 Jul 2023
5 points (100.0% liked)

GPU_programming

92 readers
5 users here now

Programming Lemmy instance focused on GPUs. CUDA, OpenCL, ROCm, DirectX, Vulkan are all on subject here.

founded 2 years ago
MODERATORS
 

Stream Compaction is one of the most important applications of GPU prefix sums IMO.

Of course, prefer a library over writing your own. DirectX11 (and 12) has AppendStructuredBuffers (https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/sm5-object-appendstructuredbuffer) for instance that performs this automatically. AVX512 even has "compress" and "expand" instructions that effectively perform this stream-compaction task.

IMO, this is one of the "new era" methods of creating SIMD-datastructures. Its not often talked about, but its all over the place, the wizards / experts obviously know about this but somehow don't know how to tell us normal people how it works.

you are viewing a single comment's thread
view the rest of the comments
[–] dragontamer 1 points 2 years ago

Post, make my alt a moderator!