NVlabs / instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
See what the GitHub community is most excited about today.
Instant neural graphics primitives: lightning fast NeRF and more
CUDA Library Samples
Sample codes for my CUDA programming book
how to optimize some algorithm in cuda.
NCCL Tests
FlashInfer: Kernel Library for LLM Serving
Flash Attention in ~100 lines of CUDA (forward pass only)
cuGraph - RAPIDS Graph Analytics Library
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.