DeepGEMM/deep_gemm/jit
google-labs-jules[bot] 93ea4797c0 Add initial support for Nvidia Blackwell (SM120)
This change introduces the necessary compiler flags and CMake configurations to enable support for the Nvidia Blackwell SM120 architecture.

- Modified deep_gemm/jit/compiler.py to include sm_120 and compute_120 flags for NVCC and NVRTC.
- Updated CMakeLists.txt to add the new architecture flags for the build process.

Further testing on Blackwell hardware is required to validate MMA instruction compatibility and overall performance.
2025-06-24 00:30:35 +00:00
..
__init__.py Refactor JIT compilation (+NVRTC support) (#94) 2025-05-07 11:38:14 +08:00
compiler.py Add initial support for Nvidia Blackwell (SM120) 2025-06-24 00:30:35 +00:00
interleave_ffma.py Refactor JIT compilation (+NVRTC support) (#94) 2025-05-07 11:38:14 +08:00
runtime.py Fix JIT tests 2025-05-16 14:39:58 +08:00