DeepGEMM/deep_gemm/jit_kernels
Zhean Xu 04278f6dee
Weight gradient kernels for dense and MoE models (#95)
* Init weight gradient kernels.

* Support unaligned n,k and gmem stride

* Update docs

* Several cleanups

* Remove restrictions on N

* Add stride(0) assertions

---------

Co-authored-by: Chenggang Zhao <chenggangz@deepseek.com>
2025-05-14 14:47:58 +08:00
__init__.py        Weight gradient kernels for dense and MoE models (#95)               2025-05-14 14:47:58 +08:00
gemm.py            Weight gradient kernels for dense and MoE models (#95)               2025-05-14 14:47:58 +08:00
m_grouped_gemm.py  Weight gradient kernels for dense and MoE models (#95)               2025-05-14 14:47:58 +08:00
runtime.py         Weight gradient kernels for dense and MoE models (#95)               2025-05-14 14:47:58 +08:00
tuner.py           Add DG_PRINT_AUTOTUNE to README                                      2025-05-07 11:46:52 +08:00
utils.py           Fix get_col_major_tma_aligned_tensor to handle 2-dimensional inputs  2025-03-13 22:15:16 +08:00
wgrad_gemm.py      Weight gradient kernels for dense and MoE models (#95)               2025-05-14 14:47:58 +08:00