DeepGEMM/deep_gemm
Zhean Xu 04278f6dee
Weight gradient kernels for dense and MoE models (#95)
* Init weight gradient kernels.

* Support unaligned n,k and gmem stride

* Update docs

* Several cleanups

* Remove restrictions on N

* Add stride(0) assertions

---------

Co-authored-by: Chenggang Zhao <chenggangz@deepseek.com>
2025-05-14 14:47:58 +08:00
include/deep_gemm  Weight gradient kernels for dense and MoE models (#95)  2025-05-14 14:47:58 +08:00
jit                Fix 12.9 compatibility                                  2025-05-07 13:23:40 +08:00
jit_kernels        Weight gradient kernels for dense and MoE models (#95)  2025-05-14 14:47:58 +08:00
__init__.py        Weight gradient kernels for dense and MoE models (#95)  2025-05-14 14:47:58 +08:00
utils.py           Weight gradient kernels for dense and MoE models (#95)  2025-05-14 14:47:58 +08:00