Mirror of https://github.com/deepseek-ai/DeepGEMM (synced 2025-06-26 23:15:49 +00:00)
Last commit:

* Init weight gradient kernels.
* Support unaligned n,k and gmem stride
* Update docs
* Several cleanups
* Remove restrictions on N
* Add stride(0) assertions

Co-authored-by: Chenggang Zhao <chenggangz@deepseek.com>

Files:

* `__init__.py`
* `gemm.py`
* `m_grouped_gemm.py`
* `runtime.py`
* `tuner.py`
* `utils.py`
* `wgrad_gemm.py`