mirror of
https://github.com/deepseek-ai/DeepGEMM
synced 2025-06-26 23:15:49 +00:00
* Add swizzling params * Add TMA D descriptor * Always use STSMx2 * Swizzling draft * Compatible with padding * Fix bugs * Optimize swizzle performance * Optimize expression * Optimize TMA issues * Fix README * Stricter assertions |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| compiler.py | ||
| interleave_ffma.py | ||
| runtime.py | ||
| template.py | ||