Update README.md

This commit is contained in:
Zhean Xu 2025-04-14 17:03:35 +08:00 committed by GitHub
parent 584b67eebb
commit 857d57d157
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -22,7 +22,7 @@ Despite its lightweight design, DeepGEMM's performance matches or exceeds expert
- [ ] CUDA PDL support
- [ ] More scaling granularity support via templates
- [ ] Larger TMA multicast size for some shapes
- [ ] MMA template refactor with CUTLASS
- [x] MMA template refactor with CUTLASS
- [ ] Optimizations for unaligned shapes
- [ ] Optimizations for power efficiency
- [ ] Remove shape limitations on N and K