From 857d57d1579784b0e6cec5660f72a618e0e74319 Mon Sep 17 00:00:00 2001 From: Zhean Xu <94977922+zheanxu@users.noreply.github.com> Date: Mon, 14 Apr 2025 17:03:35 +0800 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 5b9388a..b7924e1 100644 --- a/README.md +++ b/README.md @@ -22,7 +22,7 @@ Despite its lightweight design, DeepGEMM's performance matches or exceeds expert - [ ] CUDA PDL support - [ ] More scaling granularity support via templates - [ ] Larger TMA multicast size for some shapes -- [ ] MMA template refactor with CUTLASS +- [x] MMA template refactor with CUTLASS - [ ] Optimizations for unaligned shapes - [ ] Optimizations for power efficiency - [ ] Remove shape limitations on N and K