Update plans

Author: Chenggang Zhao, 2025-04-24 14:37:53 +08:00
parent 95e81b3dd6
commit 33e0c3ce40


@@ -17,6 +17,9 @@ Despite its lightweight design, DeepGEMM's performance matches or exceeds expert
- [ ] Larger block size on N (up to 256)
- [x] MoE scheduler with TMA multicast compatibility
- [x] Fix TMA multicast compatibility for indivisible shapes
- [ ] Skip useless computation on M
- [ ] NVRTC as a faster compiler
- [ ] Sanitizer for testing
- [ ] Weight gradient kernels for dense models
- [ ] Weight gradient kernels for MoE models
- [ ] Utility kernels for MoE models (as a pre-built CUDA library)