diff --git a/README.md b/README.md index dab1f05..07bf7f1 100644 --- a/README.md +++ b/README.md @@ -18,7 +18,7 @@ Despite its lightweight design, DeepGEMM's performance matches or exceeds expert - [x] MoE scheduler with TMA multicast compatibility - [x] Fix TMA multicast compatibility for indivisible shapes - [ ] Skip useless computation on M -- [ ] NVRTC as a faster compiler +- [x] NVRTC as a faster compiler - [ ] Sanitizer for testing - [ ] Weight gradient kernels for dense models - [ ] Weight gradient kernels for MoE models