mirror of https://github.com/deepseek-ai/FlashMLA (synced 2025-06-26 18:15:54 +00:00)
Update README.md

parent 15f3897667
commit c7996e951d
@@ -6,7 +6,7 @@ We're excited to announce the new release of Flash MLA, which delivers 5% ~ 15%
 
 Besides, we'd love to share the technical details behind the new kernel! Check out our deep-dive write-up here: <LINK>
 
-The new kernel primarily targets compute-intensive settings (where the number of q heads $\times$ the number of q sequences per request (if MTP is disabled then it's 1) $\ge 64$). For memory-bound cases, we recommend using version [b31bfe7](https://github.com/deepseek-ai/FlashMLA/tree/b31bfe72a83ea205467b3271a5845440a03ed7cb) for optimal performance.
+The new kernel primarily targets compute-intensive settings (where the number of q heads $\times$ the number of q tokens per request (if MTP is disabled then it's 1) $\ge 64$). For memory-bound cases, we recommend using version [b31bfe7](https://github.com/deepseek-ai/FlashMLA/tree/b31bfe72a83ea205467b3271a5845440a03ed7cb) for optimal performance.
 
 ## Introduction
 
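The changed line states a simple selection heuristic: the new kernel is preferred when the product of q heads and q tokens per request reaches 64. A minimal sketch of that rule, with a hypothetical helper name (`use_new_flashmla_kernel` is not part of the FlashMLA API, just an illustration):

```python
def use_new_flashmla_kernel(num_q_heads: int, num_q_tokens_per_request: int = 1) -> bool:
    # Heuristic from the README: the new kernel targets compute-intensive
    # settings where num_q_heads * num_q_tokens_per_request >= 64.
    # num_q_tokens_per_request defaults to 1, i.e. MTP disabled.
    return num_q_heads * num_q_tokens_per_request >= 64

# 128 q heads, MTP disabled: compute-bound, use the new kernel.
print(use_new_flashmla_kernel(128))   # -> True
# 16 q heads, MTP disabled: memory-bound, prefer commit b31bfe7.
print(use_new_flashmla_kernel(16))    # -> False
```

With MTP enabled, `num_q_tokens_per_request` grows above 1, so even a modest head count can cross the 64 threshold.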