3 Commits

Author SHA1 Message Date
Shengyu Liu
70b9468520
Fix LaTeX render error (#74) 2025-04-23 10:21:14 +08:00
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟
6cff5a73f5
Minor fix to the docs to correct FlashAttention-3's paper link and typos (#73)
Thank you for open-sourcing FlashMLA! I just read the write-up, and it is
amazing work. I found a few minor typos, and the link to the FlashAttention-3
paper is wrong: it points to the original FlashAttention paper. This PR fixes
both. Thanks again!

Signed-off-by: Hollow Man <hollowman@opensuse.org>
2025-04-23 10:14:05 +08:00
Shengyu Liu
c2067be3ea
Performance Update (2025.04.22) (#71)
* Fix benchmark script

* Performance optimization for compute-bound cases

* Add new testcase (s_k = 16384)

* Update README.md

* Update comment

* Update README.md

* Add the deep-dive blog

* Add background color for MLA Kernel Sched.drawio.svg

* Use relative path for the schedule image

* Move flash_mla.h to kernels/params.h
2025-04-22 17:50:57 +08:00