Commit Graph

4 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Kevin Zhang | e0557deb3a | Feature:Support flashMLA decoding via flashAttn2(#29). Changes: 1. Implement flashMLA with matrix absorption algorithm via flashAttn2; 2. Add golden test on MXMACA platform | 2025-02-24 23:56:05 +08:00 |
| lancerts | 4fbaa9527c | minor fix test | 2025-02-23 20:12:49 -08:00 |
| sazc | 051e40e82b | tests: Triton had remove the fast_flush parameter from do_bench (#4485) | 2025-02-24 10:59:22 +08:00 |
| Jiashi Li | 414a2f3eed | Initial commit | 2025-02-24 09:20:23 +08:00 |