mirror of
https://github.com/deepseek-ai/FlashMLA
synced 2025-06-26 18:15:54 +00:00
* Fix benchmark script * Performance optimization for compute-bound cases * Add new testcase (s_k = 16384) * Update README.md * Update comment * Update README.md * Add the deep-dive blog * Add background color for MLA Kernel Sched.drawio.svg * Use relative path for the schedule image * Move flash_mla.h to kernels/params.h
9 lines
67 B
Plaintext
9 lines
67 B
Plaintext
build
|
|
*.so
|
|
*.egg-info/
|
|
__pycache__/
|
|
dist/
|
|
*perf.csv
|
|
*.png
|
|
/.vscode
|