From 5da24e229a60e85504875c0c0deb92fadc33be2d Mon Sep 17 00:00:00 2001
From: Antonio Cheong
Date: Wed, 26 Feb 2025 02:36:04 +0000
Subject: [PATCH] spelling: README.md behavior -> behaves

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 14435f3..ff45782 100644
--- a/README.md
+++ b/README.md
@@ -10,7 +10,7 @@ Despite its lightweight design, DeepGEMM's performance matches or exceeds expert
 
 We test all shapes potentially used in DeepSeek-V3/R1 inference (including both prefilling and decoding, but without tensor parallelism) on H800 with NVCC 12.8. All speedup metrics are calculated in comparison to our internally and carefully optimized implementation based on CUTLASS 3.6.
 
-DeepGEMM does not behavior very well on some shapes, optimization PRs are welcomed if you are interested.
+DeepGEMM does not behave very well on some shapes, optimization PRs are welcomed if you are interested.
 
 ### Normal GEMMs for dense models
 