Grouped GEMM skip useless computation for unaligned Ms (#103)

* Grouped GEMM skip useless computation for unaligned Ms

* Update readme.md

* small typo

* Rename variables

* Restore previous indent

* Format

* Refactor tests

* Add `SkipComputation` types

* Bug fixed

* Format

* Fix tests

* Add assertions

* Minor fix

---------

Co-authored-by: yukuai <yukuai@deepseek.com>
Co-authored-by: Chenggang Zhao <chenggangz@deepseek.com>
This commit is contained in:
yukuai26
2025-05-27 13:43:38 +08:00
committed by GitHub
parent 391755ada0
commit 8dfa329827
5 changed files with 106 additions and 93 deletions

View File

@@ -121,7 +121,7 @@ class Compiler:
'--ptxas-options=--register-usage-level=10' +
(',--verbose' if 'DG_JIT_PTXAS_VERBOSE' in os.environ else ''),
# Suppress some unnecessary warnings, such as unused variables for certain `constexpr` branch cases
'--diag-suppress=39,161,174,177,940']
'--diag-suppress=39,161,174,177,186,940']
@staticmethod
def include_dirs() -> List[str]: