A-transformer
92521df34d
Clarify comment about memory consistency and barrier visibility
...
Memory consistency and barrier visibility: both __syncthreads() and cute::cluster_sync() are synchronization points, so no thread proceeds past the barrier until every thread in the synchronization scope (the thread block for __syncthreads(), the thread block cluster for cute::cluster_sync()) has reached it. This guarantees that all prior memory operations, including barrier initialization, are visible to every thread within that scope.
2025-02-27 22:01:50 +04:00
A-transformer
a2e0d68eed
Merge pull request #2 from deepseek-ai/main
...
sync
2025-02-27 21:47:31 +04:00
Liang
fbec9e5eee
Update get_best_configs
...
a better strategy to choose config
2025-02-27 23:18:52 +08:00
Zhean Xu
461427ecd0
Merge pull request #27 from vatlor/main
...
fix typo
2025-02-27 20:37:31 +08:00
dotrail
488b5fc467
fix typo
2025-02-27 11:53:33 +00:00
Chenggang Zhao
6da94d2d36
Add extra TMA checks
2025-02-27 18:20:57 +08:00
Chenggang Zhao
ca13ce0fab
Fix TMA store bugs and code format
2025-02-27 17:57:21 +08:00
Chenggang Zhao
b05ed2f017
Code format
2025-02-27 10:50:20 +08:00
Chenggang Zhao
676329b8e2
Merge pull request #19 from dzhulgakov/fix-wheel
...
Fix wheel building
2025-02-27 10:44:11 +08:00
Chenggang Zhao
6e55da296f
Fix python -O mode issues
2025-02-27 10:42:46 +08:00
Chenggang Zhao
d5b974da2b
Merge pull request #16 from AcraeaTerpsicore/patch-1
...
Fix typos
2025-02-27 10:34:12 +08:00
Dmytro Dzhulgakov
fc7c3f8299
setup.py: fix wheel building
2025-02-26 17:48:57 +00:00
Zhean Xu
78cacf70d4
Update README.md
2025-02-26 19:20:39 +08:00
AcraeaTerpsicore
96b31fd6bb
fix typo
2025-02-26 18:37:22 +08:00
xuzhean
bc989405fe
fix: prevent expected_m from exceeding m in test_core
2025-02-26 16:55:47 +08:00
Zhean Xu
eec7ab7f03
Merge pull request #13 from ZeppLu/patch-1
...
doc: Use permanent link
2025-02-26 16:34:23 +08:00
Zepp
7a70b439cd
doc: Use permanent link
2025-02-26 16:15:37 +08:00
Chenggang Zhao
184ce9b5ea
Merge pull request #3 from acheong08/patch-1
...
Spelling: README.md
2025-02-26 13:29:45 +08:00
Antonio Cheong
5da24e229a
spelling: README.md
...
behavior -> behaves
2025-02-26 02:36:04 +00:00
Chenggang Zhao
a6d97a1c1b
Initial commit
2025-02-25 22:52:41 +08:00