songhexiang
|
4dd1e68ac8
|
For the SMs which calculate metadata in notify_dispatch, each warp in the SM is used to calculate the metadata of one channel. The default configuration is 8 warps for 10 channels, which needs two rounds of loop. Maybe the number of warps can be configured to the number of the channels so that one loop is enough.
|
2025-03-28 06:43:29 +00:00 |
|
Chenggang Zhao
|
1fc40d50f3
|
Improve AR performance
|
2025-03-06 21:41:19 +08:00 |
|
Chenggang Zhao
|
458cdcb22a
|
Fix AR bugs for normal kernels
|
2025-03-05 17:13:35 +08:00 |
|
Chenggang Zhao
|
680e424bdc
|
Bugs fixed
|
2025-03-05 14:27:45 +08:00 |
|
Chenggang Zhao
|
ebfe47e46f
|
Initial commit
|
2025-02-25 09:07:53 +08:00 |
|