DeepEP/csrc
Chenggang Zhao 1b92be8a71
Add automatic warp count control for low-latency kernels (#213)
* Add automatic warp count control for low-latency dispatch

* Add automatic warp count control for low-latency combine

* More assertions
2025-06-16 11:56:43 +08:00
..
kernels Add automatic warp count control for low-latency kernels (#213) 2025-06-16 11:56:43 +08:00
CMakeLists.txt Use TMA instead of LD/ST for intra-node normal kernels (#191) 2025-06-06 15:40:17 +08:00
config.hpp Add automatic warp count control for low-latency kernels (#213) 2025-06-16 11:56:43 +08:00
deep_ep.cpp Add automatic warp count control for low-latency kernels (#213) 2025-06-16 11:56:43 +08:00
deep_ep.hpp Add automatic warp count control for low-latency kernels (#213) 2025-06-16 11:56:43 +08:00
event.hpp