mirror of
https://github.com/deepseek-ai/DeepEP
synced 2025-06-26 18:28:11 +00:00
* Increase the test round. * Add warp synchronization. * Shuffle the send warps. * Add time elapsed into bench result. |
||
---|---|---|
.. | ||
kernels | ||
CMakeLists.txt | ||
config.hpp | ||
deep_ep.cpp | ||
deep_ep.hpp | ||
event.hpp |