mirror of
https://github.com/deepseek-ai/DeepEP
synced 2025-06-26 18:28:11 +00:00
* Increase the test round. * Add warp synchronization. * Shuffle the send warps. * Add time elapsed into bench result. |
||
|---|---|---|
| .. | ||
| kernels | ||
| CMakeLists.txt | ||
| config.hpp | ||
| deep_ep.cpp | ||
| deep_ep.hpp | ||
| event.hpp | ||