mirror of
https://github.com/deepseek-ai/DeepEP
synced 2025-06-26 18:28:11 +00:00
* Increase the test round. * Add warp synchronization. * Shuffle the send warps. * Add time elapsed into bench result.
* Increase the test round. * Add warp synchronization. * Shuffle the send warps. * Add time elapsed into bench result.