DeepEP/csrc
Zhicheng Wu 05df5554ff
Use one qp per sm for internode normal kernels (#181)
let the sender SM use the channel_id, and the receiver SM use channel_id + num_channels
2025-06-13 14:37:59 +08:00
..
kernels Use one qp per sm for internode normal kernels (#181) 2025-06-13 14:37:59 +08:00
CMakeLists.txt Use TMA instead of LD/ST for intra-node normal kernels (#191) 2025-06-06 15:40:17 +08:00
config.hpp Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
deep_ep.cpp Support UE8M0 data format. (#206) 2025-06-12 09:38:19 +08:00
deep_ep.hpp Support UE8M0 data format. (#206) 2025-06-12 09:38:19 +08:00
event.hpp Initial commit 2025-02-25 09:07:53 +08:00