DeepEP/csrc/kernels
2025-06-24 10:12:23 +08:00
File              Last commit                                                        Date
api.cuh           Remove the low-latency usage flag (#214)                           2025-06-16 13:30:14 +08:00
buffer.cuh        Fully remove forwarders' and NVL receivers' code                   2025-06-19 13:48:07 +08:00
CMakeLists.txt    Support UE8M0 data format. (#206)                                  2025-06-12 09:38:19 +08:00
configs.cuh       Add the transaction window data structure for RDMA senders (#245)  2025-06-24 09:12:40 +08:00
exception.cuh
ibgda_device.cuh  Fully remove barrier FIFO designs (#200)                           2025-06-10 16:23:20 +08:00
internode_ll.cu   Update internode_ll.cu (#246)                                      2025-06-23 15:18:10 +08:00
internode.cu      Add transaction windows                                            2025-06-24 10:12:23 +08:00
intranode.cu      Optimize intranode combine. (#247)                                 2025-06-24 09:10:23 +08:00
launch.cuh        Add the transaction window data structure for RDMA senders (#245)  2025-06-24 09:12:40 +08:00
layout.cu         Support Ampere architecture (#204)                                 2025-06-11 15:48:18 +08:00
runtime.cu        Support Ampere architecture (#204)                                 2025-06-11 15:48:18 +08:00
utils.cuh         Merge remote-tracking branch 'origin/main' into internode-tma      2025-06-24 09:29:07 +08:00