DeepEP/csrc/kernels
Chenggang Zhao b8d90fb753
Support Ampere architecture (#204)
* Update README

* Update `setup.py`

* Fix headers

* Add `DISABLE_NVSHMEM` for APIs

* Fix launch

* Fix TMA settings

* Fix TMA usages

* Fix dlink

* Separate layout kernels

* Update version

* Add `is_sm90_compiled`

* Fix tests

* Add NVLink connection checks

* Update README

* Fix tests

* Add some comments

* Minor fix

* Minor fix

* Fix bugs
2025-06-11 15:48:18 +08:00
..
api.cuh Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
buffer.cuh Initial commit 2025-02-25 09:07:53 +08:00
CMakeLists.txt Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
configs.cuh Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
exception.cuh Initial commit 2025-02-25 09:07:53 +08:00
ibgda_device.cuh Fully remove barrier FIFO designs (#200) 2025-06-10 16:23:20 +08:00
internode_ll.cu Fully remove barrier FIFO designs (#200) 2025-06-10 16:23:20 +08:00
internode.cu Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
intranode.cu Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
launch.cuh Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
layout.cu Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
runtime.cu Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00
utils.cuh Support Ampere architecture (#204) 2025-06-11 15:48:18 +08:00