Remove NVLink low-latency plan

This commit is contained in:
Chenggang Zhao 2025-03-27 17:15:01 +08:00
parent cbd92fd0fc
commit e130cc6e7d

View File

@ -283,7 +283,7 @@ For two micro-batch overlapping, you can refer to the following figure. With our
- [x] Refactor low-latency mode AR code - [x] Refactor low-latency mode AR code
- [ ] A100 support (intranode only) - [ ] A100 support (intranode only)
- [x] Support BF16 for the low-latency dispatch kernel - [x] Support BF16 for the low-latency dispatch kernel
- [ ] Support NVLink protocol for intranode low-latency kernels - [x] ~~Support NVLink protocol for intranode low-latency kernels~~ (conflict with hook-based overlapping)
- [ ] SM-free normal kernels - [ ] SM-free normal kernels
## Notices ## Notices