mirror of https://github.com/deepseek-ai/DeepEP
synced 2025-06-26 18:28:11 +00:00
Add some plans
parent 1553fc42bf
commit 592296cd45
@@ -280,6 +280,13 @@ For two micro-batch overlapping, you can refer to the following figure. With our
## Roadmap
- [ ] A100 support (intranode only)
- [ ] Support BF16 for the low-latency dispatch kernel
- [ ] Support NVLink protocol for intranode low-latency kernels
- [ ] SM-free normal kernels
## Notices
#### Easier potential overall design