mirror of
https://github.com/deepseek-ai/DeepEP
synced 2025-06-26 18:28:11 +00:00
Add some plans
This commit is contained in:
parent
1553fc42bf
commit
592296cd45
@ -280,6 +280,13 @@ For two micro-batch overlapping, you can refer to the following figure. With our
|
|||||||
|
|
||||||

|

|
||||||
|
|
||||||
|
## Roadmap
|
||||||
|
|
||||||
|
- [ ] A100 support (intranode only)
|
||||||
|
- [ ] Support BF16 for the low-latency dispatch kernel
|
||||||
|
- [ ] Support NVLink protocol for intranode low-latency kernels
|
||||||
|
- [ ] SM-free normal kernels
|
||||||
|
|
||||||
## Notices
|
## Notices
|
||||||
|
|
||||||
#### Easier potential overall design
|
#### Easier potential overall design
|
||||||
|
Loading…
Reference in New Issue
Block a user