DeepSeek-MoE/requirements.txt
2024-01-11 10:35:17 +08:00

torch>=2.0.1
tokenizers>=0.14.0
transformers>=4.36.2
accelerate
attrdict
tqdm
deepspeed
datasets
tensorboardX
peft