DeepSeek-V3/inference
2025-01-05 14:33:48 +08:00
..
configs Release DeepSeek-V3 2024-12-26 19:01:57 +08:00
convert.py require model-parallel in convert.py 2024-12-31 18:05:55 +08:00
fp8_cast_bf16.py handle missing scale_inv_name (#2) 2024-12-27 09:34:38 +08:00
generate.py Release DeepSeek-V3 2024-12-26 19:01:57 +08:00
kernel.py Release DeepSeek-V3 2024-12-26 19:01:57 +08:00
model.py torch rmsnorm 2025-01-05 14:33:48 +08:00
requirements.txt Release DeepSeek-V3 2024-12-26 19:01:57 +08:00