Update README.md

This commit is contained in:
ZHU QIHAO 2023-11-14 12:16:26 +08:00 committed by GitHub
parent d3414b11be
commit b26405dac3
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -265,12 +265,12 @@ In the following scenario, the DeepSeek-Coder-6.7B model effectively calls a cla
### 5. How to Fine-tune DeepSeek-Coder ### 5. How to Fine-tune DeepSeek-Coder
We provide script `finetune_deepseekcoder.py` for users to finetune our models on downstream tasks. We provide script `finetune/finetune_deepseekcoder.py` for users to finetune our models on downstream tasks.
The script supports the training with [DeepSpeed](https://github.com/microsoft/DeepSpeed). You need install required packages by: The script supports the training with [DeepSpeed](https://github.com/microsoft/DeepSpeed). You need install required packages by:
```bash ```bash
pip install -r requirements.txt pip install -r finetune/requirements.txt
``` ```
Please follow [Sample Dataset Format](https://huggingface.co/datasets/nickrosh/Evol-Instruct-Code-80k-v1) to prepare your training data. Please follow [Sample Dataset Format](https://huggingface.co/datasets/nickrosh/Evol-Instruct-Code-80k-v1) to prepare your training data.
@ -285,7 +285,7 @@ DATA_PATH="<your_data_path>"
OUTPUT_PATH="<your_output_path>" OUTPUT_PATH="<your_output_path>"
MODEL="deepseek-ai/deepseek-coder-6.7b-instruct" MODEL="deepseek-ai/deepseek-coder-6.7b-instruct"
deepspeed finetune_deepseekcoder.py \ cd finetune && deepspeed finetune_deepseekcoder.py \
--model_name_or_path $MODEL_PATH \ --model_name_or_path $MODEL_PATH \
--data_path $DATA_PATH \ --data_path $DATA_PATH \
--output_dir $OUTPUT_PATH \ --output_dir $OUTPUT_PATH \