diff --git a/README.md b/README.md index 84eb565..ebcc2fd 100644 --- a/README.md +++ b/README.md @@ -117,7 +117,7 @@ We evaluate our models and some baseline models on a series of representative be #### Never Seen Before Exam -To address data contamination and tuning for specific testsets, we have designed fresh problem sets to assess the capabilities of open-source LLM models. **The evaluation results indicate that DeepSeek LLM Chat 67B performs exceptionally well on never-before-seen exams.** +To address data contamination and tuning for specific testsets, we have designed fresh problem sets to assess the capabilities of open-source LLM models. **The evaluation results indicate that DeepSeek LLM 67B Chat performs exceptionally well on never-before-seen exams.** ---