Commit Graph

  • 7117f260e9 mask token declaration wanglei 2025-01-14 15:29:06 +0800
  • 9cb0b3ca37 Merge 8941732439 into ee4c4ea32b Jafar Saad 2025-01-13 02:06:14 +0700
  • ee4c4ea32b Merge pull request #234 from wangfuchun-fc/patch-1 main Xingkai Yu 2025-01-07 17:53:28 +0800
  • 25109d2ccd Merge pull request #230 from jacksonpradolima/main Huang Panpan 2025-01-07 14:05:15 +0800
  • fdbd5be754 Merge pull request #193 from enochkan/main Huang Panpan 2025-01-07 14:02:11 +0800
  • 3779a89770 fix: fix readme doc typo. wangfuchun-fc 2025-01-06 22:00:32 +0800
  • cd28bb1bf4 Merge pull request #1 from twlhitesh/improve-accessibility Hitesh Yadav 2025-01-06 15:54:11 +0530
  • 7d95c3e5ec Improve accessibility for CAPTCHA challenges in DeepSeek v3 Hitesh Yadav 2025-01-06 15:53:37 +0530
  • c070549279 Add CITATION.cff to provide citation metadata Jackson Antonio do Prado Lima 2025-01-05 21:46:37 -0300
  • bc77f22afc Updated model.py docstrings enoch kan 2025-01-05 18:24:31 +0000
  • a1296f099e Enhance documentation and update .gitignore for model conversion scripts enoch kan 2025-01-05 18:18:18 +0000
  • bc9459df40 refactor(inference): modularize model architecture for improved maintainability Hitesh Yadav 2025-01-05 16:28:10 +0530
  • fd011c11aa torch rmsnorm GeeeekExplorer 2025-01-05 14:33:48 +0800
  • dbd2938f1b Create azure-webapps-python.yml shhmfa 2025-01-04 10:07:01 -0500
  • 9b288b86cc Update README.md Xingkai Yu 2025-01-03 15:30:48 +0800
  • 0d16ea24c8 Merge pull request #206 from kutt/patch-1 Huang Panpan 2025-01-03 09:48:03 +0800
  • 21bc231f32 use alert formatting for notes in readme kutt 2025-01-02 15:02:52 +0100
  • f5fb13ee14 Created fuzz_target.py Shivam7-1 2024-12-31 16:10:31 +0530
  • 8710ec2ecb require model-parallel in convert.py Xingkai Yu 2024-12-31 18:05:55 +0800
  • 8941732439 Create python-app.yml Jafar Saad 2024-12-31 14:35:29 +0700
  • 7c2466b310 Update issue templates Huang Panpan 2024-12-31 14:49:05 +0800
  • dd6882bc3d Delete LICENSE-MODEL gogo67xxxggg 2024-12-30 15:20:45 -0600
  • ba602e3800 Delete .github/workflows directory gogo67xxxggg 2024-12-30 15:07:59 -0600
  • 641ed202ec Create python-package-conda.yml gogo67xxxggg 2024-12-30 15:07:35 -0600
  • 1b8e18cc29 Merge pull request #21 from eltociear/patch-1 Huang Panpan 2024-12-30 15:03:30 +0800
  • 94410f8d58 Merge pull request #33 from zhyncs/main Haswell Iris 2024-12-30 14:37:38 +0800
  • 68d0061937 upd zhyncs 2024-12-30 14:25:28 +0800
  • 2fc98d1cdf upd zhyncs 2024-12-30 14:21:00 +0800
  • a1edf4138e upd zhyncs 2024-12-30 14:18:00 +0800
  • 8638950ec2 docs: update SGLang usage zhyncs 2024-12-30 14:13:27 +0800
  • 83dd18eda4 Update README.md DeepSeekDDM 2024-12-30 11:04:14 +0800
  • 710c8b8b6e docs: update README.md Ikko Eltociear Ashimine 2024-12-29 00:43:11 +0900
  • 8f1c9488b5 handle missing scale_inv_name (#2) Yang Wang 2024-12-27 09:34:38 +0800
  • c8087bd8b8 Merge pull request #9 from simon-mo/vllm Huang Panpan 2024-12-27 09:16:09 +0800
  • e2c15caf04 add version simon-mo 2024-12-26 17:11:31 -0800
  • cf47874d8e Docs: add vLLM as supported engine simon-mo 2024-12-26 17:10:33 -0800
  • 6e7c5ee471 remove interface AK391 2024-12-26 18:09:56 +0100
  • 2ab082f2f5 add gradio app AK391 2024-12-26 17:25:34 +0100
  • 65d8f5f1e9 Add CUDA cache clearing in memory management Yang Wang 2024-12-26 23:18:39 +0800
  • e6e66fd23f sort filename to reduce memory costs Yang Wang 2024-12-26 23:14:44 +0800
  • 1e3a83629e handle missing scale_inv_name Yang Wang 2024-12-26 23:09:17 +0800
  • 4c2fdb8f55 Release DeepSeek-V3 stack-heap-overflow 2024-12-26 19:01:57 +0800
  • 4b58dc6bfc Initial commit stack-heap-overflow 2024-12-26 17:52:41 +0800