DeepSeek-V3-GGUF / README.md
bullerwins's picture
Update README.md
2484d36 verified
|
raw
history blame
338 Bytes
metadata
base_model:
  - deepseek-ai/DeepSeek-V3

Initial preview for the GGUF quantized version of deepseek-ai/DeepSeek-V3

It needs this PR commit to work: https://github.com/ggerganov/llama.cpp/pull/11049

Thanks to Fairydreaming for the PR!

Note: no multi-token prediction (MTP) support