unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF · Error when trying to load this GGUF

7 days ago

llama_model_loader: - type f32: 321 tensors
llama_model_loader: - type q8_0: 450 tensors
llama_model_load: error loading model: error loading model vocabulary: unknown pre-tokenizer type: 'deepseek-r1-qwen'
llama_load_model_from_file: failed to load model
common_init_from_params: failed to load model '/Users/yuhai/Downloads/DeepSeek-R1-Distill-Qwen-32B-Q8_0.gguf'

command:

./llama.cpp/llama-cli \
  --model ~/Downloads/DeepSeek-R1-Distill-Qwen-32B-Q8_0.gguf \
  --threads 16 \
  --prompt '<｜User｜>What is 1+1?<｜Assistant｜>'

cloudyu

7 days ago

llama.cpp build: 4153 (6dfcfef0) with Apple clang version 15.0.0 (clang-1500.3.9.4) for arm64-apple-darwin23.5.0
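For anyone hitting the same error, here is a rough sketch of how a build line like the one above can be compared against a minimum build number. The helper names `build_number` and `needs_update` are hypothetical (they are not part of llama.cpp), and the threshold 4514 is taken from the release tag suggested in this thread.

```shell
# Hypothetical helper: read text on stdin and print the first number
# that follows "build: " (for example 4153 in the line posted above).
build_number() {
  sed -n 's/.*build: \([0-9][0-9]*\).*/\1/p' | head -n 1
}

# Hypothetical helper: succeeds when build $1 is older than minimum $2.
needs_update() {
  [ "$1" -lt "$2" ]
}

line='llama.cpp build: 4153 (6dfcfef0) with Apple clang version 15.0.0 (clang-1500.3.9.4) for arm64-apple-darwin23.5.0'
current=$(printf '%s\n' "$line" | build_number)

if needs_update "$current" 4514; then
  echo "build $current is older than b4514, an update is probably needed"
else
  echo "build $current is at least b4514"
fi
```

The comparison is purely numeric, so it only makes sense for the sequential build numbers that llama.cpp prints, not for commit hashes.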

haipeng

6 days ago

You may need to update llama.cpp to build b4514 or later.

https://github.com/ggerganov/llama.cpp/releases/tag/b4514
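A minimal sketch of one way to update, assuming llama.cpp was cloned with git into `./llama.cpp` and is built with CMake. The function name `update_llama_cpp` is made up for illustration, and your checkout location and build method may differ.

```shell
# Hypothetical helper: pull the latest llama.cpp source and rebuild it.
# Assumes $1 (default ./llama.cpp) is a git checkout of llama.cpp
# and that git, cmake and a C/C++ compiler are installed.
update_llama_cpp() {
  repo=${1:-./llama.cpp}
  git -C "$repo" pull --ff-only || return 1      # fetch the newer source
  cmake -S "$repo" -B "$repo/build" || return 1   # configure into build/
  cmake --build "$repo/build" --config Release    # compile
}

# Usage (not run here, it needs network access and a compiler):
#   update_llama_cpp ./llama.cpp
#   ./llama.cpp/build/bin/llama-cli --version
```

With a CMake build the binary normally ends up under `build/bin/`, so the path may differ from the `./llama.cpp/llama-cli` used in the original command.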

cloudyu

6 days ago

> you may need to update llama.cpp >= b4514
>
> https://github.com/ggerganov/llama.cpp/releases/tag/b4514

Thanks, solved by updating the code.

cloudyu changed discussion status to closed 6 days ago