error when to try this gguf

#3
by cloudyu - opened

llama_model_loader: - type f32: 321 tensors
llama_model_loader: - type q8_0: 450 tensors
llama_model_load: error loading model: error loading model vocabulary: unknown pre-tokenizer type: 'deepseek-r1-qwen'
llama_load_model_from_file: failed to load model
common_init_from_params: failed to load model '/Users/yuhai/Downloads/DeepSeek-R1-Distill-Qwen-32B-Q8_0.gguf'

command:

./llama.cpp/llama-cli
--model ~/Downloads/DeepSeek-R1-Distill-Qwen-32B-Q8_0.gguf
--threads 16
--prompt '<|User|>What is 1+1?<|Assistant|>'

llama.cpp build: 4153 (6dfcfef0) with Apple clang version 15.0.0 (clang-1500.3.9.4) for arm64-apple-darwin23.5.0

you may need to update llama.cpp >= b4514

https://github.com/ggerganov/llama.cpp/releases/tag/b4514

you may need to update llama.cpp >= b4514

https://github.com/ggerganov/llama.cpp/releases/tag/b4514

thanks, solved by update code

cloudyu changed discussion status to closed

Sign up or log in to comment