Request for GGUF support through llama.cpp

#1
by Doctor-Chad-PhD - opened

Dear Tencent Team,

I would like to request support for GGUF quantization through the llama.cpp library, as this would allow more users to run your new model.
The repo for llama.cpp can be found here: https://github.com/ggerganov/llama.cpp.
Thank you for considering this request.
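For reference, once llama.cpp supports the architecture, the usual workflow for producing a GGUF file looks roughly like this (the model directory path and quantization type below are placeholders, not specific to this model):

```shell
# Clone llama.cpp and install the conversion script's dependencies
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
pip install -r requirements.txt

# Convert the Hugging Face checkpoint to an fp16 GGUF file
# ("path/to/model" is a placeholder for the downloaded model directory)
python convert_hf_to_gguf.py path/to/model --outfile model-f16.gguf

# Build llama.cpp, then quantize the GGUF file, e.g. to Q4_K_M
cmake -B build && cmake --build build --config Release
./build/bin/llama-quantize model-f16.gguf model-Q4_K_M.gguf Q4_K_M
```

Note that conversion only works after the model's architecture is implemented in llama.cpp, which is what this request is asking for.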

Have you created an issue on the GitHub repo? If you do, there will be a better chance of it being implemented.

You can try with chatllm.cpp now.
