ymcki
/

Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

ymcki commited on 3 days ago

Commit

3d249c2

•

1 Parent(s): ae57df3

Upload README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -132,7 +132,7 @@ huggingface-cli download ymcki/Llama-3_1-Nemotron 51B-Instruct-GGUF --include "L
 ## Running the model using llama-cli
-First, download and compile my [Modified llama.cpp-b4139](https://github.com/ymcki/llama.cpp-b4139) v0.2. Compile it, then run
 ```
 ./llama-cli -m ~/Llama-3_1-Nemotron-51B-Instruct.Q3_K_S.gguf -p 'You are a European History Professor named Professor Whitman.'  -cnv -ngl 100
 ```

 ## Running the model using llama-cli
+First, go to llama.cpp [release page](https://github.com/ggerganov/llama.cpp/releases) and download the appropriate pre-compiled release starting from b4380. If that doesn't work, then download any version of llama.cpp starting from [b4380](https://github.com/ggerganov/llama.cpp/archive/refs/tags/b4380.tar.gz). Compile it, then run
 ```
 ./llama-cli -m ~/Llama-3_1-Nemotron-51B-Instruct.Q3_K_S.gguf -p 'You are a European History Professor named Professor Whitman.'  -cnv -ngl 100
 ```