Upload README.md
Browse files
README.md
CHANGED
@@ -132,7 +132,7 @@ huggingface-cli download ymcki/Llama-3_1-Nemotron 51B-Instruct-GGUF --include "L
|
|
132 |
|
133 |
## Running the model using llama-cli
|
134 |
|
135 |
-
First,
|
136 |
```
|
137 |
./llama-cli -m ~/Llama-3_1-Nemotron-51B-Instruct.Q3_K_S.gguf -p 'You are a European History Professor named Professor Whitman.' -cnv -ngl 100
|
138 |
```
|
|
|
132 |
|
133 |
## Running the model using llama-cli
|
134 |
|
135 |
+
First, go to llama.cpp [release page](https://github.com/ggerganov/llama.cpp/releases) and download the appropriate pre-compiled release starting from b4380. If that doesn't work, then download any version of llama.cpp starting from [b4380](https://github.com/ggerganov/llama.cpp/archive/refs/tags/b4380.tar.gz). Compile it, then run
|
136 |
```
|
137 |
./llama-cli -m ~/Llama-3_1-Nemotron-51B-Instruct.Q3_K_S.gguf -p 'You are a European History Professor named Professor Whitman.' -cnv -ngl 100
|
138 |
```
|