Llama-3_1-Nemotron-51B-Instruct-GGUF
Original Model
nvidia/Llama-3_1-Nemotron-51B-Instruct
Run with Gaianet
Prompt template:
prompt template: llama-3-chat
Context size:
chat_ctx_size: 8192
Run with GaiaNet:
Quick start: https://docs.gaianet.ai/node-guide/quick-start
Customize your node: https://docs.gaianet.ai/node-guide/customize
Quantized with llama.cpp b4381
- Downloads last month
- 85
Inference API (serverless) does not yet support model repos that contain custom code.
Model tree for gaianet/Llama-3_1-Nemotron-51B-Instruct-GGUF
Base model
nvidia/Llama-3_1-Nemotron-51B-Instruct