This model was converted to GGUF format from [`mistralai/Mistral-Small-24B-Instruct-2501`](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
Refer to the [original model card](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501) for more details on the model.
29 |
---

Model details:

Mistral Small 3 (2501) sets a new benchmark in the "small" Large Language Model category below 70B: with 24B parameters, it achieves state-of-the-art capabilities comparable to larger models.

This model is an instruction-fine-tuned version of the base model Mistral-Small-24B-Base-2501.

Mistral Small can be deployed locally and is exceptionally "knowledge-dense", fitting on a single RTX 4090 or a 32GB RAM MacBook once quantized.

Perfect for:

- Fast-response conversational agents.
- Low-latency function calling.
- Subject-matter experts via fine-tuning.
- Local inference for hobbyists and organizations handling sensitive data.

For enterprises that need specialized capabilities (increased context, particular modalities, domain-specific knowledge, etc.), we will be releasing commercial models beyond what Mistral AI contributes to the community.

This release demonstrates our commitment to open source, serving as a strong base model.

Learn more about Mistral Small in our blog post.

Model developer: Mistral AI Team

Key Features:

- Multilingual: Supports dozens of languages, including English, French, German, Spanish, Italian, Chinese, Japanese, Korean, Portuguese, Dutch, and Polish.
- Agent-Centric: Offers best-in-class agentic capabilities with native function calling and JSON output.
- Advanced Reasoning: State-of-the-art conversational and reasoning capabilities.
- Apache 2.0 License: Open license allowing usage and modification for both commercial and non-commercial purposes.
- Context Window: A 32k context window.
- System Prompt: Maintains strong adherence and support for system prompts.
- Tokenizer: Utilizes a Tekken tokenizer with a 131k vocabulary size.

---
## Use with llama.cpp
Install llama.cpp through brew (works on Mac and Linux)
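As a sketch of the usual GGUF-my-repo workflow, the commands below install llama.cpp via Homebrew and run the model with `llama-cli`, which can fetch a GGUF file directly from the Hugging Face Hub via `--hf-repo`/`--hf-file`. The repo and file names are placeholders, not this repository's actual identifiers — substitute the real quantized GGUF filename listed in this repo's "Files" tab.

```shell
# Install llama.cpp (the Homebrew formula works on macOS and Linux)
brew install llama.cpp

# Run inference; --hf-repo/--hf-file download the GGUF from the Hub.
# <user>/<repo> and <file>.gguf are placeholders -- replace them with
# this repository's name and the quantized file you want to use.
llama-cli --hf-repo <user>/<repo> \
  --hf-file <file>.gguf \
  -p "Explain quantization in one sentence."
```

The same `--hf-repo`/`--hf-file` flags also work with `llama-server` if you prefer an OpenAI-compatible HTTP endpoint instead of the CLI.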