Upload README.md
Browse files
README.md
ADDED
@@ -0,0 +1,44 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: other
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
pipeline_tag: text-generation
|
6 |
+
inference: false
|
7 |
+
tags:
|
8 |
+
- transformers
|
9 |
+
- gguf
|
10 |
+
- imatrix
|
11 |
+
- QwQ-LCoT-7B-Instruct
|
12 |
+
---
|
13 |
+
Quantizations of https://huggingface.co/prithivMLmods/QwQ-LCoT-7B-Instruct
|
14 |
+
|
15 |
+
### Inference Clients/UIs
|
16 |
+
* [llama.cpp](https://github.com/ggerganov/llama.cpp)
|
17 |
+
* [KoboldCPP](https://github.com/LostRuins/koboldcpp)
|
18 |
+
* [ollama](https://github.com/ollama/ollama)
|
19 |
+
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)
|
20 |
+
* [jan](https://github.com/janhq/jan)
|
21 |
+
* [GPT4All](https://github.com/nomic-ai/gpt4all)
|
22 |
+
---
|
23 |
+
|
24 |
+
# From original readme
|
25 |
+
|
26 |
+
The **QwQ-LCoT-7B-Instruct** is a fine-tuned language model designed for advanced reasoning and instruction-following tasks. It leverages the **Qwen2.5-7B** base model and has been fine-tuned on the **amphora/QwQ-LongCoT-130K** dataset, focusing on chain-of-thought (CoT) reasoning.
|
27 |
+
|
28 |
+
### **Training Dataset:**
|
29 |
+
- **Dataset Name:** [amphora/QwQ-LongCoT-130K](https://huggingface.co/datasets/amphora/QwQ-LongCoT-130K)
|
30 |
+
- **Size:** 133k examples.
|
31 |
+
- **Focus:** Chain-of-Thought reasoning for complex tasks.
|
32 |
+
|
33 |
+
---
|
34 |
+
|
35 |
+
### **Use Cases:**
|
36 |
+
1. **Instruction Following:**
|
37 |
+
Handle user instructions effectively, even for multi-step tasks.
|
38 |
+
|
39 |
+
2. **Reasoning Tasks:**
|
40 |
+
Perform logical reasoning and generate detailed step-by-step solutions.
|
41 |
+
|
42 |
+
3. **Text Generation:**
|
43 |
+
Generate coherent, context-aware responses.
|
44 |
+
---
|