Triangle104 committed
Commit 96e1406 · verified · 1 Parent(s): 8b6c10c

Update README.md

Files changed (1)
  1. README.md +26 -0
README.md CHANGED
@@ -17,6 +17,32 @@ pipeline_tag: text-generation
  This model was converted to GGUF format from [`Krystalan/DRT-o1-14B`](https://huggingface.co/Krystalan/DRT-o1-14B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/Krystalan/DRT-o1-14B) for more details on the model.
 
+ ---
+ Model details:
+ -
+ In this work, we introduce DRT-o1, an attempt to bring the success of
+ long thought reasoning to neural machine translation (MT). To this end:
+
+
+ 🌟 We mine English sentences with similes or metaphors from existing
+ literary works, which are suitable for translation via long thought.
+ 🌟 We propose a multi-agent framework with three agents
+ (i.e., a translator, an advisor and an evaluator) to synthesize the MT
+ samples with long thought. There are 22,264 synthesized samples in
+ total.
+ 🌟 We train DRT-o1-8B, DRT-o1-7B and DRT-o1-14B using
+ Llama-3.1-8B-Instruct, Qwen2.5-7B-Instruct and Qwen2.5-14B-Instruct as
+ backbones, respectively.
+
+
+
+
+ Our goal is not to achieve competitive performance with OpenAI’s o1
+ in neural machine translation (MT). Instead, we explore technical routes
+ to bring the success of long thought to MT. To this end, we introduce
+ DRT-o1, a byproduct of our exploration, and we hope it will facilitate further research in this direction.
+
+ ---
  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)
 