Triangle104
/

DRT-o1-14B-Q8_0-GGUF

Text Generation

machine tranlsation

Inference Endpoints

Model card Files Files and versions Community

Triangle104 commited on about 24 hours ago

Commit

96e1406

·

verified ·

1 Parent(s): 8b6c10c

Update README.md

Files changed (1) hide show

README.md +26 -0

README.md CHANGED Viewed

@@ -17,6 +17,32 @@ pipeline_tag: text-generation
 This model was converted to GGUF format from [`Krystalan/DRT-o1-14B`](https://huggingface.co/Krystalan/DRT-o1-14B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Krystalan/DRT-o1-14B) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`Krystalan/DRT-o1-14B`](https://huggingface.co/Krystalan/DRT-o1-14B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Krystalan/DRT-o1-14B) for more details on the model.
+---
+Model detils:
+-
+In this work, we introduce DRT-o1, an attempt to bring the success of
+ long thought reasoning to neural machine translation (MT). To this end,
+🌟 We mine English sentences with similes or metaphors from existing
+ literature books, which are suitable for translation via long thought.
+🌟 We propose a designed multi-agent framework with three agents
+(i.e., a translator, an advisor and an evaluator) to synthesize the MT
+samples with long thought. There are 22,264 synthesized samples in
+total.
+🌟 We train DRT-o1-8B, DRT-o1-7B and DRT-o1-14B using
+Llama-3.1-8B-Instruct, Qwen2.5-7B-Instruct and Qwen2.5-14B-Instruct as
+backbones.
+Our goal is not to achieve competitive performance with OpenAI’s O1
+in neural machine translation (MT). Instead, we explore technical routes
+ to bring the success of long thought to MT. To this end, we introduce
+DRT-o1, a byproduct of our exploration, and we hope it could facilitate the corresponding research in this direction.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)