arxiv:2501.00958
Yongliang Shen
tricktreat
AI & ML interests
None yet
Recent Activity
liked
a model
2 days ago
deepseek-ai/DeepSeek-R1
liked
a model
2 days ago
deepseek-ai/DeepSeek-R1-Zero
liked
a dataset
12 days ago
O1-OPEN/OpenO1-SFT-Ultra
Organizations
models
25
tricktreat/iChainGPT-Instruct
Updated
tricktreat/ichaingpt
Updated
•
3
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-12layers-T6-25000steps-lora612-hhrlhf
Text Generation
•
Updated
•
12
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo
Text Generation
•
Updated
•
14
tricktreat/llama-2-7b-chat-12layers-T6-25000steps-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo
Text Generation
•
Updated
•
12
tricktreat/llama-2-7b-chat-12layers-T6-25000steps-llama-2-7b-chat-12layers-T6-25000steps-lora612-hhrlhf
Text Generation
•
Updated
•
11
tricktreat/llama-2-7b-chat-12layers-T6-25000steps-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo-2
Text Generation
•
Updated
•
114
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo-2
Text Generation
•
Updated
•
12
tricktreat/llama-2-7b-chat-12layers-T6-merged-with-llama-2-7b-chat-peft-lora-orpo
Text Generation
•
Updated
•
13
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-peft-lora-orpo
Text Generation
•
Updated
•
12
datasets
None public yet