Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
3 days ago
mistralai/Mistral-Small-24B-Instruct-2501
liked
a model
3 days ago
deepseek-ai/DeepSeek-V3
upvoted
a
collection
16 days ago
DeepSeek-R1
Organizations
Collections
3
models
14
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/qwen2.5-coder-7b-awq
Updated
•
30
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/steiner-32b-preview-gguf
Updated
•
160
•
16
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/steiner-32b-preview-awq
Updated
•
8
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/steiner-32b-preview
Updated
•
26
•
43
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/peak-reasoning-7b-gguf
Updated
•
295
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/peak-reasoning-7b-awq
Updated
•
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/peak-reasoning-7b
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/qwen2.5-72b-instruct-trim
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/qwen2.5-32b-instruct-trim
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673707677603-noauth.jpeg)
peakji/qwen2.5-14b-instruct-trim
Updated
•
6
datasets
8
peakji/peak-text-with-context-2m
Viewer
•
Updated
•
2.07M
•
126
peakji/peak-anchor-content-plain-20k
Viewer
•
Updated
•
20.1k
•
148
peakji/peak-search-content-plain-40k
Viewer
•
Updated
•
40.4k
•
89
peakji/peak-anchor-content-35k
Viewer
•
Updated
•
35.6k
•
75
peakji/peak-search-content-70k
Viewer
•
Updated
•
70.2k
•
135
peakji/peak-anchor-40k
Viewer
•
Updated
•
42.7k
•
210
peakji/peak-search-300k
Viewer
•
Updated
•
312k
•
87
peakji/peak-intent-50
Viewer
•
Updated
•
265k
•
62