shisa-ai's Collections
shisa-v2-research
updated
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 66
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper • 2406.20094 • Published • 97
argilla/magpie-ultra-v1.0
Viewer • Updated • 3.22M • 6.7k • 41
Viewer • Updated • 1k • 2.33k • 53
Viewer • Updated • 817 • 4.32k • 111
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper • 2401.01335 • Published • 65
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 61
Self-Boosting Large Language Models with Synthetic Preference Data
Paper • 2410.06961 • Published • 16
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • Published • 18
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B
Viewer • Updated • 150k • 1.29k • 15
sbintuitions/modernbert-ja-130m
Fill-Mask • Updated • 7.79k • 31
bespokelabs/Bespoke-Stratos-17k
Viewer • Updated • 16.7k • 92.5k • 277
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
Paper • 2312.01523 • Published