view article Article From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • about 15 hours ago • 1
Aloe: A Family of Fine-tuned Open Healthcare LLMs Paper • 2405.01886 • Published May 3, 2024 • 5
Phi-4 (All Versions) Collection Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 4 items • Updated about 22 hours ago • 39