Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 3 days ago • 33
Post: We fixed many bugs in Phi-4 and uploaded fixed GGUF + 4-bit versions! Our fixed versions score even higher on the Open LLM Leaderboard than Microsoft's! GGUFs: unsloth/phi-4-GGUF. Dynamic 4-bit: unsloth/phi-4-unsloth-bnb-4bit. You can also now finetune Phi-4 for free on Colab: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4-Conversational.ipynb Read our blog post for more details on the bug fixes etc.: https://unsloth.ai/blog/phi4
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 5 days ago • 66
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 4 days ago • 70
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 9 days ago • 72
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 4 days ago • 187
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published 15 days ago • 43
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization Paper • 2412.18525 • Published 19 days ago • 65
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published 16 days ago • 78
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 10 days ago • 46
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 11 days ago • 92