Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 3 days ago • 88
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published 4 days ago • 55
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 4 days ago • 151
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 7 items • Updated 2 days ago • 46
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 8 items • Updated 4 days ago • 25
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Paper • 2402.12875 • Published Feb 20 • 12
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning Paper • 2407.10718 • Published Jul 15 • 17
Video Generation models Collection The domain of video generation is booming. Here is a list of selected Open Access video generation (T2V) models. • 14 items • Updated 26 days ago • 12
Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • 27 days ago • 34
Article Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19 • 11
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 7 items • Updated 5 days ago • 54
Top LLM Collection Collection of top open-source LLMs, sorted with the best on top • 6 items • Updated Jul 26 • 9
Article Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • Jul 19 • 2
Article Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI By Abhaykoul • Jul 12 • 3
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Paper • 2403.04692 • Published Mar 7 • 40
Article Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs Jun 5 • 17
Article Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI. By KingNish • May 21 • 30
llama 3 self-align experiments Collection Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co/blog/sc2-instruct • 4 items • Updated May 9 • 6
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published May 19 • 53
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Paper • 1901.02860 • Published Jan 9, 2019 • 2
Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses Paper • 2312.16233 • Published Dec 25, 2023 • 2
Instant Space Collection Contains Spaces that give lightning-fast results compared to others. • 11 items • Updated Jul 26 • 6
Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • Apr 29 • 28