view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 14 days ago ā¢ 61
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Paper ā¢ 2412.06071 ā¢ Published Dec 8, 2024 ā¢ 9
view article Article dstack to manage clusters of on-prem servers for AI workloads with ease By chansung ā¢ Oct 10, 2024 ā¢ 7
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. ā¢ 2 items ā¢ Updated Dec 13, 2024 ā¢ 83
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques š š By Isayoften ā¢ Aug 26, 2024 ā¢ 48
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs Paper ā¢ 2408.13467 ā¢ Published Aug 24, 2024 ā¢ 25
view article Article dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified By chansung ā¢ Aug 22, 2024 ā¢ 12
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell ā¢ Apr 28, 2024 ā¢ 37
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Paper ā¢ 2306.14435 ā¢ Published Jun 26, 2023 ā¢ 20