DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 1 day ago • 149
RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text Paper • 2305.13304 • Published May 22, 2023 • 2
view article Article Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ By Andyrasika • Apr 26, 2024 • 11