Knut Jägersberg's picture

Knut Jägersberg

KnutJaegersberg

AI & ML interests

NLP, opinion mining, narrative intelligence

Recent Activity

updated a model about 16 hours ago
KnutJaegersberg/GemmaCoder3-12B-Q8_0-GGUF
published a model about 16 hours ago
KnutJaegersberg/GemmaCoder3-12B-Q8_0-GGUF
liked a model about 16 hours ago
burtenshaw/GemmaCoder3-12B
View all activity

Organizations

LLMs's profile picture Blog-explorers's profile picture Qwen's profile picture Social Post Explorers's profile picture M4-ai's profile picture Chinese LLMs on Hugging Face's profile picture Smol Community's profile picture

KnutJaegersberg's activity

reacted to BlinkDL's post with 🔥 4 days ago
view post
Post
5916
RWKV-7 "Goose" 0.4B trained w/ ctx4k automatically extrapolates to ctx32k+, and perfectly solves NIAH ctx16k 🤯 100% RNN and attention-free. Only trained on the Pile. No finetuning. Replicable training runs. tested by our community: https://github.com/Jellyfish042/LongMamba