229 145 2768

Knut Jägersberg

KnutJaegersberg

jagersbergknut

AI & ML interests

NLP, opinion mining, narrative intelligence

Recent Activity

updated a model 1 day ago

KnutJaegersberg/GemmaCoder3-12B-Q8_0-GGUF

published a model 1 day ago

KnutJaegersberg/GemmaCoder3-12B-Q8_0-GGUF

liked a model 1 day ago

burtenshaw/GemmaCoder3-12B

View all activity

Organizations

KnutJaegersberg's activity

updated a model 1 day ago

KnutJaegersberg/GemmaCoder3-12B-Q8_0-GGUF

Updated 1 day ago • 4

published a model 1 day ago

KnutJaegersberg/GemmaCoder3-12B-Q8_0-GGUF

Updated 1 day ago • 4

liked a model 1 day ago

burtenshaw/GemmaCoder3-12B

Image-Text-to-Text • Updated 2 days ago • 76 • 27

updated a model 2 days ago

KnutJaegersberg/Qwen2.5-7B-Instruct-RLVR-Q8_0-GGUF

Updated 2 days ago • 5

published a model 2 days ago

KnutJaegersberg/Qwen2.5-7B-Instruct-RLVR-Q8_0-GGUF

Updated 2 days ago • 5

liked a model 2 days ago

virtuoussy/Qwen2.5-7B-Instruct-RLVR

Updated about 22 hours ago • 57 • 6

upvoted a collection 2 days ago

RLVR

Collection

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 3 days ago • 9

liked a dataset 2 days ago

virtuoussy/Multi-subject-RLVR

Viewer • Updated about 22 hours ago • 579k • 119 • 35

reacted to BlinkDL's post with 🔥 5 days ago

Post

5957

RWKV-7 "Goose" 0.4B trained w/ ctx4k automatically extrapolates to ctx32k+, and perfectly solves NIAH ctx16k 🤯 100% RNN and attention-free. Only trained on the Pile. No finetuning. Replicable training runs. tested by our community: https://github.com/Jellyfish042/LongMamba