Oliver Hawthorne

Astris

AI & ML interests

None yet

Recent Activity

Organizations

Social Post Explorers, Hugging Face Discord Community

Astris's activity

reacted to lewtun's post with 🔥 22 days ago
I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive!

https://x.com/casper_hansen_/status/1875872309996855343

Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025!

[1] Training Large Language Models to Reason in a Continuous Latent Space (2412.06769)
[2] https://huggingface.co/blog/ganqu/prime
New activity in Astris/Furry-AO3-LoRA 30 days ago

What do?

3
#2 opened about 1 month ago by AnonFurry
New activity in Astris/Ensemble_of_models_VQA_v0.5 6 months ago