- Paper: Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling (arXiv:2502.06703)
- Article: Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity, by AviSoori1x (Mar 18, 2024)
- Article: makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch, by AviSoori1x (May 7, 2024)