Simeng Sun's picture

3 1

Simeng Sun

simsun131

https://people.cs.umass.edu/~simengsun/

AI & ML interests

Language Modeling, Machine Translation

Recent Activity

upvoted a paper 28 days ago

Hymba: A Hybrid-head Architecture for Small Language Models

upvoted a paper 29 days ago

Star Attention: Efficient LLM Inference over Long Sequences

View all activity

Organizations

simsun131's activity

upvoted a paper 28 days ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20 • 39

upvoted a paper 29 days ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published about 1 month ago • 47

upvoted a paper 9 months ago

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9 • 34