Sultan Alrashed's picture

Sultan Alrashed PRO

SultanR

·

https://sulrash.github.io/

AI & ML interests

Smol language modelling!

Recent Activity

upvoted a paper 7 days ago

s1: Simple test-time scaling

upvoted a paper 11 days ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

upvoted a paper 11 days ago

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

View all activity

Organizations

SultanR's activity

upvoted a paper 7 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 10 days ago • 97

upvoted 9 papers 11 days ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 25

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7 • 53

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published about 1 month ago • 83

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 27 days ago • 273

Debate Helps Weak-to-Strong Generalization

Paper • 2501.13124 • Published 20 days ago • 7

Hallucinations Can Improve Large Language Models in Drug Discovery

Paper • 2501.13824 • Published 18 days ago • 9

Return of the Encoder: Maximizing Parameter Efficiency for SLMs

Paper • 2501.16273 • Published 14 days ago • 5

Feasible Learning

Paper • 2501.14912 • Published 17 days ago • 5

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 13 days ago • 101

commented a paper 12 days ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published 13 days ago • 23 •

upvoted a paper 12 days ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published 13 days ago • 23

liked a model 20 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 8 days ago • 271k • 2.98k