dame rajee

damerajee

damerajee's activity

upvoted a paper 1 day ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 5 days ago • 197

upvoted 2 papers 5 days ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published 8 days ago • 23

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 9 days ago • 72

upvoted an article 10 days ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • 11 days ago • 37

upvoted a paper 14 days ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published 18 days ago • 18

upvoted a paper 16 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 20 days ago • 35

upvoted 2 papers 18 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 20 days ago • 44

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published 25 days ago • 15

upvoted 2 papers 19 days ago

TRecViT: A Recurrent Video Transformer

Paper • 2412.14294 • Published 26 days ago • 12

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published 25 days ago • 13

upvoted an article 20 days ago

Deriving DPO's Loss

Article • 20 days ago • 26

upvoted a paper 25 days ago

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents

Paper • 2412.13194 • Published 27 days ago • 12

upvoted a paper 26 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 54

upvoted 2 papers 27 days ago

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 29 days ago • 27

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 8

upvoted 2 papers 28 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published about 1 month ago • 137

JuStRank: Benchmarking LLM Judges for System Ranking

Paper • 2412.09569 • Published Dec 12, 2024 • 19

upvoted 2 papers about 1 month ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 125

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 59

upvoted a paper about 2 months ago

Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages

Paper • 2411.12240 • Published Nov 19, 2024 • 6