72 1378 2127

taesiri PRO

taesiri

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

updated a dataset about 5 hours ago

taesiri/PhotoshopRequest-DailyDump-March-2025

updated a dataset about 6 hours ago

taesiri/PRBackup_Compressed

liked a model about 21 hours ago

Qwen/QwQ-32B

View all activity

Organizations

taesiri's activity

upvoted 3 papers 2 days ago

upvoted a paper 3 days ago

HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs

Paper • 2503.02003 • Published 6 days ago • 37

upvoted an article 5 days ago

Article

Hugging Face and JFrog partner to make AI Security more transparent

6 days ago

• 18

upvoted a collection 5 days ago

C4AI Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 5 days ago • 59

upvoted an article 5 days ago

Article

FastRTC: The Real-Time Communication Library for Python

13 days ago

• 130

upvoted a collection 5 days ago

ViRFT Datasets

Collection

ViRFT Datasets • 8 items • Updated 13 days ago • 5

upvoted a paper 5 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted 2 papers 6 days ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 12 days ago • 44

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Paper • 2502.20730 • Published 9 days ago • 32

upvoted 3 papers 9 days ago

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Paper • 2502.16645 • Published 14 days ago • 21

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 11 days ago • 75

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Paper • 2502.20395 • Published 10 days ago • 43

upvoted a collection 10 days ago

Phi-4

Collection

Phi-4 family of small language and multi-modal models. • 7 items • Updated 6 days ago • 108

upvoted 5 papers 10 days ago

Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Paper • 2502.19204 • Published 11 days ago • 11

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published 12 days ago • 62

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published 11 days ago • 26

Towards an AI co-scientist

Paper • 2502.18864 • Published 11 days ago • 41

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published 11 days ago • 42