7 1136 639

Kye Gomez

kye

https://discord.gg/qUtxnK2NMf

kyegomezb

AI & ML interests

Neuroscience, Behavior Science, Anti-Matter, Anti-Gravity propulsion,

Recent Activity

upvoted a paper 1 day ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

upvoted a paper 1 day ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

upvoted a paper 1 day ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

View all activity

Organizations

kye's activity

upvoted 7 papers 1 day ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published 9 days ago • 19

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 17 days ago • 82

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published 9 days ago • 43

upvoted 3 papers 3 days ago

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published 4 days ago • 34

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published 4 days ago • 26

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 4 days ago • 77

upvoted 10 papers 4 days ago

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Paper • 2504.09130 • Published 7 days ago • 10

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published 4 days ago • 7

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Paper • 2504.08066 • Published 8 days ago • 9

SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users

Paper • 2504.10157 • Published 5 days ago • 12

LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Paper • 2504.10415 • Published 4 days ago • 7

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Paper • 2504.09710 • Published 5 days ago • 17

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published 8 days ago • 39

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 4 days ago • 223

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Paper • 2504.09925 • Published 5 days ago • 36

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 12 days ago • 112