arxiv:2412.06769
Yuandong Tian
tydsh
AI & ML interests
Reinforcement Learning, Optimization, Representation Learning
Recent Activity
authored a paper about 1 month ago
Training Large Language Models to Reason in a Continuous Latent Space
authored a paper 6 months ago
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients
upvoted a collection 7 months ago
Llama 2 Family
Organizations
None yet
Papers
18
Models
None public yet
Datasets
None public yet