arxiv:2501.02423
AndyYang
andyyang
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
HMoE: Heterogeneous Mixture of Experts for Language Modeling
authored
a paper
3 days ago
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated
Parameters by Tencent
authored
a paper
3 days ago
Scaling Laws for Floating Point Quantization Training
Organizations
Papers
3
models
None public yet