arxiv:2412.05270
Zhenyu Zhang
Kyriection
AI & ML interests
Large Language Models, Efficient Machine Learning, Quantum Computing
Recent Activity
upvoted
a
paper
6 days ago
Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and
Post-LN
authored
a paper
17 days ago
APOLLO: SGD-like Memory, AdamW-level Performance
upvoted
a
paper
17 days ago
APOLLO: SGD-like Memory, AdamW-level Performance
Organizations
None yet
Papers
12
models
None public yet
datasets
None public yet