-
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Paper • 2312.06134 • Published • 3 -
Efficient Monotonic Multihead Attention
Paper • 2312.04515 • Published • 8 -
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 39 -
Exploring Format Consistency for Instruction Tuning
Paper • 2307.15504 • Published • 8
KujoJotaro
paisleypark
·
AI & ML interests
None yet
Recent Activity
liked
a model
9 days ago
amazon/chronos-t5-small
commented on
a paper
8 months ago
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware
Experts
upvoted
a
paper
9 months ago
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet