- Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
  Paper • 2309.10150 • Published • 25
- In-Context Pretraining: Language Modeling Beyond Document Boundaries
  Paper • 2310.10638 • Published • 30
- Farzi Data: Autoregressive Data Distillation
  Paper • 2310.09983 • Published • 10
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
  Paper • 2311.05437 • Published • 50
Mat Miller
matdmiller
AI & ML interests
None yet
Recent Activity
updated a dataset 1 day ago: matdmiller/finemath-4plus
published a dataset 1 day ago: matdmiller/finemath-4plus
updated a Space 14 days ago: LocalResearchGroup/aim