Reinforcement learning - a Testerpce Collection

Testerpce 's Collections

Agent

MoE

RAG

State space LLM

Partial layer training LLMs

Math

Dataset and Data processing

Video understanding

Reinforcement learning

Reinforcement learning

updated 2 days ago

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

Paper • 2407.20798 • Published Jul 30, 2024 • 24
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published 24 days ago • 38
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 9 days ago • 72