Papers to read - General Papers I want to read, at some point. Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation Paper β’ 2108.12409 β’ Published Aug 27, 2021 β’ 5 YaRN: Efficient Context Window Extension of Large Language Models Paper β’ 2309.00071 β’ Published Aug 31, 2023 β’ 65 MIMIC-IT: Multi-Modal In-Context Instruction Tuning Paper β’ 2306.05425 β’ Published Jun 8, 2023 β’ 11 Music ControlNet: Multiple Time-varying Controls for Music Generation Paper β’ 2311.07069 β’ Published Nov 13, 2023 β’ 43
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation Paper β’ 2108.12409 β’ Published Aug 27, 2021 β’ 5
YaRN: Efficient Context Window Extension of Large Language Models Paper β’ 2309.00071 β’ Published Aug 31, 2023 β’ 65
MIMIC-IT: Multi-Modal In-Context Instruction Tuning Paper β’ 2306.05425 β’ Published Jun 8, 2023 β’ 11
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper β’ 2311.07069 β’ Published Nov 13, 2023 β’ 43
Papers to read - Reinforcement Learning Papers I want to read, at some point. Focused on Reinforcement Learning papers. Deep reinforcement learning from human preferences Paper β’ 1706.03741 β’ Published Jun 12, 2017 β’ 3 Training language models to follow instructions with human feedback Paper β’ 2203.02155 β’ Published Mar 4, 2022 β’ 16 Direct Preference-based Policy Optimization without Reward Modeling Paper β’ 2301.12842 β’ Published Jan 30, 2023 Woodpecker: Hallucination Correction for Multimodal Large Language Models Paper β’ 2310.16045 β’ Published Oct 24, 2023 β’ 15
Deep reinforcement learning from human preferences Paper β’ 1706.03741 β’ Published Jun 12, 2017 β’ 3
Training language models to follow instructions with human feedback Paper β’ 2203.02155 β’ Published Mar 4, 2022 β’ 16
Direct Preference-based Policy Optimization without Reward Modeling Paper β’ 2301.12842 β’ Published Jan 30, 2023
Woodpecker: Hallucination Correction for Multimodal Large Language Models Paper β’ 2310.16045 β’ Published Oct 24, 2023 β’ 15