A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Paper • 2410.22391 • Published Oct 29, 2024 • 22
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Paper • 2410.07170 • Published Oct 9, 2024 • 15
Retrieval-Augmented Decision Transformer: External Memory for In-context RL Paper • 2410.07071 • Published Oct 9, 2024 • 6
Efficient LLM Pretraining: Packed Sequences and Masked Attention Article • By sirluk • Oct 7, 2024 • 11
Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA Article • By sirluk • Jan 22, 2024 • 14