Papers - an Avi66 Collection

updated about 2 hours ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 21 days ago • 53

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 8 days ago • 7

Self-Refine: Iterative Refinement with Self-Feedback

Paper • 2303.17651 • Published Mar 30, 2023 • 2

Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval

Paper • 2410.13339 • Published Oct 17, 2024

Gorilla: Large Language Model Connected with Massive APIs

Paper • 2305.15334 • Published May 24, 2023 • 5

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 58

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 17

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 184

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12, 2024 • 75

Probing Out-of-Distribution Robustness of Language Models with Parameter-Efficient Transfer Learning

Paper • 2301.11660 • Published Jan 27, 2023 • 1

From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published Jun 18, 2024 • 21

Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3