-
Transformer^2: Self-adaptive LLMs
Paper • 2501.06252 • Published • 53 -
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Paper • 2501.12370 • Published • 8 -
Self-Refine: Iterative Refinement with Self-Feedback
Paper • 2303.17651 • Published • 2 -
Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval
Paper • 2410.13339 • Published
Av
Avi66
AI & ML interests
ML Research , LLMs , Applications
Recent Activity
updated
a collection
about 20 hours ago
Papers
updated
a collection
10 days ago
Papers
updated
a collection
20 days ago
Vlm
Organizations
None yet
Collections
3
-
unsloth/Llama-3.2-90B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 1.79k • 18 -
neuralmagic/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
Text Generation • Updated • 6.74k • 20 -
cjpais/llava-v1.6-34B-gguf
Image-Text-to-Text • Updated • 1.24k • 39 -
THUDM/cogvlm2-llama3-caption
Video-Text-to-Text • Updated • 192k • 82
models
None public yet
datasets
None public yet