NVILA: Efficient Frontier Visual Language Models • Paper 2412.04468 • Published 20 days ago • 54
Star Attention: Efficient LLM Inference over Long Sequences • Paper 2411.17116 • Published 30 days ago • 47