arxiv:2412.10319
Xufang Luo
luoxufang
AI & ML interests
None yet
Recent Activity
authored
a paper
28 days ago
SCBench: A KV Cache-Centric Analysis of Long-Context Methods
authored
a paper
6 months ago
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via
Dynamic Sparse Attention
authored
a paper
10 months ago
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic
Prompt Compression
Organizations
None yet