Abreu Magalhães's picture

125 83

Abreu Magalhães

Hildeberto

·

AI & ML interests

None yet

Organizations

None yet

Hildeberto's activity

upvoted 2 papers 14 days ago

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published 19 days ago • 18

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

Paper • 2410.21169 • Published 16 days ago • 29

upvoted 3 papers 19 days ago

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published 22 days ago • 12

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published 23 days ago • 15

Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception

Paper • 2410.12788 • Published 28 days ago • 21

upvoted 2 papers 27 days ago

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published 30 days ago • 48

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Paper • 2410.09584 • Published Oct 12 • 45

upvoted 5 papers about 1 month ago

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10 • 24

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 165

Selective Attention Improves Transformer

Paper • 2410.02703 • Published Oct 3 • 23

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 143

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27 • 25

upvoted 5 papers about 2 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24 • 24

Making Text Embedders Few-Shot Learners

Paper • 2409.15700 • Published Sep 24 • 29

Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

Paper • 2409.12941 • Published Sep 19 • 21

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17 • 21

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16 • 37

upvoted 3 papers 2 months ago

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 87

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

Paper • 2409.05591 • Published Sep 9 • 28

OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

Paper • 2409.05152 • Published Sep 8 • 29