Jonathan Mamou's picture

11 6

Jonathan Mamou

jmamou

·

jmamou

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

FastDraft: How to Train Your Draft

View all activity

Articles

Universal Assisted Generation: Faster Decoding with Any Assistant Model

Faster Assisted Generation with Dynamic Speculation

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Organizations

jmamou's activity

upvoted a paper about 1 month ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17 • 9

upvoted an article 3 months ago

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8

• 44

upvoted a paper 5 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5 • 35

upvoted 2 papers 7 months ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23 • 16