Haihao Shen

Haihao

AI & ML interests

LLM quantization, sparsity, and acceleration

Articles

Organizations

Haihao's activity

upvoted an article 17 days ago
view article
Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

• 11
upvoted an article 4 months ago
view article
Article

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

• 4