Haihao Shen

Haihao

AI & ML interests

LLM quantization, sparsity, and acceleration

Organizations

Intel, Need4Speed, Qwen, Open Platform for Enterprise AI

Haihao's activity

reacted to wenhuach's post with 🚀 8 days ago
This week, OPEA Space released several new INT4 models, including:
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
allenai/OLMo-2-1124-13B-Instruct
THUDM/glm-4v-9b
AIDC-AI/Marco-o1
and several others.
Let us know which models you'd like prioritized for quantization, and we'll do our best to make it happen!

https://huggingface.co/OPEA
New activity in Intel/neural-chat-7b-v3 about 1 month ago
New activity in Intel/neural-chat-7b-v3-3 about 1 month ago
upvoted an article 4 months ago

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

upvoted an article 7 months ago

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding
