DarwinLM: Evolutionary Structured Pruning of Large Language Models Paper • 2502.07780 • Published Feb 11 • 17
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Paper • 2501.02625 • Published Jan 5 • 16
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published Feb 7 • 42
Sparse Finetuning for Inference Acceleration of Large Language Models Paper • 2310.06927 • Published Oct 10, 2023 • 14
Towards End-to-end 4-Bit Inference on Generative Large Language Models Paper • 2310.09259 • Published Oct 13, 2023 • 1
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression Paper • 2306.03078 • Published Jun 5, 2023 • 3
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation Paper • 2401.04679 • Published Jan 9, 2024 • 2
Extreme Compression of Large Language Models via Additive Quantization Paper • 2401.06118 • Published Jan 11, 2024 • 12
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization Paper • 2308.02060 • Published Aug 3, 2023 • 1
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization Paper • 2404.03605 • Published Apr 4, 2024 • 1
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning Paper • 2208.11580 • Published Aug 24, 2022
GMP*: Well-Tuned Gradual Magnitude Pruning Can Outperform Most BERT-Pruning Methods Paper • 2210.06384 • Published Oct 12, 2022 • 1
L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning Paper • 2210.17357 • Published Oct 31, 2022 • 1