---
license: llama2
language:
  - en
pipeline_tag: conversational
tags:
  - 4.0bpw
  - h8
  - exl2
---

Exllamav2 4.0bpw h8 quant of BigWeave-v6-90b.

Calibration dataset: llmixer/20k_random_data

Fits in 48GB of VRAM at 4k context. Slightly lower perplexity than Goliath-120b at 3.0bpw.
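As a rough sanity check on the 48GB figure, the quantized weights alone can be estimated from bits-per-weight. This is a minimal sketch; the ~90B parameter count is an assumption read off the model name, and KV cache plus activation overhead (which consume the remaining headroom at 4k context) are not included.

```python
# Back-of-envelope VRAM estimate for the quantized weights alone.
# KV cache and activations come on top of this figure.
def quant_weight_gb(n_params: float, bpw: float) -> float:
    """Approximate size in GB of a model stored at `bpw` bits per weight."""
    return n_params * bpw / 8 / 1e9

# ~90e9 parameters is an assumption taken from the "90b" in the model name.
weights_gb = quant_weight_gb(90e9, 4.0)
print(f"~{weights_gb:.1f} GB for weights")  # ~45.0 GB
```

At 4.0bpw the weights take roughly 45 GB, which is why 48GB cards can hold the model with only a few GB left for the 4k-context cache.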