Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

HF1BitLLM
/
Llama3-8B-1.58-100B-tokens

Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
8-bit precision
bitnet
Model card Files Files and versions Community
12
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

added missing imports

#12 opened about 2 months ago by
bitsTobyte

Triton error while running demo code

2
#11 opened 3 months ago by
chiauho

Slower than standard Llama 8b?

1
#10 opened 3 months ago by
Sijuade

I found some errors when building on a rpi 5

1
#9 opened 3 months ago by
eddieoz

You can try to convert DeepSeek-V2.5 or Llama-3.1-Nemotron-70B-Instruct-HF?

2
#8 opened 3 months ago by
win10

Finetuning this model

7
#7 opened 3 months ago by
Andrefty

GGUF conversion

11
#3 opened 4 months ago by
compilade
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs