Custom 4-bit Finetuning 5-7 times faster inference than QLora
pinned
1
#13 opened over 1 year ago
by
rmihaylov
How to make it work for less experienced AI whisperers
pinned
17
#4 opened over 1 year ago
by
Sloba
Support for LoRA?
pinned
17
#3 opened over 1 year ago
by
cekal
Fix for issue #104
#105 opened 2 months ago
by
vebM
AttributeError: 'Logger' object has no attribute 'warning_once'
#104 opened 2 months ago
by
vebM
Inference tensors cannot be saved for backward. To work around you can make a clone to get a normal tensor and use it in autograd.
#103 opened 5 months ago
by
Mubarak127
Two repeated errors in model output
1
#102 opened 7 months ago
by
virilo
ValueError: The current `device_map` had weights offloaded to the disk.
#101 opened 8 months ago
by
MohamedZouabi
Why does falcon-7b have 71 attention heads?
1
#100 opened 8 months ago
by
alpindale
Creating vectordatabse using the falcon-7b model embeddings.
#99 opened 9 months ago
by
alchemistPS01
FalconForCausalLM does not support Flash Attention 2.0 yet
#98 opened 10 months ago
by
Menouar
Error while trying to load model
#96 opened 11 months ago
by
dwojcik
Adding `safetensors` variant of this model
#95 opened 12 months ago
by
SFconvertbot
Adding Evaluation Results
#94 opened 12 months ago
by
leaderboard-pr-bot
Model does not know when to stop generating text?
#93 opened about 1 year ago
by
jashsayani
Could we machine translatation task using this model?
2
#91 opened about 1 year ago
by
Pitambarmuduli
Falcon-7B decoding error
#90 opened about 1 year ago
by
rahulseetharaman
[AUTOMATED] Model Memory Requirements
#89 opened about 1 year ago
by
model-sizer-bot
Upload configuration_RW.py
#88 opened about 1 year ago
by
imranshah
Upload configuration_RW.py
#87 opened about 1 year ago
by
imranshah
Getting: HTTPError: 404 Client Error: Not Found for url: https://huggingface.co./tiiuae/falcon-7b/resolve/main/configuration_RW.py
1
#86 opened about 1 year ago
by
f5-lolabhattu
What does this file do? modeling_falcon.py
#85 opened about 1 year ago
by
Tony068
Anyone discovered "Mini" yet in prompting?
#83 opened about 1 year ago
by
YoYo1234Qwerty
How to avoid running into memory/ storage problems associated with HF Spaces while using tiiuae/falcon-7b 0r 40b etc.,
4
#82 opened about 1 year ago
by
vsrinivas
Update generation_config.json
1
#81 opened about 1 year ago
by
nkasmanoff
ValueError: Unrecognized configuration class <class 'transformers_modules.falcon-7b.configuration_RW.RWConfig'> for this kind of AutoModel....
2
#80 opened about 1 year ago
by
Inoob
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED
#79 opened about 1 year ago
by
ConorVanek
ImportError: Using `load_in_8bit=True` requires Accelerate
3
#78 opened about 1 year ago
by
aimananees
Adding `safetensors` variant of this model
#77 opened about 1 year ago
by
bikalnetomi
Use input attention mask instead of casual mask in attention
#76 opened over 1 year ago
by
CyberZHG
Question answering task with falcon model fails with "TypeError: forward() got an unexpected keyword argument 'token_type_ids'"
1
#75 opened over 1 year ago
by
karolzak13
Inaccurate number of parameters
1
#74 opened over 1 year ago
by
mohamedlotfy50
Title: Best Practice for Handling Variable-Length Sequences in Training an LLM Model on a Chatbot Dataset
#73 opened over 1 year ago
by
humza-sami
Can't use the model load locally
1
#72 opened over 1 year ago
by
Alouettewind
Falcon 7b instruct using cpu for inference even on NVIDIA A40 cards with 50GB VRAM
#70 opened over 1 year ago
by
Akshadv
Why is alibi: false in the config.json?
#69 opened over 1 year ago
by
ekurtic
getting error
2
#67 opened over 1 year ago
by
Akash1267a
Revert in-library commit
#65 opened over 1 year ago
by
Rocketknight1
Senior ML Scientist
#63 opened over 1 year ago
by
FinTrU-TA
OSError: tiiuae/falcon-7b does not appear to have a file named configuration_RW.py
5
#62 opened over 1 year ago
by
chintan4560
about eos and bos token id
1
#61 opened over 1 year ago
by
louisY
configuration_RW.py missing in latest commit
9
#60 opened over 1 year ago
by
ravikiran3690
Inference time issue
#59 opened over 1 year ago
by
amnasher
Update generation_config.json
#55 opened over 1 year ago
by
psinger
Fine-tuning issues
1
#53 opened over 1 year ago
by
nebulae7
How to push or shere adapter to the hub?
7
#52 opened over 1 year ago
by
Imran1
Getting an error: RuntimeError: shape '[x, 71, 64]' is invalid for input of size 3904
1
#51 opened over 1 year ago
by
Carolinehu
Getting an error TypeError: unsupported operand type(s) for *: 'Tensor' and 'NoneType'
7
#49 opened over 1 year ago
by
NajiAboo
Fix typo in `README.md`
#48 opened over 1 year ago
by
alvarobartt