RuntimeError: FlashAttention only support fp16 and bf16 data type
#15 opened by Satandon1999
I am not able to resolve this error while trying to fine-tune this model.
I have loaded the model as bf16 by passing `torch_dtype=torch.bfloat16` to the `from_pretrained` function, and I have also added an explicit cast with `model = model.to(torch.bfloat16)`.
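For reference, here is roughly what my loading code looks like (a minimal sketch; the `trust_remote_code` flag and the tokenizer line are assumptions about a typical setup, not my exact script):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-3-small-8k-instruct"

# Load directly in bf16 -- FlashAttention only accepts fp16/bf16 inputs.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,  # assumption: the model ships custom modeling code
)

# Explicit cast as well, in case any submodule was still initialized in fp32.
model = model.to(torch.bfloat16)

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
```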
The exact same code works flawlessly for the 'mini' version of the model, but not for this one.
Any guidance would be greatly appreciated.
Thanks.
See my command for the 4k model here:
https://huggingface.co./microsoft/Phi-3-small-8k-instruct/discussions/11#6661de1a62de925acf74516f