Spaces:

OALL
/

Open-Arabic-LLM-Leaderboard

Running on CPU Upgrade

App Files Files Community

Looks like someone else submitted sambanovasystems/SambaLingo-Arabic-Base with wrong precision

by zolicsaki - opened May 14

Discussion

zolicsaki

May 14

•

edited May 14

Pictured below is sambanovasystems/SambaLingo-Arabic-Base with FP32 precision - I re-submitted it with the correct bf16 precision in the queue and now both are in the queue

zolicsaki changed discussion title from Looks like someone else submitted sambanovasystems/SambaLingo-Arabic-Chat with wrong precision to Looks like someone else submitted sambanovasystems/SambaLingo-Arabic-Base with wrong precision May 14

alielfilali01

Open Arabic LLM Leaderboard org May 15

Pictured below is sambanovasystems/SambaLingo-Arabic-Base with FP32 precision - I re-submitted it with the correct bf16 precision in the queue and now both are in the queue

@zolicsaki
Thank you for keeping an eye on the leaderboard 🤗
I see you are a member of SambaNovaSystems, glad to have you here. As for the SambaLingo-Arabic-Base, i believe the correct precision is float32 indeed, i simply checked the config here So i will remove the newly submission made with bf16 precision from requests. Nevertheless, i saw that you guys merged my PR (auto) for safetensors, does this PR changed the 70B version from float32 to bf16 ? Because now i see it bf16 but i remember it was f32 !? Anyway please feel free to add these models to queue with the correct precision and I'll make sure to delete the wrong one 🤗

zolicsaki

May 15

@Ali-C137 Thank you so much - all the models in the queue look correct now

zolicsaki

May 20

•

edited May 20

@Ali-C137 Hey just checked back in and it looks like the queue has completed, but the SambaLingo models evaluation results are not there, any ideas on why? Thank you so much!

Also just curios whether the chat templates are applied for chat models when running the evaluation?

alielfilali01

Open Arabic LLM Leaderboard org May 20

Dear @zolicsaki , unfortunately we have about 50 models that failled to be evaluated, we are investigating the matter and will fix it from our side if we can otherwise we will contact the authors of the models with insights to fix anything that needs to be fixed from their side

zolicsaki

May 20

•

edited May 20

@Ali-C137 Thank you! I am the author of these SambaLingo models - please let me know if you need anything

alielfilali01

Open Arabic LLM Leaderboard org May 23

@zolicsaki SambaLingo-Arabic-Chat is on the leaderboard 🔥
The base model is still under maintenance and will join the queue soon 🤗

zolicsaki

May 23

@Ali-C137 Thank you so much! Are the 70B parameter versions also going to make it on there?

alielfilali01

Open Arabic LLM Leaderboard org May 23

@zolicsaki We are trying to make every model land on the leaderboard, i will personally contact you if we had an issue with one of your models that we couldn't resolve

zolicsaki

Jun 6

Hi @Ali-C137 any updates on SambaLingo 70B?

alielfilali01

Open Arabic LLM Leaderboard org Jun 6

dear @zolicsaki

You can always check status here : https://huggingface.co./datasets/OALL/requests/blob/main/sambanovasystems/SambaLingo-Arabic-Base-70B_eval_request_False_float32_Original.json

It is running and we expect it to land by tomorrow since bigger models succeeded in the last couple days ... even tho we do not guarantee anything yet since we encountered some weird errors with other models based on llama2

alielfilali01

Open Arabic LLM Leaderboard org Jun 10

Hi dear @zolicsaki
Apparently the 70B models with the float32 precision requires way more time than allowed ! Therefore we will need you guys to to provide a float16 or bfloat16 version of the model in order to be able to evaluate it on time. We can always cast it ourselves but we are afraid that this might create a confusion for the users of your model so it would be better to provide a half-precision version.
Please let us know what works better for you and we would be happy to help.

alielfilali01 changed discussion status to closed 16 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment