Can you please share more about your process?

by macadeliccc - opened Jan 28

Discussion

macadeliccc

Jan 28

What dataset did you use to fine-tune?

SanjiWatsuki

Owner Jan 28

https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs

macadeliccc

Jan 28

Did you filter out the samples that overlap with the GSM8K train data? I believe this model is contaminated with GSM8K train data, making its scores invalid.

SanjiWatsuki

Owner Jan 28

Yes, the dataset annotates whether each row appears in GSM8K, and I filtered those rows out before training using the scoring filter provided in the repo.
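For anyone reading later, here is a rough sketch of what that kind of filter might look like. It is not the exact code used for this model. The column names `in_gsm8k_train`, `status`, and `chosen_score`, and the score threshold of 8, are assumptions based on my reading of the dataset card, so check them against the dataset before relying on this. The sample rows are made up for illustration.

```python
from typing import Any, Dict, Iterable, List


def keep_row(row: Dict[str, Any], min_chosen_score: float = 8.0) -> bool:
    """Return True if a preference pair should be kept for training.

    Assumed columns (verify against the dataset card):
      in_gsm8k_train: bool, True if the prompt overlaps the GSM8K train split
      status:         str, "tie" when neither response was preferred
      chosen_score:   number, rating given to the chosen response
    """
    # Drop anything flagged as overlapping GSM8K to avoid contamination.
    if row.get("in_gsm8k_train", False):
        return False
    # Drop ties, since they carry no preference signal.
    if row.get("status") == "tie":
        return False
    # Keep only pairs whose chosen response was rated highly enough.
    score = row.get("chosen_score")
    return score is not None and score >= min_chosen_score


def filter_rows(rows: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Apply keep_row to every row and return the survivors."""
    return [row for row in rows if keep_row(row)]


# Made-up rows, only to show the behaviour of the filter.
sample = [
    {"id": 1, "in_gsm8k_train": False, "status": "unchanged", "chosen_score": 9.0},
    {"id": 2, "in_gsm8k_train": True, "status": "unchanged", "chosen_score": 10.0},
    {"id": 3, "in_gsm8k_train": False, "status": "tie", "chosen_score": 8.0},
    {"id": 4, "in_gsm8k_train": False, "status": "swapped", "chosen_score": 6.0},
]

kept = filter_rows(sample)
print([row["id"] for row in kept])
```

With the Hugging Face `datasets` library, the same predicate could be passed to `Dataset.filter`, for example `dataset.filter(keep_row)`.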

macadeliccc changed discussion status to closed Jan 28

Stark2008

May 28

edited May 31

Hey @SanjiWatsuki,

Can you say what the base model was?
