Instruct-SkillMix
Collection
This collections contains the dataset generated with the Instruct-SkillMix pipeline and model checkpoints finetuned on the data.
•
7 items
•
Updated
This model was SFT-ed from meta-llama/Meta-Llama-3-8B with data generated by the Seed-Dataset Agnostic version of the Instruct-SkillMix pipeline.
We used 4000 examples from Instruct-SkillMix-SDA(k=2) (data available at PrincetonPLI/Instruct-SkillMix-SDA).
We provide the set of generation configuration used for evaluation.
Paper: Instruct-SkillMix
@misc{kaur2024instructskillmixpowerfulpipelinellm,
title={Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning},
author={Simran Kaur and Simon Park and Anirudh Goyal and Sanjeev Arora},
year={2024},
eprint={2408.14774},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2408.14774},
}
Simran Kaur, Princeton University
Simon Park, Princeton University
{skaur, juhyunp} 'at' princeton 'dot' edu
Base model
meta-llama/Meta-Llama-3-8B