Abhishek Patnia PRO

appliedml42

appliedml42
appliedml42
abhishekpatnia

AI & ML interests

SMOL LLMs, PEFT, GPU Optimization, Natural Language Processing, Trust & Safety

Recent Activity

posted an update 29 days ago

I am trying to find resources that explain how I can protect against instruction following capability degradation due to LoRA fine-tuning. For example, I fine-tuned Llama 3.2 3B Instruct on https://huggingface.co./datasets/cornell-movie-review-data/rotten_tomatoes dataset and saw significant degradation in ifeval benchmark scores. I would appreciate any pointers 🙏🏽

View all activity

Organizations

None yet

Posts 1

Post

1307

I am trying to find resources that explain how I can protect against instruction following capability degradation due to LoRA fine-tuning.

For example, I fine-tuned Llama 3.2 3B Instruct on cornell-movie-review-data/rotten_tomatoes dataset and saw significant degradation in ifeval benchmark scores.

I would appreciate any pointers 🙏🏽

models

None public yet

datasets

None public yet