my_cool_model

This model is a finetuned version of meta-llama/Llama-3.2-3B-Instruct on the tatsu-lab/alpaca dataset.

Model description

More information needed

Training and evaluation results

More information needed

Training procedure

This model was trained using the torchtune library using the following command:

ppo_full_finetune_single_device.py\
  --config ./target/7B_full_ppo_low_memory_single_device.yaml\
 device=cuda\
 metric_logger._component_=torchtune.utils.metric_logging.WandBLogger\
 metric_logger.project=torchtune_ppo\
 forward_batch_size=2\
 batch_size=64\
 ppo_batch_size=32\
 gradient_accumulation_steps=16\
 compile=True\
 optimizer._component_=bitsandbytes.optim.PagedAdamW\
 optimizer.lr=3e-4

Framework versions

torchtune
torchao 0.5.0
datasets 2.20.0
sentencepiece 0.2.0

smohammadi
/

torchtune-modelcards-test_20241003163932

my_cool_model

Model description

Training and evaluation results

Training procedure

Framework versions

Model tree for smohammadi/torchtune-modelcards-test_20241003163932

Dataset used to train smohammadi/torchtune-modelcards-test_20241003163932