awsuineg commited on
Commit
982233d
·
verified ·
1 Parent(s): 6fb4279

Model save

Browse files
Files changed (4) hide show
  1. README.md +3 -6
  2. all_results.json +5 -5
  3. train_results.json +5 -5
  4. trainer_state.json +0 -0
README.md CHANGED
@@ -1,20 +1,17 @@
1
  ---
2
  base_model: mistralai/Mistral-7B-v0.1
3
- datasets:
4
- - argilla/distilabel-capybara-dpo-7k-binarized
5
  library_name: transformers
6
- model_name: mistralai/Mistral-7B-v0.1
7
  tags:
8
  - generated_from_trainer
9
- - alignment-handbook
10
  - trl
11
  - orpo
12
  licence: license
13
  ---
14
 
15
- # Model Card for mistralai/Mistral-7B-v0.1
16
 
17
- This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the [['argilla/distilabel-capybara-dpo-7k-binarized']](https://huggingface.co/datasets/['argilla/distilabel-capybara-dpo-7k-binarized']) dataset.
18
  It has been trained using [TRL](https://github.com/huggingface/trl).
19
 
20
  ## Quick start
 
1
  ---
2
  base_model: mistralai/Mistral-7B-v0.1
 
 
3
  library_name: transformers
4
+ model_name: zephyr-orpo-7b-hehe
5
  tags:
6
  - generated_from_trainer
 
7
  - trl
8
  - orpo
9
  licence: license
10
  ---
11
 
12
+ # Model Card for zephyr-orpo-7b-hehe
13
 
14
+ This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
all_results.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "epoch": 3.0,
3
  "total_flos": 0.0,
4
- "train_loss": 1.4294498003771459,
5
- "train_runtime": 7808.9602,
6
- "train_samples": 7210,
7
- "train_samples_per_second": 2.77,
8
- "train_steps_per_second": 0.173
9
  }
 
1
  {
2
  "epoch": 3.0,
3
  "total_flos": 0.0,
4
+ "train_loss": 1.0584710824750894,
5
+ "train_runtime": 37613.9724,
6
+ "train_samples": 61065,
7
+ "train_samples_per_second": 4.87,
8
+ "train_steps_per_second": 0.304
9
  }
train_results.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
  "epoch": 3.0,
3
  "total_flos": 0.0,
4
- "train_loss": 1.4294498003771459,
5
- "train_runtime": 7808.9602,
6
- "train_samples": 7210,
7
- "train_samples_per_second": 2.77,
8
- "train_steps_per_second": 0.173
9
  }
 
1
  {
2
  "epoch": 3.0,
3
  "total_flos": 0.0,
4
+ "train_loss": 1.0584710824750894,
5
+ "train_runtime": 37613.9724,
6
+ "train_samples": 61065,
7
+ "train_samples_per_second": 4.87,
8
+ "train_steps_per_second": 0.304
9
  }
trainer_state.json CHANGED
The diff for this file is too large to render. See raw diff