lewtun HF staff commited on
Commit
f6aa3fc
1 Parent(s): 789f267

Add evaluation results on glue dataset

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the [glue](https://huggingface.co./datasets/glue) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co./datasets/autoevaluate/autoeval-staging-eval-project-6489fc46-7764973).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co./spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co./spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1) hide show
  1. README.md +36 -0
README.md CHANGED
@@ -4,6 +4,42 @@ license: apache-2.0
4
  datasets:
5
  - sst2
6
  - glue
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  ---
8
 
9
  # DistilBERT base uncased finetuned SST-2
 
4
  datasets:
5
  - sst2
6
  - glue
7
+ model-index:
8
+ - name: distilbert-base-uncased-finetuned-sst-2-english
9
+ results:
10
+ - task:
11
+ type: text-classification
12
+ name: Text Classification
13
+ dataset:
14
+ name: glue
15
+ type: glue
16
+ config: sst2
17
+ split: validation
18
+ metrics:
19
+ - name: Accuracy
20
+ type: accuracy
21
+ value: 0.9105504587155964
22
+ verified: true
23
+ - name: Precision
24
+ type: precision
25
+ value: 0.8978260869565218
26
+ verified: true
27
+ - name: Recall
28
+ type: recall
29
+ value: 0.9301801801801802
30
+ verified: true
31
+ - name: AUC
32
+ type: auc
33
+ value: 0.9716626673402374
34
+ verified: true
35
+ - name: F1
36
+ type: f1
37
+ value: 0.9137168141592922
38
+ verified: true
39
+ - name: loss
40
+ type: loss
41
+ value: 0.39013850688934326
42
+ verified: true
43
  ---
44
 
45
  # DistilBERT base uncased finetuned SST-2