abishekcodes committed on
Commit dcd954a · verified · 1 Parent(s): 659120c

End of training

README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0305
- - Accuracy: 0.9928
- - F1: 0.9785
+ - Loss: 0.0326
+ - Accuracy: 0.9950
+ - F1: 0.9873
 
  ## Model description
 
@@ -41,20 +41,27 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
- - train_batch_size: 16
- - eval_batch_size: 16
+ - train_batch_size: 32
+ - eval_batch_size: 32
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 3
+ - num_epochs: 10
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
- |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
- | 0.0279 | 1.0 | 2003 | 0.0292 | 0.9912 | 0.9716 |
- | 0.0135 | 2.0 | 4006 | 0.0282 | 0.9919 | 0.9750 |
- | 0.0057 | 3.0 | 6009 | 0.0305 | 0.9928 | 0.9785 |
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|:------:|
+ | 0.0207 | 1.0 | 1846 | 0.0205 | 0.9938 | 0.9813 |
+ | 0.01 | 2.0 | 3692 | 0.0222 | 0.9938 | 0.9825 |
+ | 0.0053 | 3.0 | 5538 | 0.0231 | 0.9946 | 0.9851 |
+ | 0.0041 | 4.0 | 7384 | 0.0258 | 0.9945 | 0.9852 |
+ | 0.0026 | 5.0 | 9230 | 0.0262 | 0.9946 | 0.9857 |
+ | 0.0016 | 6.0 | 11076 | 0.0305 | 0.9946 | 0.9857 |
+ | 0.0011 | 7.0 | 12922 | 0.0268 | 0.9949 | 0.9866 |
+ | 0.0009 | 8.0 | 14768 | 0.0299 | 0.9948 | 0.9867 |
+ | 0.0004 | 9.0 | 16614 | 0.0307 | 0.9949 | 0.9872 |
+ | 0.0001 | 10.0 | 18460 | 0.0326 | 0.9950 | 0.9873 |
 
 
  ### Framework versions
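
Reviewer note on the README.md hunks: the evaluation summary in the new card (Loss 0.0326, Accuracy 0.9950, F1 0.9873) matches the epoch 10 row of the added table, so it reports the final epoch. In that table the validation loss is lowest at epoch 1 while F1 is highest at epoch 10. The snippet below is a minimal sketch of how the two selection criteria can be compared. The rows are copied from the added table; the `Row` type and the `best_epoch` helper are hypothetical names introduced for illustration and are not part of this repository.

```python
from typing import NamedTuple


class Row(NamedTuple):
    epoch: int
    step: int
    train_loss: float
    val_loss: float
    accuracy: float
    f1: float


# Rows copied from the "+" side of the Training results table.
NEW_RUN = [
    Row(1, 1846, 0.0207, 0.0205, 0.9938, 0.9813),
    Row(2, 3692, 0.01, 0.0222, 0.9938, 0.9825),
    Row(3, 5538, 0.0053, 0.0231, 0.9946, 0.9851),
    Row(4, 7384, 0.0041, 0.0258, 0.9945, 0.9852),
    Row(5, 9230, 0.0026, 0.0262, 0.9946, 0.9857),
    Row(6, 11076, 0.0016, 0.0305, 0.9946, 0.9857),
    Row(7, 12922, 0.0011, 0.0268, 0.9949, 0.9866),
    Row(8, 14768, 0.0009, 0.0299, 0.9948, 0.9867),
    Row(9, 16614, 0.0004, 0.0307, 0.9949, 0.9872),
    Row(10, 18460, 0.0001, 0.0326, 0.9950, 0.9873),
]


def best_epoch(rows, metric, higher_is_better):
    """Return the epoch whose `metric` is best (hypothetical helper)."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda r: getattr(r, metric)).epoch


lowest_val_loss_epoch = best_epoch(NEW_RUN, "val_loss", higher_is_better=False)
highest_f1_epoch = best_epoch(NEW_RUN, "f1", higher_is_better=True)

# Sanity check: each logged step is a whole multiple of the first epoch's step count.
steps_consistent = all(r.step == r.epoch * NEW_RUN[0].step for r in NEW_RUN)

print(lowest_val_loss_epoch, highest_f1_epoch, steps_consistent)
```

Which criterion is appropriate depends on the task; the diff itself does not say which checkpoint selection rule, if any, was used.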
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7ee90deb8bcd045e0323b974e725a52a2b6d32c17a4bea629fdc3e7c5d8b6531
+ oid sha256:9fa0196d9602ff5659e7c0a6ea0ec74c62d1cf12c84f2ef2270e6416b9f16182
  size 265506928
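
Reviewer note on the model.safetensors change: the diff only touches a Git LFS pointer (a `version` line, an `oid` line with the hash algorithm and digest, and a `size` in bytes), so the weights themselves are not visible here. A downloaded file can be checked against the pointer as sketched below. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is the "+" side of this hunk.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest named by the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer content taken from the new side of the model.safetensors hunk.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:9fa0196d9602ff5659e7c0a6ea0ec74c62d1cf12c84f2ef2270e6416b9f16182
size 265506928
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Usage, assuming a local copy of the file exists: `matches_pointer("model.safetensors", pointer)`. The old and new pointers share the same size and differ only in the digest.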
runs/Aug20_15-12-34_f26f33d537cf/events.out.tfevents.1755703389.f26f33d537cf.4290.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1b3adc2b5f82361b763333539ed37f4569cd11a364060f607bd884e569042422
+ size 457
runs/Aug20_15-36-21_f26f33d537cf/events.out.tfevents.1755704185.f26f33d537cf.4290.2 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a4fc5ff5e8bd1db2dafed80d46b53a6b3a9797d04d09602fd019ba1bd078ace5
+ size 17133
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b61207ea7920ab22ff751f10c9c59d301bc8a17fb5ecebdc905a22aa8d287344
+ oid sha256:28d03d71c9e486d68f232cb782058fa35dfc9b53e932200f52a9f695d8d51053
  size 5777
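
Reviewer note on training_args.bin: this file holds the serialized training arguments for the run, and its contents are not readable from the pointer diff. The README.md hunks do list the values that changed (batch size 16 to 32, epochs 3 to 10) alongside a linear scheduler at learning rate 5e-05. The arithmetic they imply can be sketched as follows. Two assumptions are made here that the diff does not confirm: a single device with no gradient accumulation (so one step is one batch), and zero warmup steps for the linear schedule. The helper names are hypothetical.

```python
def example_range(steps_per_epoch, batch_size):
    """Smallest and largest dataset sizes that yield `steps_per_epoch` batches.

    Assumes one optimizer step per batch and a final partial batch that is kept.
    """
    return (steps_per_epoch - 1) * batch_size + 1, steps_per_epoch * batch_size


def linear_lr(step, total_steps, base_lr=5e-5, warmup_steps=0):
    """Linear decay to zero, with optional linear warmup (assumed 0 here)."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = total_steps - step
    return base_lr * max(0.0, remaining / max(1, total_steps - warmup_steps))


# Steps per epoch are read from the first row of each Training results table.
old_range = example_range(2003, 16)  # previous run: batch size 16
new_range = example_range(1846, 32)  # this run: batch size 32

total_steps = 1846 * 10  # matches the final Step value in the new table
halfway_lr = linear_lr(total_steps // 2, total_steps)

print(old_range, new_range, total_steps, halfway_lr)
```

Under these assumptions the two ranges do not overlap, which would mean something besides the batch size changed between the runs (for example the training set size or the effective batch size). The diff alone does not say which.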