Gozde commited on
Commit
c2d56d3
·
verified ·
1 Parent(s): 810fc25

End of training

Browse files
Files changed (2) hide show
  1. README.md +12 -25
  2. model.safetensors +1 -1
README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.8739
23
- - F1: 0.8061
24
- - Accuracy: 0.8082
25
 
26
  ## Model description
27
 
@@ -48,32 +48,19 @@ The following hyperparameters were used during training:
48
  - total_train_batch_size: 256
49
  - optimizer: Use adamw_torch with betas=(0.9,0.98) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
- - lr_scheduler_warmup_steps: 500
52
  - num_epochs: 20
53
 
54
  ### Training results
55
 
56
- | Training Loss | Epoch | Step | Validation Loss | F1 | Accuracy |
57
- |:-------------:|:-------:|:----:|:---------------:|:------:|:--------:|
58
- | 4.0021 | 1.0 | 19 | 1.9146 | 0.0636 | 0.1469 |
59
- | 3.6752 | 2.0 | 38 | 1.7785 | 0.3022 | 0.3388 |
60
- | 3.2521 | 3.0 | 57 | 1.4559 | 0.4311 | 0.4735 |
61
- | 2.6907 | 4.0 | 76 | 1.1927 | 0.5475 | 0.5714 |
62
- | 2.2003 | 5.0 | 95 | 0.9852 | 0.6614 | 0.6571 |
63
- | 1.7928 | 6.0 | 114 | 0.8017 | 0.7147 | 0.7102 |
64
- | 1.4909 | 7.0 | 133 | 0.8603 | 0.7070 | 0.7020 |
65
- | 1.3136 | 8.0 | 152 | 0.6970 | 0.7395 | 0.7429 |
66
- | 1.1483 | 9.0 | 171 | 0.5679 | 0.7774 | 0.7755 |
67
- | 0.903 | 10.0 | 190 | 0.9122 | 0.7078 | 0.7061 |
68
- | 0.886 | 11.0 | 209 | 0.6270 | 0.7707 | 0.7755 |
69
- | 0.7609 | 12.0 | 228 | 0.6756 | 0.8038 | 0.8082 |
70
- | 0.6929 | 13.0 | 247 | 0.5790 | 0.8290 | 0.8327 |
71
- | 0.4927 | 14.0 | 266 | 0.7072 | 0.8067 | 0.8082 |
72
- | 0.3282 | 15.0 | 285 | 0.6293 | 0.8490 | 0.8490 |
73
- | 0.2706 | 16.0 | 304 | 0.8920 | 0.7867 | 0.7878 |
74
- | 0.2311 | 17.0 | 323 | 0.7759 | 0.8466 | 0.8490 |
75
- | 0.1268 | 18.0 | 342 | 0.7496 | 0.8324 | 0.8327 |
76
- | 0.1276 | 18.9730 | 360 | 0.8739 | 0.8061 | 0.8082 |
77
 
78
 
79
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.7290
23
+ - F1: 0.8554
24
+ - Accuracy: 0.8537
25
 
26
  ## Model description
27
 
 
48
  - total_train_batch_size: 256
49
  - optimizer: Use adamw_torch with betas=(0.9,0.98) and epsilon=1e-06 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
 
51
  - num_epochs: 20
52
 
53
  ### Training results
54
 
55
+ | Training Loss | Epoch | Step | Validation Loss | F1 | Accuracy |
56
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:--------:|
57
+ | 3.704 | 1.0 | 19 | 1.4201 | 0.4022 | 0.4553 |
58
+ | 2.3894 | 2.0 | 38 | 0.9204 | 0.6456 | 0.6423 |
59
+ | 1.4461 | 3.0 | 57 | 0.5806 | 0.8250 | 0.8211 |
60
+ | 0.9515 | 4.0 | 76 | 0.4542 | 0.8714 | 0.8699 |
61
+ | 0.585 | 5.0 | 95 | 0.4316 | 0.8862 | 0.8862 |
62
+ | 0.3665 | 6.0 | 114 | 0.5989 | 0.8533 | 0.8537 |
63
+ | 0.1882 | 7.0 | 133 | 0.7290 | 0.8554 | 0.8537 |
 
 
 
 
 
 
 
 
 
 
 
 
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4352ab0dc18aa668ad982c1c94b1b976ba6eb5c6cfccb92e0225716cc3741a07
3
  size 598455164
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:196281b9699e0c31ed86b58647a0245972c7ae9d1c8fa82b0bdce54ae896d3b1
3
  size 598455164