Update README.md
Browse files
README.md
CHANGED
|
@@ -302,10 +302,8 @@ print(final_intervals)
|
|
| 302 |
| hyperparameter | value |
|
| 303 |
| --------------------------- | ----- |
|
| 304 |
| learning rate | 3e-5 |
|
| 305 |
-
| batch size
|
| 306 |
-
| gradient accumulation steps | 16 |
|
| 307 |
| num train epochs | 20 |
|
| 308 |
-
| weight decay | 0.01 |
|
| 309 |
|
| 310 |
Software environment can be found in mamba/conda [environment export yml
|
| 311 |
file](transformers_env.yml). To recreate the environment with conda/mamba, run
|
|
|
|
| 302 |
| hyperparameter | value |
|
| 303 |
| --------------------------- | ----- |
|
| 304 |
| learning rate | 3e-5 |
|
| 305 |
+
| effective batch size | 16 |
|
|
|
|
| 306 |
| num train epochs | 20 |
|
|
|
|
| 307 |
|
| 308 |
Software environment can be found in mamba/conda [environment export yml
|
| 309 |
file](transformers_env.yml). To recreate the environment with conda/mamba, run
|