Update README.md
Browse files
README.md
CHANGED
|
@@ -97,8 +97,8 @@ Pile-T5 can be loaded using the `AutoModelForSeq2SeqLM` functionality:
|
|
| 97 |
```python
|
| 98 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
| 99 |
|
| 100 |
-
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-
|
| 101 |
-
model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-
|
| 102 |
```
|
| 103 |
|
| 104 |
### Training
|
|
@@ -131,6 +131,7 @@ Intermediate checkpoints for Pile-T5 are accessible within this repository.
|
|
| 131 |
There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
|
| 132 |
checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
|
| 133 |
|
|
|
|
| 134 |
|
| 135 |
### Evaluations
|
| 136 |
|
|
|
|
| 97 |
```python
|
| 98 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
| 99 |
|
| 100 |
+
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-large")
|
| 101 |
+
model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-large")
|
| 102 |
```
|
| 103 |
|
| 104 |
### Training
|
|
|
|
| 131 |
There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
|
| 132 |
checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
|
| 133 |
|
| 134 |
+
The training loss (in tfevent format) and validation perplexity (in jsonl) can be found [here](https://huggingface.co/EleutherAI/pile-t5-large/blob/main/large.zip).
|
| 135 |
|
| 136 |
### Evaluations
|
| 137 |
|