Update README.md
Browse files
README.md
CHANGED
|
@@ -24,6 +24,8 @@ This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://hugg
|
|
| 24 |
|
| 25 |
You can explore the predictions of this model using this [Space](https://huggingface.co/spaces/davanstrien/iconclass-predictions).
|
| 26 |
|
|
|
|
|
|
|
| 27 |
## Model Description
|
| 28 |
|
| 29 |
This vision-language model has been fine-tuned to generate [Iconclass](https://iconclass.org/) classification codes from images. Iconclass is a comprehensive classification system for describing the content of images, particularly used in cultural heritage and art history contexts.
|
|
@@ -99,11 +101,10 @@ print(response)
|
|
| 99 |
### Training Dataset
|
| 100 |
The model was trained on a reformatted version of the Brill Iconclass AI Test Set [biglam/brill_iconclass](https://huggingface.co/datasets/biglam/brill_iconclass).
|
| 101 |
|
|
|
|
| 102 |
|
| 103 |
|
| 104 |
-
|
| 105 |
-
Training Procedure
|
| 106 |
-
<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>
|
| 107 |
|
| 108 |
This model was trained with SFT (Supervised Fine-Tuning).
|
| 109 |
|
|
@@ -118,13 +119,13 @@ Tokenizers: 0.21.4
|
|
| 118 |
|
| 119 |
### Limitations and Biases
|
| 120 |
|
| 121 |
-
The Iconclass classification system reflects biases from its creation period (1940s Netherlands)
|
| 122 |
-
Certain categories, particularly those related to human classification, may contain outdated or problematic terminology
|
| 123 |
-
Model performance may vary on images outside the Western art tradition due to dataset composition
|
| 124 |
|
| 125 |
### Citations
|
| 126 |
|
| 127 |
-
|
| 128 |
|
| 129 |
```bibtex
|
| 130 |
@misc{vonwerra2022trl,
|
|
|
|
| 24 |
|
| 25 |
You can explore the predictions of this model using this [Space](https://huggingface.co/spaces/davanstrien/iconclass-predictions).
|
| 26 |
|
| 27 |
+
**Note:** this model is a work in progress with the goal to see how far small models can be created to excel at this kind of specific but challenging task. As a result the base model used may change over time.
|
| 28 |
+
|
| 29 |
## Model Description
|
| 30 |
|
| 31 |
This vision-language model has been fine-tuned to generate [Iconclass](https://iconclass.org/) classification codes from images. Iconclass is a comprehensive classification system for describing the content of images, particularly used in cultural heritage and art history contexts.
|
|
|
|
| 101 |
### Training Dataset
|
| 102 |
The model was trained on a reformatted version of the Brill Iconclass AI Test Set [biglam/brill_iconclass](https://huggingface.co/datasets/biglam/brill_iconclass).
|
| 103 |
|
| 104 |
+
The dataset was reformatted into a message format suitable for SFT training.
|
| 105 |
|
| 106 |
|
| 107 |
+
### Training Procedure
|
|
|
|
|
|
|
| 108 |
|
| 109 |
This model was trained with SFT (Supervised Fine-Tuning).
|
| 110 |
|
|
|
|
| 119 |
|
| 120 |
### Limitations and Biases
|
| 121 |
|
| 122 |
+
The Iconclass classification system reflects biases from its creation period (1940s Netherlands).
|
| 123 |
+
Certain categories, particularly those related to human classification, may contain outdated or problematic terminology.
|
| 124 |
+
Model performance may vary on images outside the Western art tradition due to the dataset composition.
|
| 125 |
|
| 126 |
### Citations
|
| 127 |
|
| 128 |
+
Training framework
|
| 129 |
|
| 130 |
```bibtex
|
| 131 |
@misc{vonwerra2022trl,
|