Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jinaai
/
jina-vlm
like
54
Follow
Jina AI
1.49k
Image-Text-to-Text
Transformers
Safetensors
30 languages
jvlm
text-generation
multimodal
vlm
vision-language
qwen3
siglip2
conversational
custom_code
arxiv:
2512.04032
License:
cc-by-nc-4.0
🇪🇺 Region: EU
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
4d69be8
jina-vlm
/
processor_config.json
gmastrapas
Model update
c6ce1be
verified
22 days ago
raw
Copy download link
history
blame
156 Bytes
{
"always_start_with_space"
:
true
,
"auto_map"
:
{
"AutoProcessor"
:
"processing_jvlm.JinaVLMProcessor"
}
,
"processor_class"
:
"JinaVLMProcessor"
}