Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jinaai
/
jina-vlm
like
13
Follow
Jina AI
1.48k
Image-Text-to-Text
Transformers
Safetensors
30 languages
jvlm
text-generation
multimodal
vlm
vision-language
qwen3
siglip2
conversational
custom_code
arxiv:
2512.04032
License:
cc-by-nc-4.0
🇪🇺 Region: EU
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
fix-dtype-and-inplace
#2 opened 16 days ago by
florian-hoenicke
fix-dtype
#1 opened 16 days ago by
florian-hoenicke