Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jinaai
/
jina-vlm
like
68
Follow
Jina AI
1.5k
Image-Text-to-Text
Transformers
Safetensors
30 languages
jvlm
text-generation
multimodal
vlm
vision-language
qwen3
siglip2
conversational
custom_code
arxiv:
2512.04032
License:
cc-by-nc-4.0
๐ช๐บ Region: EU
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
refs/pr/2
jina-vlm
/
assets
673 kB
3 contributors
History:
1 commit
gmastrapas
Model update
4586ed4
verified
21 days ago
jvlm_architecture.png
248 kB
xet
Model update
21 days ago
the_persistence_of_memory.jpg
Safe
425 kB
xet
Model update
21 days ago