Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jinaai
/
jina-vlm
like
22
Follow
Jina AI
1.48k
Image-Text-to-Text
Transformers
Safetensors
30 languages
jvlm
text-generation
multimodal
vlm
vision-language
qwen3
siglip2
conversational
custom_code
arxiv:
2512.04032
License:
cc-by-nc-4.0
๐ช๐บ Region: EU
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
refs/pr/2
jina-vlm
9.94 GB
3 contributors
History:
11 commits
florianhoenicke
fix: avoid in place operation on leaf
ae35f57
18 days ago
assets
Model update
19 days ago
.gitattributes
1.71 kB
Model update
19 days ago
README.md
28.1 kB
Model update
19 days ago
added_tokens.json
9.81 kB
Model update
21 days ago
blocks_jvlm.py
50 kB
fix: dtype
18 days ago
chat_template.jinja
866 Bytes
Model update
21 days ago
config.json
5.44 kB
Model update
21 days ago
configuration_jvlm.py
24.8 kB
Model update
19 days ago
generation_config.json
Safe
147 Bytes
Model update
21 days ago
image_processing_jvlm.py
43.6 kB
Model update
19 days ago
merges.txt
Safe
1.67 MB
Model update
21 days ago
model-00001-of-00003.safetensors
4.9 GB
xet
Model update
21 days ago
model-00002-of-00003.safetensors
3.78 GB
xet
Model update
21 days ago
model-00003-of-00003.safetensors
1.24 GB
xet
Model update
21 days ago
model.safetensors.index.json
61.5 kB
Model update
21 days ago
modeling_jvlm.py
23.8 kB
fix: avoid in place operation on leaf
18 days ago
preprocessor_config.json
1.36 kB
Model update
21 days ago
processing_jvlm.py
20.9 kB
Model update
19 days ago
processor_config.json
156 Bytes
Model update
21 days ago
special_tokens_map.json
7.8 kB
Model update
21 days ago
test_jvlm.py
18.1 kB
Model update
19 days ago
tokenizer.json
11.5 MB
xet
Model update
21 days ago
tokenizer_config.json
63.6 kB
Model update
21 days ago
vocab.json
Safe
2.78 MB
Model update
21 days ago