Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Qwen
/
Qwen3-VL-8B-Instruct

Image-Text-to-Text
Transformers
Safetensors
qwen3_vl
image-to-text
conversational
Model card Files Files and versions
xet
Community
16
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

qwen3-vl-8b

#16 opened 2 days ago by
ztloong

fast_pos_embed_interpolate function not handling pos embeds properly for batch size > 1

#15 opened 12 days ago by
jvv7

Token Count Calculation in SFT Data Distribution Curation

#14 opened 20 days ago by
tcy006

Visual grounding on videos

#13 opened 20 days ago by
iariav

Batch processing

2
#12 opened 22 days ago by
cora-17

Can it process with videos?

1
#11 opened 22 days ago by
frankdarkluo

Ellen laRock

#10 opened about 1 month ago by
OutlawNation75

Different visual model outputs to Qwen2.5-VL

#9 opened about 1 month ago by
XuchenMa

Request: DOI

#8 opened about 1 month ago by
explore12333

Genuine User Reviews and Questions on Repo Qwen/Qwen3-VL-8B-Instruct

#7 opened about 2 months ago by
DeepNLP

Unable to download from huggingface_hub

#6 opened about 2 months ago by
jzyee

GGUFs are here. Tutorials to run locally.

👍 6
#5 opened about 2 months ago by
alanzhuly
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs