metadata
datasets:
- dmarsili/Omni3D-Bench
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen3-8B
tags:
- reasoning
- visual-programming
- program-synthesis
- visual-reasoning
license: mit
Model Card for VALOR-8B
This is the RL-tuned Qwen3-8B model from the paper: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers
For further information please refer to the project webpage, paper, and repository.
Citation
If you use VALOR in your research, please consider citing our work:
BibTeX:
@misc{marsili2025labelsproblemtrainingvisual,
title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers},
author={Damiano Marsili and Georgia Gkioxari},
year={2025},
eprint={2512.08889},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.08889},
}