Region-based Cluster Discrimination for Visual Representation Learning Paper • 2507.20025 • Published Jul 26 • 19
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training Paper • 2509.23661 • Published Sep 28 • 46