view article Article Universal Image Segmentation with Mask2Former and OneFormer +1 Jan 19, 2023 • 15
view article Article Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2 Jun 29, 2023 • 3
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model +9 Aug 22, 2023 • 37
view article Article Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset +1 Mar 15, 2024 • 12
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community +1 Apr 15, 2024 • 190
view article Article Docmatix - a huge dataset for Document Visual Question Answering Jul 18, 2024 • 78