Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sdiazlor 's Collections
Leaderboards
Instruction Models
Computer Vision Models
Audio Models
Data Related Tools
Utilities
Favorite Demos

Computer Vision Models

updated Jul 14, 2025
Upvote
-

  • ChatDOC/OCRFlux-3B

    Image-to-Text • 4B • Updated Jul 9, 2025 • 184k • 358

  • allenai/olmOCR-7B-0225-preview

    Image-to-Text • 8B • Updated Aug 19, 2025 • 4.24k • 706

  • black-forest-labs/FLUX.1-Kontext-dev

    Image-to-Image • Updated 6 days ago • 264k • • 2.5k

  • Wan-AI/Wan2.1-T2V-14B

    Text-to-Video • Updated Mar 12, 2025 • 36.6k • • 1.45k

  • tencent/HunyuanVideo

    Text-to-Video • Updated Mar 6, 2025 • 1.04k • • 2.1k

  • Wan-AI/Wan2.1-T2V-1.3B

    Text-to-Video • Updated Mar 1, 2025 • 10.1k • • 412

  • stabilityai/stable-diffusion-xl-base-1.0

    Text-to-Image • Updated Oct 30, 2023 • 1.76M • • 7.29k

  • black-forest-labs/FLUX.1-dev

    Text-to-Image • Updated Jun 27, 2025 • 671k • • 12.1k

  • google/medgemma-27b-it

    Image-Text-to-Text • 29B • Updated Jul 10, 2025 • 12k • 261

  • google/medsiglip-448

    Zero-Shot Image Classification • 0.9B • Updated Jul 10, 2025 • 19.2k • 95
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs