FastVLM WebGPU
Real-time video captioning powered by FastVLM
Kontext image editing on FLUX[dev]
Relight images with Flux Kontext[dev]
Demo for multimodal understanding and generation
Generate a modified image based on a reference and prompt
Nanonets OCR / SmolDocling / MonkeyOCR / Typhoon OCR
OmniGen2: Unified Image Understanding and Generation
Repair and upscale images using prompts
Demo for Nanonets-OCR
Part-level image-to-3D generation
Demo of Normalized Attention Guidance for 4-step Wan2.1
Next-Gen High-Resolution 3D Model Generation
Image-to-3D Generation
Expressive Zero-Shot TTS
Extreme Super-Resolution via Scale Autoregression
Interact with an AI agent to perform web tasks