Run YOLO-World object detection on images
Clone a voice to say custom text
Remove background from images
Translate and transcribe speech in real-time