RVC
No NSFW
Remove silence and split audio into segments
Generate lip-synced video from audio and reference video