Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Cactooz
/
DeepMMAudio

Video-Text-to-Text
video-to-audio
Model card Files Files and versions
xet
Community
DeepMMAudio
6.34 GB
  • 1 contributor
History: 5 commits
Cactooz's picture
Cactooz
Add base MMAudio model retrain checkpoint
f6f7b5e verified 8 days ago
  • .gitattributes
    1.52 kB
    initial commit 8 days ago
  • README.md
    213 Bytes
    Update datasets used 8 days ago
  • base-model_checkpoint_full_448b_ckpt_300000.pth
    3.15 GB
    xet
    Add base MMAudio model retrain checkpoint 8 days ago
  • depth-model_checkpoint_full_448b_ckpt_300000.pth
    3.19 GB
    xet
    Add DeepMMAudio model checkpoint 8 days ago