Matched the action but not the timing
This video was generated with AI, and I was hoping that MMAudio would create the audio for it. It did create audio of someone running, but the timing is wrong and it's only one person. Am I doing something wrong?
Thanks for trying it out! Our models do have failure modes. In this instance, it seems like the slow-motion video is confusing the model, which has not seen enough slow-motion footage during training. On a separate note, it also does not do footsteps very well, again probably due to training data limitations.
I tried another "running" example below. It still isn't great, but without the slow motion the timing seems to be more accurate.
Thanks. Maybe I can generate the audio at full speed and then slow it down (stretching the timing without shifting the pitch, which is possible).
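For anyone wanting to try the slow-down idea, here is a minimal sketch, assuming ffmpeg is installed and on the PATH. It is not part of MMAudio. It uses ffmpeg's `atempo` audio filter, which changes tempo without changing pitch, and chains several stages so each one stays within the 0.5 to 2.0 range. The helper names and file names are hypothetical.

```python
import subprocess


def atempo_chain(speed):
    """Build an ffmpeg audio filter string that plays audio at `speed` times
    the original tempo (speed < 1 slows it down) while keeping the pitch."""
    if speed <= 0:
        raise ValueError("speed must be positive")
    factors = []
    remaining = speed
    # Split the change into stages that each stay within 0.5..2.0.
    while remaining < 0.5:
        factors.append(0.5)
        remaining /= 0.5
    while remaining > 2.0:
        factors.append(2.0)
        remaining /= 2.0
    factors.append(remaining)
    return ",".join(f"atempo={f:g}" for f in factors)


def slow_audio_command(src, dst, speed):
    """Return the ffmpeg argument list; file names are placeholders."""
    return ["ffmpeg", "-y", "-i", src, "-filter:a", atempo_chain(speed), dst]


if __name__ == "__main__":
    # Hypothetical files: audio generated from the full-speed clip,
    # stretched to match a video slowed to quarter speed.
    cmd = slow_audio_command("generated_full_speed.wav", "slowed.wav", 0.25)
    subprocess.run(cmd, check=True)
```

Set `speed` to the same factor the video was slowed by (for example 0.25 for quarter speed). Heavy stretching can sound smeared, so results will vary.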
I get that all the time too; I have to shift the audio in a video editor.