Input images can only convey so much info, especially in the “extend” case. I’d love to be able to pass in a few seconds (or more) of previous video I’m extending so the transition is smoother, voices match, etc.
Video AI Model Needs Previous Context for Smoother Transitions
By
–
Leave a Reply