It’s still pretty good! Maybe the simple solution is to first prompt it to get the camera move right without the character, using just the drone point of view. Then take that video and prompt it to add the character, using either a text description or an image reference.
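A rough sketch of that two-step idea, written as plain data rather than calls to any real tool. Everything here is an assumption for illustration: `GenerationStep`, `two_step_plan`, the prompt wording, and the `"<output of step 1>"` placeholder are all hypothetical, and how you actually pass a source video or image reference depends on the model you use.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class GenerationStep:
    """One request to a video model (hypothetical structure, not a real API)."""
    prompt: str
    source_video: Optional[str] = None      # clip to build on, if any
    character_image: Optional[str] = None   # optional image reference


def two_step_plan(camera_move: str, character: str,
                  character_image: Optional[str] = None) -> List[GenerationStep]:
    # Step 1: camera move only, with no character in frame.
    step1 = GenerationStep(
        prompt=f"Drone point of view, {camera_move}. Empty scene, no characters."
    )
    # Step 2: build on the step 1 clip and ask only for the character.
    step2 = GenerationStep(
        prompt=f"Keep the camera move unchanged. Add {character} to the scene.",
        source_video="<output of step 1>",  # placeholder, filled in by hand
        character_image=character_image,
    )
    return [step1, step2]
```

For example, `two_step_plan("slow orbit around a lighthouse", "a hiker in a red jacket")` gives two prompts you can paste in one after the other. Splitting it this way means the second prompt only has to describe the character, so the camera move is less likely to drift.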
Two-step prompting: camera move first, then add character