Thanks for sharing! We just launched GPT-4 Turbo with Vision, and we recently added word-level timestamps in Whisper for precise text-audio synchronization. We’re excited about multi-modality, and we’d love to hear if this fits your use case!
GPT-4 Turbo Vision et Whisper timestamps pour synchronisation multimodale
By
–
Leave a Reply