Okay that was way easier than expected Just pumped the MP4 video into the speech-to-text model and it understood it right away So now http://
TherapistAI.com supports video too, but only the audio part, will add frame grabs next with vision detection! Maybe 1 frame grab
TherapistAI Adds Video and Vision Detection Capabilities
By
–
