It will also be interesting to see if eventually the system could detect dynamic inputs like nodding or facial expressions, not only voice. Right now people also can't express overlaps like "uh huh" without interrupting, it would be cool to be able to have more parallel flow.
Multimodal AI: Detecting Facial Expressions and Nodding in Conversations
By
–
Leave a Reply