PandaGPT is a general-purpose instruction-following model that can both see and hear. It can perform complex tasks such as generating detailed image descriptions, writing stories inspired by videos, and answering questions about audio. https://bit.ly/45TnAVl
PandaGPT: Multimodal AI Model for Vision, Audio, Text Tasks
