The Llama 3.2 11B and 90B models support a range of multimodal vision tasks. These capabilities enable scenarios such as captioning images for accessibility and providing natural-language insights from data visualizations.
Llama 3.2 Multimodal Vision Capabilities for Image and Data Analysis