We are excited to release Llama-3-EvoVLM-JP-v2, the first open-source Japanese vision-language model capable of reasoning across multiple images! This model was developed quite inexpensively, in a resource-efficient manner, using evolutionary model merge. https://
sakana.ai/evovlm-jp/
Llama-3-EvoVLM-JP-v2: Open-Source Japanese Vision-Language Model Released
By
–
