
The next big leap in usable AI: the ability to understand images (see examples). Microsoft's Kosmos-1, a Multimodal Large Language Model (MLLM), performs a range of vision tasks and suggests that MLLMs may be capable of nonverbal reasoning. Link to paper: https://arxiv.org/abs/2302.14045
