MIMIC-IT: Multi-Modal In-Context Instruction Tuning paper page: https://
huggingface.co/papers/2306.05
425
… High-quality instructions and responses are essential for the zero-shot performance of large language models on interactive natural language tasks. For interactive vision-language tasks
@_akhaliq
-

MIMIC-IT: Multi-Modal In-Context Instruction Tuning for Vision-Language Tasks
By
–
-

Modular Visual Question Answering Framework Using Code Generation
By
–
Modular Visual Question Answering via Code Generation paper page: https://
huggingface.co/papers/2306.05
392
… present a framework that formulates visual question answering as modular code generation. In contrast to prior work on modular approaches to VQA, our approach requires no additional -

MusicGen: Simple and Controllable Music Generation with Language Models
By
–
Simple and Controllable Music Generation paper page: https://
huggingface.co/papers/2306.05
284
… introduce MusicGen, a single Language Model (LM) that operates over several streams of compressed discrete music representation, i.e., tokens. Unlike prior work, MusicGen is comprised of a single-stage -

Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Experts
By
–
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts paper page: https://
huggingface.co/papers/2306.04
845
… Weight-sharing supernet has become a vital component for performance estimation in the state-of-the-art (SOTA) neural architecture -

BlenderBot 3x: Improving Language Models Through Organic Interactions
By
–
Improving Open Language Models by Learning from Organic Interactions paper page: https://
huggingface.co/papers/2306.04
707
… present BlenderBot 3x, an update on the conversational model BlenderBot 3, which is now trained using organic conversation and feedback data from participating users of the -

Video-ChatGPT: Detailed Video Understanding with Vision Language Models
By
–
-ChatGPT: Towards Detailed Understanding via Large Vision and Language Models paper page: https://
huggingface.co/papers/2306.05
424
… Conversation agents fueled by Large Language Models (LLMs) are providing a new way to interact with visual data. While there have been initial attempts -

INSTRUCTEVAL: Holistic Evaluation Framework for Instruction-Tuned LLMs
By
–
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models paper page: https://
huggingface.co/papers/2306.04
757
… Instruction-tuned large language models have revolutionized natural language processing and have shown great potential in applications such as conversational -

Instruction Tuning of Language Models on Open Datasets
By
–
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources paper page: https://
huggingface.co/papers/2306.04
751
… In this work we explore recent advances in instruction-tuning language models on a range of open instruction-following datasets. Despite recent claims that -
Clipdrop Launches Uncrop: Ultimate Aspect Ratio Editor
By
–
Clipdrop Launches Uncrop: The Ultimate Aspect Ratio Editor
— AK (@_akhaliq) 8 juin 2023
blog: https://t.co/5FOMhvkG0m pic.twitter.com/W0Y4nqYznqClipdrop Launches Uncrop: The Ultimate Aspect Ratio Editor blog: https://
stability.ai/blog/clipdrop-
launches-uncrop-the-ultimate-aspect-ratio-editor
… -

Trending AI News Stories and Papers
By
–
Trending AI news stories + papers https://
open.substack.com/pub/akhaliq/p/
trending-ai-news-stories-papers-55e
…