AI Dynamics

Global AI News Aggregator

Instruction Tuning as an Emergent Ability in Large Language Models

One of the most interesting emergent abilities IMO is instruction tuning. Anthropic and Flan-LaMDA suggest that zero-shot performance can improve from RLHF and NLP benchmark instruction tuning (although text-davinci usually loses to code-davinci). https://arxiv.org/abs/2204.05862

→ View original post on X — @_jasonwei