19/ In this case, we use probabilities again, but this time we compute the probability of generating the full answer sequence rather than just the letter: we sum the log-probabilities of the answer tokens and normalize by dividing by the number of tokens, so longer sequences are not penalized.
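A minimal sketch of this length-normalized scoring, assuming a Hugging Face causal LM (the model name, prompt, and candidate answers below are illustrative, not from the original): sum the log-probabilities of the answer tokens given the prompt, then divide by the number of answer tokens.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumption: any causal LM works here
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()


def normalized_answer_logprob(prompt: str, answer: str) -> float:
    """Average log-probability per answer token, given the prompt."""
    prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
    answer_ids = tokenizer(answer, return_tensors="pt").input_ids

    # Full sequence: prompt followed by the candidate answer.
    input_ids = torch.cat([prompt_ids, answer_ids], dim=1)

    with torch.no_grad():
        logits = model(input_ids).logits  # (1, seq_len, vocab)

    log_probs = torch.log_softmax(logits, dim=-1)
    n_answer = answer_ids.shape[1]

    # Logits at position i predict token i+1, so take the positions
    # that predict each answer token and pick out those tokens' log-probs.
    answer_positions = log_probs[0, prompt_ids.shape[1] - 1 : -1, :]
    token_log_probs = answer_positions.gather(
        1, answer_ids[0].unsqueeze(-1)
    ).squeeze(-1)

    # Sum of token log-probs, normalized by answer length.
    return token_log_probs.sum().item() / n_answer


# Usage: pick the candidate answer with the highest normalized score.
prompt = "Question: What is the capital of France?\nAnswer:"
candidates = [" Paris", " London", " Berlin"]
scores = {c: normalized_answer_logprob(prompt, c) for c in candidates}
print(max(scores, key=scores.get), scores)
```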