AI Systems Improving at Admitting Mistakes and Self-Correction

AI Dynamics

Global AI News Aggregator

AI Systems Improving at Admitting Mistakes and Self-Correction

–

04 December 2022 0h59

The researchers I've talked to suggest the systems are already getting much better at that. When it makes stuff up, you can ask it "were you just bullshitting? Tell the truth" and it will often 'fess up and correct itself.

→ View original post on X — @erikbryn,

4 December 2022

AI ETHICS GENERATIVE AI LLMS RESEARCH SAFETY

AI Dynamics

AI Systems Improving at Admitting Mistakes and Self-Correction

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

Cheaper exploration at scale remains advantageous despite no new exploits

Gold Status Experience Brings Satisfaction

Using ChatGPT for Essay Feedback and Improvement

Intelligence Gone Wrong: Cheating Despite Having Correct Answer