The researchers I've talked to suggest the systems are already getting much better at that. When one makes stuff up, you can ask it, "Were you just bullshitting? Tell the truth," and it will often 'fess up and correct itself.
AI Systems Improving at Admitting Mistakes and Self-Correction