AI Dynamics

Global AI News Aggregator

RLHF LLMs Challenge: Prompt Engineering Evil Characters

Yes, it’s challenging to make RLHF-trained LLMs act evil, e.g. if you want a psychopathic character to act and talk like one. What usually happens is that they talk like nice people: they give compliments and show empathy. But you can prompt-engineer them to act closer to their intended…

→ View original post on X: @marek_rosa
