I know this is kind of goofy but it does illustrate how hard guardrails are when users can ask AI anything at all Do you reassure someone who seems genuinely worried? Play along? Take it seriously? Refuse to answer for fear that this is part of a jailbreak for a hacking attempt?
AI Guardrails Dilemma: Balancing Safety and User Reassurance
By
–
Leave a Reply