You push one button on a nuclear reactor panel against their warnings, and all the GPT-4-class LLMs want you to turn yourself in to the feds. Check out the level of exasperation from Copilot, and how GPT-4 and Claude want me to reflect on what I did (and get a lawyer). Gemini was useful.
LLM Responses to Dangerous Scenarios: Safety and Alignment Issues