I *guess* that the pure, unmodified Stack More Layers architecture used in *current* LLMs is stupid enough to be safe. Later advancements might break that capability barrier; the results would also be called 'LLMs'. The *task* of text prediction is general, and not safe.
Current LLM Architecture Safety and Future Advancement Risks
By
–
Leave a Reply