Content Filtering Strategies for Harmful Text Output

AI Dynamics

Global AI News Aggregator

14 January 2023 18h21

Explicitly filter out text that could be harmful or questionable before it reaches the end user. This can be done in two ways:
1. Simple keyword-based filtering, so that output containing known harmful terms is never shown to the end user.
2. Running the output through a separate content-filtering model alongside the main model, like the one used in the OpenAI Playground.
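
The two approaches can be combined in one output gate. The following is a minimal Python sketch, not a definitive implementation: the term list, the fallback message, the threshold, and the `classifier` callable are all hypothetical placeholders introduced for illustration. The classifier stands in for whatever content-filtering model is available and is assumed to return a harm score between 0 and 1.

```python
import re
from typing import Callable, Iterable, Optional

# Hypothetical placeholder terms; a real deployment would maintain its own list.
BLOCKED_TERMS = ["badword", "worse phrase"]


def build_pattern(terms: Iterable[str]) -> re.Pattern:
    """Compile one case-insensitive pattern that matches whole words or phrases."""
    # Longest terms first so longer phrases win over their own prefixes.
    escaped = sorted((re.escape(t) for t in terms), key=len, reverse=True)
    return re.compile(r"\b(?:" + "|".join(escaped) + r")\b", re.IGNORECASE)


BLOCKED_PATTERN = build_pattern(BLOCKED_TERMS)


def keyword_flagged(text: str) -> bool:
    """Approach 1: flag text that contains any blocked term."""
    return BLOCKED_PATTERN.search(text) is not None


def filter_output(
    text: str,
    classifier: Optional[Callable[[str], float]] = None,
    threshold: float = 0.5,
    fallback: str = "[response withheld]",
) -> str:
    """Return the text if it passes both checks, otherwise the fallback message."""
    # Cheap keyword check first.
    if keyword_flagged(text):
        return fallback
    # Approach 2: optional model-based check (assumed to return a score in [0, 1]).
    if classifier is not None and classifier(text) >= threshold:
        return fallback
    return text
```

Running the keyword check first keeps the common case cheap, while the model-based check can catch harmful text that contains no listed term. Whole-word matching is a deliberate choice here to reduce false positives on innocent words that merely contain a blocked term.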

Source: original post on X by @saboo_shubham_
