Interesting. You think it’s better to hide the training data then? Currently in NLP the only chatbots widely deployed (and replacing search) are closed source so the situation is maybe a bit different. I’ll try to take some time write a blog post on this topic one day. It’s a
Training Data Transparency in Large Language Models Debate
By
–
Leave a Reply