Meta did some experiments whether pirated materials helped, and the bottom line is that it helped a bit on a few of the benchmarks. I don't see numbers worth justifying commercial scale Copyright infringement though…
Meta’s Pirated Content Experiments Show Minimal Benchmark Improvements
By
–
