Definitely a big limitation. Overall, the problem with synthetic data is generalization beyond the benchmarks that they target in the first place. This is where the most interesting results can be found.
Synthetic Data Generalization Limitations Beyond Benchmarks
By
–