Non-AR models that expressively model the output distribution do not escape this problem. The issue is more with evaluating with point estimates. This one from Kingma is pretty accurate: https://x.com/dpkingma/status/1667239246938402816
Non-AR Models and Output Distribution Evaluation Methods