When Can We Use Pure BF16 in AI Models?
Of course. But lots of people have observed the same thing in other contexts, and it's what we'd expect from first principles. To me, the question isn't when we can't use pure bf16, but when we can.