An edge-case in GPT-3 with big implications: Inference is non-deterministic (even at temperature=0) when top-2 token probabilities are <1% different. So temperature=0 output is *very close* to deterministic, but actually isn't. Worth remembering.
GPT-3 Inference Non-Determinism at Temperature Zero Explained
By
–