haven't posted anything about (2) yet but here's the TLDR: • given LM output probabilities, we can infer what the input prompt was
• we built a model that can do this
• most APIs don't give you probabilities, but we came up with a clever algorithm to get them using logit bias
Inferring LM Prompts from Output Probabilities Without API Access
By
–