Hi @chawins,
Thank you for this interesting work! I was wondering how this attack would work now that logit_bias no longer affects logprobs [1][2], as we can no longer use the trick the logprobs of the target tokens (if they don't appear in the top-5). Would love to know your thoughts on this change; thanks!
Apoorva
[1] Logit_bias does not work now- OpenAI Community
[2] x.com- Brian Huang
Hi @chawins,
Thank you for this interesting work! I was wondering how this attack would work now that
logit_biasno longer affectslogprobs[1][2], as we can no longer use the trick the logprobs of the target tokens (if they don't appear in the top-5). Would love to know your thoughts on this change; thanks!Apoorva
[1] Logit_bias does not work now- OpenAI Community
[2] x.com- Brian Huang