Skip to content

logit_bias no longer affects logprobs #2

@apoorvaverma31

Description

@apoorvaverma31

Hi @chawins,

Thank you for this interesting work! I was wondering how this attack would work now that logit_bias no longer affects logprobs [1][2], as we can no longer use the trick the logprobs of the target tokens (if they don't appear in the top-5). Would love to know your thoughts on this change; thanks!

Apoorva

[1] Logit_bias does not work now- OpenAI Community
[2] x.com- Brian Huang

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions