Skip to content

Will max_length and truncation affect the similarity result? #41

@Dreamerxic

Description

@Dreamerxic

In eval_save.py:142

ret1=tokenizer([pairs[idx]], add_special_tokens=True,max_length=512,padding='max_length',truncation=True,return_tensors='pt') #tokenize them

Many functions have more than 512 tokens. Will max_length and truncation affect the similarity result? Is it ok to adjust a longer max_length?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions