Grad similarity test for completion dataset: Token scores (sum) vs Sequence score + Double BOS bug fix #300

Open
sweta-98 wants to merge 6 commits into EleutherAI:main from sweta-98:main

@sweta-98

  • Adds a test that checks whether the sum of the token scores equals the sequence score for a single sample.
    To run it: bergson_scrs_tok_vs_seq_validate_1sample.sh

  • Fixes the double-BOS bug in the tokenize function and adds print statements for debugging.
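
For readers unfamiliar with the double-BOS problem, here is a minimal sketch of one way to guard against it. It is not the change made in this PR. One common cause is text that already starts with a BOS token being tokenized with special tokens enabled, so the sequence begins with two BOS ids. The helper name `drop_duplicate_bos` and the id values are hypothetical, chosen only for illustration.

```python
def drop_duplicate_bos(token_ids, bos_id):
    """Collapse a run of leading BOS tokens into a single BOS.

    Hypothetical helper for illustration: `token_ids` is a list of ints
    and `bos_id` is the tokenizer's BOS id (assumed known to the caller).
    """
    start = 0
    # Advance while the current AND the next token are both BOS, so exactly
    # one BOS is left at the front of the returned list.
    while (
        start + 1 < len(token_ids)
        and token_ids[start] == bos_id
        and token_ids[start + 1] == bos_id
    ):
        start += 1
    return token_ids[start:]


# Example with a made-up BOS id of 1:
fixed = drop_duplicate_bos([1, 1, 42, 7], bos_id=1)  # [1, 42, 7]
```

Sequences with zero or one leading BOS pass through unchanged, so the guard is safe to apply unconditionally after tokenization.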

@sweta-98

Author

The test currently fails:

Sequence score: -9.000981
Token scores sum: -0.040792614
Token scores mean: -0.0027195076

---> TEST FAILED
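
For context, the equality being tested follows from linearity, under assumptions that may not hold in the actual pipeline. If the sequence gradient is the plain sum of the per-token gradients (a sum-reduced loss) and each score is an unnormalized dot product with the same query gradient, then the token scores must sum to the sequence score. The toy sketch below uses made-up gradient vectors and hypothetical function names, not bergson's API. With a mean-reduced loss, the sequence score would match the token-score mean instead, and per-gradient normalization or preconditioning would break the identity altogether.

```python
import math


def dot(u, v):
    """Plain dot product of two equal-length lists of floats."""
    return sum(a * b for a, b in zip(u, v))


def sequence_gradient(token_grads):
    """Sequence gradient under a sum-reduced loss: element-wise sum of token gradients."""
    return [sum(column) for column in zip(*token_grads)]


def compare_scores(token_grads, query_grad, rel_tol=1e-6):
    """Return (agree, token_scores, sequence_score) for unnormalized dot-product scores."""
    token_scores = [dot(g, query_grad) for g in token_grads]
    sequence_score = dot(sequence_gradient(token_grads), query_grad)
    agree = math.isclose(sum(token_scores), sequence_score, rel_tol=rel_tol, abs_tol=1e-9)
    return agree, token_scores, sequence_score


# Made-up numbers, purely illustrative: three tokens, two parameters.
toy_token_grads = [[1.0, 2.0], [0.5, -1.0], [-2.0, 0.25]]
toy_query_grad = [3.0, -4.0]
agree, token_scores, sequence_score = compare_scores(toy_token_grads, toy_query_grad)
```

In the output above, neither the token-score sum nor the mean is close to the sequence score, so a reduction mismatch alone does not seem to explain the gap. The reported mean times 15 equals the reported sum, which suggests 15 scored tokens in the sample.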
