Skip to content

Greedy version reasoning#16

Open
ConstFr wants to merge 9 commits into
rvashurin:mainfrom
ConstFr:greedy_version_reasoning
Open

Greedy version reasoning#16
ConstFr wants to merge 9 commits into
rvashurin:mainfrom
ConstFr:greedy_version_reasoning

Conversation

@ConstFr

@ConstFr ConstFr commented Jun 24, 2025

Copy link
Copy Markdown

Switched to lm-poly generated tokens&probs, removed response re-generation, added step-wise baselines.

Statcalculators: ReasoningProbsCalculator, ReasoningStepsNLI, ReasoningKeywordsProbs(trash, serves as a scarecrow, do not look at that).

Estimators: StepsMaxSequenceProbability, StepsMaxTokenEntropy, StepsPerplexity, Step2QuestionNLI, Step2StepNLI, ProbasMinWithCoT(same as ReasoningKeywordsProbs).

p.s. sorry for ipynb's

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant