Greedy version reasoning by ConstFr · Pull Request #16 · rvashurin/lm-polygraph

ConstFr · 2025-06-24T07:32:21Z

Switched to the tokens and probabilities generated by lm-polygraph, removed response re-generation, and added step-wise baselines.

Stat calculators: ReasoningProbsCalculator, ReasoningStepsNLI, ReasoningKeywordsProbs (throwaway, only there as a scarecrow, please don't review it).

Estimators: StepsMaxSequenceProbability, StepsMaxTokenEntropy, StepsPerplexity, Step2QuestionNLI, Step2StepNLI, ProbasMinWithCoT (same as ReasoningKeywordsProbs).
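For readers unfamiliar with step-wise baselines, here is a minimal sketch of the general idea: score each reasoning step from its own token statistics, then report the worst step. It is not the code in this PR. The helper names (`split_into_steps`, `worst_step`, and the scoring functions), the step-length input format, and the max-over-steps aggregation are all assumptions made for illustration.

```python
import math
from typing import Callable, List, Sequence


def split_into_steps(values: Sequence, step_lengths: Sequence[int]) -> List[list]:
    """Cut a flat per-token sequence into consecutive reasoning steps.

    Hypothetical helper: assumes step boundaries are given as token counts.
    """
    if any(length <= 0 for length in step_lengths):
        raise ValueError("every step must contain at least one token")
    if sum(step_lengths) != len(values):
        raise ValueError("step lengths must cover every token exactly once")
    steps, start = [], 0
    for length in step_lengths:
        steps.append(list(values[start:start + length]))
        start += length
    return steps


def neg_log_prob(log_probs: Sequence[float]) -> float:
    """Negative log-probability of a step (higher means less certain)."""
    return -sum(log_probs)


def perplexity(log_probs: Sequence[float]) -> float:
    """Exponentiated mean negative log-probability of a step."""
    return math.exp(-sum(log_probs) / len(log_probs))


def mean_token_entropy(distributions: Sequence[Sequence[float]]) -> float:
    """Average Shannon entropy (in nats) of the per-token distributions."""
    def entropy(dist: Sequence[float]) -> float:
        return -sum(p * math.log(p) for p in dist if p > 0)
    return sum(entropy(d) for d in distributions) / len(distributions)


def worst_step(values: Sequence, step_lengths: Sequence[int],
               score: Callable[[list], float]) -> float:
    """Assumed aggregation: uncertainty of the least certain step."""
    return max(score(step) for step in split_into_steps(values, step_lengths))


# Three greedy tokens split into two steps of 2 and 1 tokens.
token_log_probs = [math.log(0.5), math.log(0.5), math.log(0.1)]
token_dists = [[0.5, 0.5], [0.5, 0.5], [1.0, 0.0]]
steps = [2, 1]

print(worst_step(token_log_probs, steps, neg_log_prob))
print(worst_step(token_log_probs, steps, perplexity))
print(worst_step(token_dists, steps, mean_token_entropy))
```

Taking the maximum flags a response as uncertain as soon as any single step is; a mean over steps would be an equally reasonable choice under different assumptions.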

P.S. Sorry for the .ipynb files.

ConstFr and others added 9 commits April 14, 2025 06:49

Added reasoning enhanced uncertainty estimation - ProbasMean

a6ea360

Improved documentation, typing, code style

5f7cc5d

evaluated on reasoning enhanced hotpot

0ba3310

Merge remote-tracking branch 'origin/main' into cot_eval

d6cdda2

benchmarking reasoning approach

464dc13

fixed target/output postprocessing

a26686c

fixed postprocessing v2

6e7c8d6

added reasoning steps UQ method

30f55b5

remove .ipynb from commit

0c01529

ConstFr wants to merge 9 commits into rvashurin:main from ConstFr:greedy_version_reasoning
