IMDB Experiment Training Details

Hi, thank you for this insightful and impactful work.

Could you please provide more details about the Small Scale experiment(IMDB)? In particular, I would appreciate clarification on how the positiveness and conciseness scores reported in Figures 2–4 are computed.

Additionally, I am a bit confused about the conciseness results. In Figure 4, the average conciseness score decrease after alignment with MaxMin-RLHF, whereas in Figure 5 it appears to increase. Are these conciseness scores computed in the same way, or different evaluation metrics are used?

Thank you so much for reading my questions!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IMDB Experiment Training Details #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

IMDB Experiment Training Details #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions