Skip to content

Missing definition - checkpointing #18

@madragonse

Description

@madragonse
    model_config = ConfigDict(extra="forbid")
    learning_rate: PositiveFloat
    weight_decay: PositiveFloat
    scheduler: object
    gradient_accumulation_steps: PositiveInt
    n_steps: PositiveInt
    gradient_clipping: PositiveFloat
    evaluation: dict
    # TODO missing checkpoint

# TODO missing checkpoint class
# EXAMPLE: 
# save:
#     interval: 1000000
#     model_checkpoint_filename: __model_optim_scheduler__.pt
#     training_state_filename: __training_state__.pt
#     path: checkpoints # Change me!

#    load:
#     type: nano
#     path: "/home/mstefaniak/llmrandom_cemetery/run_exp_2025-07-28_11-00-53/checkpoints/0/step_-1"
#     model_checkpoint_filename: "__training_state__.pt"```

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions