Skip to content

Checkpoints for best 1B models at 8K context #4

@Thiggel

Description

@Thiggel

Dear authors,

congrats for this amazing research!

I would love to experiment with nGPT, but I only have access to limited compute budget. Therefore, I am asking whether you could provide checkpoints for your baseline and nGPT models at 1B params at 8K context.

This would immensely help me in my own research.

Thanks a lot in advance.

Best regards,
Filipe Laitenberger

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions