Transformer Programs Training and Evaluation #4

Description

@purvapruthi
  • Implement transformer programs with both absolute and relative position embeddings.
  • Train on a small composition size (K=3) for the uniform and diverse benchmarks, using a reduced vocabulary.
  • Evaluate the models within K first (cross-K evaluation comes later) on the uniform and diverse benchmarks.
  • Analyze patterns in the learned programs to understand the mechanism behind learning composition equivalence.
  • Check whether the learned programs are the same across equivalent compositions.
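
For the first item, a minimal sketch of how the two position schemes differ at the index level may help when wiring up the embeddings. This is not the implementation planned in this issue: the function names and the `max_distance` clipping parameter are assumptions made for illustration. Absolute positions index an embedding table by token position, while relative positions index it by the clipped offset between key and query.

```python
def absolute_position_ids(seq_len):
    """One index per token: its position in the sequence."""
    return list(range(seq_len))


def relative_position_ids(seq_len, max_distance):
    """One index per (query, key) pair: the offset key - query,
    clipped to [-max_distance, max_distance] and shifted to be
    non-negative so it can index a table of 2 * max_distance + 1 rows.
    `max_distance` is a hypothetical hyperparameter."""
    ids = []
    for q in range(seq_len):
        row = []
        for k in range(seq_len):
            offset = max(-max_distance, min(max_distance, k - q))
            row.append(offset + max_distance)
        ids.append(row)
    return ids


if __name__ == "__main__":
    print(absolute_position_ids(4))
    for row in relative_position_ids(3, max_distance=1):
        print(row)
```

Because the relative indices depend only on the offset, the same table row is reused wherever two tokens are the same distance apart. That could matter for the cross-K evaluation, since sequence lengths may differ between composition sizes.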
