Hi,
I really enjoy reading and doing one 2d pose estimation project using PoolFormer as backbone, also love the idea of metaformer. Have you thought about pretraining the model using MAE? Would you expect to have a performance boost as ViT does? Thanks in advance
Hi,
I really enjoy reading and doing one 2d pose estimation project using PoolFormer as backbone, also love the idea of metaformer. Have you thought about pretraining the model using MAE? Would you expect to have a performance boost as ViT does? Thanks in advance