Proposed in December 2024, this is the earliest instance I can find of scaling the layerwise embeddings for transformers. Credit should be given to Braden Koszarsky.