I saw that Laguna-XS.2 was tested for MoE + SWA (which is closer to Gemma 4), but not for Qwen3.5/Qwen3.6, where MoE is mixed with linear attention. I'm wondering whether it would still yield the 5x prefill/TTFT and 3x decode/throughput speedups on those models.