Last week, the buzz was around large language models. A day after DeepMind released Gopher, a 280-billion-parameter transformer language model, Google introduced the Generalist Language Model (GLaM), a trillion-weight model that uses sparsity.
The full version of GLaM has 1.2T total parameters across 64 experts per mixture of experts (MoE) layer wi…
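To illustrate the sparsity idea behind an MoE layer, here is a minimal sketch of top-k expert routing: a gating network scores the experts for each token, and only the few highest-scoring experts are actually run. All names, shapes, and the plain-NumPy gating below are illustrative assumptions for exposition, not GLaM's actual implementation.

```python
import numpy as np

def moe_layer(x, expert_weights, gate_weights, top_k=2):
    """Sparse mixture-of-experts layer: each token activates only top_k experts.

    x:              (tokens, d) input activations
    expert_weights: (num_experts, d, d) one weight matrix per expert (toy experts)
    gate_weights:   (d, num_experts) gating/router projection
    """
    logits = x @ gate_weights                         # (tokens, num_experts)
    top = np.argsort(logits, axis=1)[:, -top_k:]      # indices of chosen experts
    chosen = np.take_along_axis(logits, top, axis=1)  # their gate logits
    # Softmax only over the chosen experts' logits
    probs = np.exp(chosen - chosen.max(axis=1, keepdims=True))
    probs /= probs.sum(axis=1, keepdims=True)
    out = np.zeros_like(x)
    for t in range(x.shape[0]):           # only top_k experts run per token,
        for k in range(top_k):            # so compute scales with top_k,
            e = top[t, k]                 # not with the total expert count
            out[t] += probs[t, k] * (x[t] @ expert_weights[e])
    return out

rng = np.random.default_rng(0)
d, tokens, num_experts = 8, 4, 64
x = rng.standard_normal((tokens, d))
experts = rng.standard_normal((num_experts, d, d))
gate = rng.standard_normal((d, num_experts))
y = moe_layer(x, experts, gate)
print(y.shape)
```

The key point is in the loop: although 64 experts exist, each token pays the compute cost of only `top_k` of them, which is how a model can hold over a trillion parameters while activating a small fraction per token.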