1Addition is all you need for energy-efficient language models (opens in new tab)(arxiv.org)334InvisibleUp1y ago126