1 Addition is all you need for energy-efficient language models (arxiv.org) arXiv 334 InvisibleUp 1y ago 126