New Linear-complexity Multiplication (L-Mul) algorithm claims it can reduce energy costs by 95% for element-wise tensor multiplications and 80% for dot products in large language models. It maintains ...
Arxiv – A survey of applications and end-to-end complexities. (337 pages. Oct 2023) by Researchers at AWS Center for Quantum Computing, Institute for Quantum Information, RWTH Aachen University ...