Breakthrough new algorithm reduces AI energy consumption by 95 percent

BitEnergy AI’s L-Mul algorithm slashes AI power consumption by 95%, offering a promising solution to the growing energy demands of artificial intelligence.

Researchers at BitEnergy AI have developed an algorithm that could revolutionize how AI models consume power.

Researchers at BitEnergy AI have developed an algorithm that could revolutionize how AI models consume power. (CREDIT: CC BY-SA 3.0)

The rise of artificial intelligence has sparked significant advances across industries, but it’s also creating a new challenge: energy consumption. As companies rush to adopt AI technologies, the growing energy demands of AI models have become a critical issue.

Prominent players in the AI space, such as Nvidia, Microsoft, and OpenAI, have downplayed the energy concerns, but one company believes it has found a promising solution.

Researchers at BitEnergy AI have developed an algorithm that could revolutionize how AI models consume power. The technique, known as Linear-Complexity Multiplication or L-Mul, has the potential to cut energy consumption by as much as 95 percent without significant losses in accuracy or speed. This breakthrough could reshape the AI landscape by making it far more sustainable.

At the heart of the problem lies the use of floating-point numbers in AI computations. These numbers are essential for processing large or small values with the precision required for complex tasks, such as natural language processing and machine vision. However, this precision comes with a high energy cost.

Energy cost of various arithmetic operations cited from Horowitz (2014). (CREDIT: BitEnergy AI)

To execute these calculations, AI systems consume vast amounts of electricity. For example, it’s estimated that ChatGPT alone uses the equivalent of 18,000 US homes' daily electricity (564 megawatt-hours).

As AI adoption accelerates, the power requirements continue to grow. Analysts from the Cambridge Centre for Alternative Finance project that the AI industry could consume between 85 and 134 terawatt-hours annually by 2027. That’s a massive amount of energy, comparable to the electricity usage of medium-sized countries.

The L-Mul algorithm offers a way to mitigate this energy crisis. It works by approximating complex floating-point multiplications with simpler integer additions, which consume significantly less power. In tests, the algorithm reduced energy consumption by 95 percent for tensor multiplications and by 80 percent for dot products, both of which are common tasks in AI computations.

Despite this massive energy reduction, the models using L-Mul maintained high levels of accuracy. In fact, the algorithm showed only a 0.07 percent drop in performance across various AI tasks, including natural language processing and machine vision. This minor decrease in accuracy is a small price to pay for the significant energy savings it provides.

One of the most energy-intensive components in AI models is the attention mechanism, which is key to transformer-based systems like GPT. L-Mul integrates seamlessly into this component, making it particularly effective for these models. Tests conducted on popular AI frameworks, such as Llama and Mistral, demonstrated that not only did the models benefit from reduced energy consumption, but they also showed improved accuracy in some tasks.

While L-Mul has shown promising results, its full potential is not yet realized due to hardware limitations. The algorithm requires specialized hardware that current AI processors aren’t equipped to handle. However, BitEnergy AI is already planning to develop hardware and programming APIs that can support this new technique. These advancements could lead to a future where energy-efficient AI becomes the norm.

16-bit, 8-bit floating point numbers defined in IEEE 754 and on various hardware for tensor computations, and the 16-bit integer. MSB stands for most significant bit and LSB stands for least significant bit. (CREDIT: BitEnergy AI)

Still, there may be some resistance to this shift. Nvidia, the dominant player in the AI hardware market, could potentially slow down the adoption of L-Mul. The company has built its reputation on creating GPUs that are essential for current AI applications, and it might not readily embrace new technologies that could disrupt its market share.

Despite these challenges, the development of L-Mul offers a glimpse into a more sustainable future for AI. The ability to reduce energy consumption on such a massive scale without sacrificing accuracy or speed could be a game changer for the industry. If specialized hardware is developed to support the algorithm, and companies are willing to adopt it, the AI sector could significantly reduce its carbon footprint.

For those interested in the technical details, the research team has posted a preprint version of the study on Rutgers University's arXiv library. The study offers a deeper dive into the mathematical complexities of the L-Mul algorithm and how it functions in AI models.

Co-paring the process of regular floating-point multiplication and linear-complexity multiplication (L-Mul) between two fp32 numbers. (CREDIT: BitEnergy AI)

The road to energy-efficient AI may not be smooth, but the potential benefits are too significant to ignore. With the right hardware and industry support, this technology could play a key role in making AI more sustainable and environmentally friendly.

Note: Materials provided above by The Brighter Side of News. Content may be edited for style and length.


Like these kind of feel good stories? Get The Brighter Side of News' newsletter.


Rebecca Shavit is the Good News, Psychology, Behavioral Science, and Celebrity Good News reporter for the Brighter Side of News.