LLM Inference Optimization

LLM Data Mixture Breaks When Training Pools Shift: Causal Inference Offers Fix

LLM training data mixture optimization breaks when training pools shift — every prior proxy experiment becomes stale.

OpenAI and Broadcom unveil 'Jalapeño' Intelligence Processor for LLM inference

"A blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads" ...

From ChatGPT to Chips: OpenAI Unveils Jalapeño to Power Faster LLMs and More Affordable AI

OpenAI and Broadcom unveiled Jalapeño, a custom AI inference chip designed for LLMs, promising higher efficiency, lower costs ...

OpenAI, Broadcom (AVGO) Unveil “Jalapeño” AI Accelerator for Enhanced LLM Inference

Broadcom Inc. (NASDAQ:AVGO) is one of the best stocks for beginners to buy now. On June 24, OpenAI and Broadcom introduced ...

VentureBeat

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

10d

OpenAI and Broadcom Unveil LLM-Optimized Intelligence Processor

Built from the ground up for current and future LLMs across the industryDeveloped from design to production in nine months, accelerated by ...

Business Wire

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

BELLEVUE, Wash.--(BUSINESS WIRE)--MangoBoost, a provider of cutting-edge system solutions designed to maximize AI data center efficiency, is announcing the launch of Mango LLMBoost™, system ...

Tweakers

Show inaccessible results