GPU Memory Optimization

News

Optimizing LLM Training Under GPU Memory Constraints (Argonne, RIT)

Level, Multi-Path Offloading for LLM Pre-training to Break the GPU Memory Wall” was published by researchers at Argonne ...

Semiconductor Engineering4y

Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers

Harini Muthukrishnan (U of Michigan); David Nellans, Daniel Lustig (NVIDIA); Jeffrey A. Fessler, Thomas Wenisch (U of Michigan). Abstract—”Despite continuing research into inter-GPU communication ...

Nvidia Announces Rubin CPX GPU To Speed Long-Context AI

Long input to an LLM can benefit dramatically from the new GPU which has more fast memory and more processing power than ...

WinBuzzer3d

NVIDIA Unveils Rubin CPX, a Specialized GPU to Accelerate Long-Context AI Inference

NVIDIA has announced the Rubin CPX, a purpose-built GPU for 'disaggregated inference' that targets massive AI workloads, ...

SiliconANGLE1y

Nvidia acquires GPU cluster optimization startup Run:ai for reported $700M

Nvidia Corp. today disclosed that it has acquired Run:ai, a startup with software for optimizing the performance of graphics card clusters. The terms of the deal were not disclosed. TechCrunch, citing ...

Pliops Showcases XDP LightningAI’s Proven Impact at AI Infra Summit 2025

Demos Solutions Optimized for NVIDIA and AMD GPUs Highlight Breakthroughs in Inference EfficiencyJoins with Tensormesh to Simplify vLLM ...

Pinnacle Software's GPU Virtualization Patent: Large Model PD Separation Optimization, 50% Computing Power Savings

Recently, Guangzhou Pinnacle Software Co., Ltd. applied for a patent titled "A Method and System for Optimizing PD Separation of Large Models Based on GPU Virtualization," which is set to be published ...

TweakTown2mon

AMD demonstrates real-time GPU tree generation tech using only 55KB of memory

TL;DR: AMD's new Real-Time GPU Tree Generation technology leverages DirectX 12 GPU Work Graphs to create highly detailed, animated forests using minimal memory. This breakthrough enables efficient, ...

insideHPC10mon

Pain of Porting: The Limitations and Costs of GPU-Driven HPC-AI

[OPINION/INSIGHT ARTICLE] The surging demand for supercomputing power, driven by the insatiable appetite of AI/ML, big data analytics, and scientific research, has driven the HPC industry to push the ...

PC World18d

How to make your graphics card better by adjusting power limits

Is it really possible to make current graphics cards from AMD’s Radeon RX and Nvidia’s GeForce RTX series with just one click? Yes, it’s possible, and quite simple, as I will demonstrate in this ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results