Forem

# gpu

Posts

Two Ways to Move Tensors Without Stopping: Inside vLLM's Async GPU Transfer Patterns

2
Comments 1
7 min read
Building an AI App? Here’s the Inference Stack You Actually Need

1
Comments
4 min read
A Taxonomy of GPU Bugs: 19 Defect Classes for CUDA Verification

Comments
42 min read
Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs

Comments
11 min read
How PCIe, NVLink, and NUMA Topology Affect GPU Scheduling Outcomes

Comments
9 min read
How GPU Cloud Providers Handle Long-Tail Job Backlogs

Comments
7 min read
Building Neuro‑OS Desktop: A Lightweight Python Desktop Environment with Adaptive Optimization

Comments
2 min read
The Myth of “Just Add a GPU” in Machine Learning

2
Comments
3 min read
VHE: Why Gate-Level Simulation Breaks at Scale (and What We Tried Instead)

Comments
2 min read
Optimizing GPU Workload Placement in Kubernetes with NVLink-Aware Scheduling

Comments
4 min read
A Universal FPGA Compiler that Understands 42 Programming Languages

Comments
8 min read
LLMs Can Now Write GPU Kernels That Beat torch.compile

Comments
7 min read
Revolution in Voice AI: Natural Conversations with NVIDIA PersonaPlex! - Proje Defteri

2
Comments
4 min read
NVIDIA GPU Monitoring: Catch Thermal Throttling Before It Costs You $50k/Year

3
Comments
7 min read
DVTRGA2 The Official Graphics Engine of Neuro‑OS Genesis Enters a New Era

4
Comments
2 min read