Forem

# gpu

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Docker Deployment for GPU-Accelerated Services
Cover image for Docker Deployment for GPU-Accelerated Services

Docker Deployment for GPU-Accelerated Services

1
Comments
2 min read
Deploybase: Track GPU Cloud and LLM Inference Pricing Across All Providers in Real Time
Cover image for Deploybase: Track GPU Cloud and LLM Inference Pricing Across All Providers in Real Time

Deploybase: Track GPU Cloud and LLM Inference Pricing Across All Providers in Real Time

5
Comments 1
1 min read
I Ran a 24-Hour AI Experiment on H100 GPUs. The Real Cost Will SHOCK You.

I Ran a 24-Hour AI Experiment on H100 GPUs. The Real Cost Will SHOCK You.

Comments
4 min read
Profiling GPU (CUDA) — What Is Actually Limiting Your Kernel?

Profiling GPU (CUDA) — What Is Actually Limiting Your Kernel?

1
Comments
4 min read
GPU Scheduling Deep Dive: How Cloud Providers Allocate GPUs for Multi-Tenant AI Workloads
Cover image for GPU Scheduling Deep Dive: How Cloud Providers Allocate GPUs for Multi-Tenant AI Workloads

GPU Scheduling Deep Dive: How Cloud Providers Allocate GPUs for Multi-Tenant AI Workloads

Comments
9 min read
The Ghost in the Batch: How vLLM Silently Switches Algorithms

The Ghost in the Batch: How vLLM Silently Switches Algorithms

Comments
5 min read
A Taxonomy of GPU Bugs: 19 Defect Classes for CUDA Verification

A Taxonomy of GPU Bugs: 19 Defect Classes for CUDA Verification

Comments
42 min read
The GPU Delusion: Why AI Is Getting Lazy
Cover image for The GPU Delusion: Why AI Is Getting Lazy

The GPU Delusion: Why AI Is Getting Lazy

7
Comments 3
6 min read
Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs

Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs

Comments
11 min read
Beyond nvidia-smi part — 1
Cover image for Beyond nvidia-smi part — 1

Beyond nvidia-smi part — 1

Comments
3 min read
Optimizing GPU Workload Placement in Kubernetes with NVLink-Aware Scheduling

Optimizing GPU Workload Placement in Kubernetes with NVLink-Aware Scheduling

Comments
4 min read
Attyx: tiny and fast GPU accelerated terminal emulator
Cover image for Attyx: tiny and fast GPU accelerated terminal emulator

Attyx: tiny and fast GPU accelerated terminal emulator

1
Comments
4 min read
Porting Vello's GPU Tile Rasterizer to Pure Go
Cover image for Porting Vello's GPU Tile Rasterizer to Pure Go

Porting Vello's GPU Tile Rasterizer to Pure Go

2
Comments
12 min read
LLMs Can Now Write GPU Kernels That Beat torch.compile
Cover image for LLMs Can Now Write GPU Kernels That Beat torch.compile

LLMs Can Now Write GPU Kernels That Beat torch.compile

Comments
7 min read
GPU Economics: What Inference Actually Costs in 2026

GPU Economics: What Inference Actually Costs in 2026

Comments 1
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.