#gpu
LLM Auto-Tunes llama.cpp, SASS Latency Analysis, DLSS Frame Gen for RTX 40
soy · Apr 14 · #gpu #nvidia #hardware · 3 min read

"Just Add More VRAM" Is Physically Wrong: What HBM, CXL, and Unified Memory Couldn't Capture
plasmon · Apr 14 · #llm #gpu #vram · 4 min read

llama.cpp Settings Can Change 8GB Performance by 5x: Finding Optimal Values for the Major Options
plasmon · Apr 14 · #llm #llamacpp #gpu · 4 min read

One Query, Four GPUs: Tracing a Distributed Training Stall Across Nodes
Ingero Team · Apr 13 · #gpu #ebpf #distributedcomputing · 7 min read

Task Manager Is Lying About Your GPU Temps: How to Read the Real Data in Python
Yaroslav Pristupa · Apr 13 · #ai #hardware #softwaredevelopment #gpu · 4 min read

AMD ML Complete Stack
compilersutra · Apr 12 · #gpu #cpu #ai #llm · 1 min read

RTX 5090 cuBLAS Bug, Neural Texture Compression, Multi-GPU vLLM Inference
soy · Apr 11 · #gpu #nvidia #hardware · 3 min read

CUDA SGEMM Bug on RTX 5090, Kernel-Fusing for SGEMV, and Radeon RX 9070 XT Price Surge
soy · Apr 10 · #gpu #nvidia #hardware · 4 min read

TGI (Text Generation Inference): Install, Config, Troubleshoot
Rost · Apr 10 · #docker #gpu #observability #selfhosting · 9 min read

Memory Coalescing: Same Computation, 6x Performance Difference
Myoungho Shin · Apr 9 · #cuda #gpu #aiops #cpp · 6 min read

LLM GPU Breakthroughs: RT Cores, Llama.cpp Parallelism, AMD Optimizations
soy · Apr 9 · #gpu #nvidia #hardware · 3 min read

How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster
Alan West · Apr 9 · #machinelearning #deeplearning #python #gpu · 1 comment · 5 min read

How K-Means Clustering Works (Explained by Extracting Colors from Images)
Francesco Di Donato · Apr 9 · #webgl #machinelearning #javascript #gpu · 1 reaction · 3 min read

How I Stopped GGUF Models From Crashing My GPU: A Pre-flight VRAM Check
Dmytro Romanov · Apr 8 · #localllm #gpu #machinelearning #python · 4 min read

99.8% of LLM Inference Power Isn't Spent on Computation
plasmon · Apr 8 · #llm #gpu #hardware #ai · 7 min read