Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
gpu
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
20 Years of GPUs in Numbers: How FLOPS and TDP Grew, and Who Led the NVIDIA vs AMD Duel (+ open dataset of 13,500 GPUs)
Max Vyaznikov
Max Vyaznikov
Max Vyaznikov
Follow
May 26
20 Years of GPUs in Numbers: How FLOPS and TDP Grew, and Who Led the NVIDIA vs AMD Duel (+ open dataset of 13,500 GPUs)
#
gpu
#
machinelearning
#
hardware
#
datascience
Comments
Add Comment
7 min read
PatentLLM: CUDA TileLang/Triton B200 5x Speedup, RTX 5090 Power, PTX Grammar
soy
soy
soy
Follow
May 25
PatentLLM: CUDA TileLang/Triton B200 5x Speedup, RTX 5090 Power, PTX Grammar
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
How to Detect GPU Waste in a Kubernetes Cluster
Sam Hosseini
Sam Hosseini
Sam Hosseini
Follow
May 25
How to Detect GPU Waste in a Kubernetes Cluster
#
kubernetes
#
gpu
#
mlops
#
devops
Comments
Add Comment
5 min read
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)
Alan West
Alan West
Alan West
Follow
May 24
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)
#
pytorch
#
performance
#
machinelearning
#
gpu
Comments
Add Comment
5 min read
RTX 5080 Undervolt Benchmarks, CGO-Free CUDA API Binding, & AMD GPU Compatibility Fix
soy
soy
soy
Follow
May 24
RTX 5080 Undervolt Benchmarks, CGO-Free CUDA API Binding, & AMD GPU Compatibility Fix
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
AMD GPU/AI Launches, Legacy Driver Update & CUDA Optimization Platform
soy
soy
soy
Follow
May 23
AMD GPU/AI Launches, Legacy Driver Update & CUDA Optimization Platform
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Running LTX-2.3 Alongside TTS on a Single 96GB GPU with a Cold-Start Architecture
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 22
Running LTX-2.3 Alongside TTS on a Single 96GB GPU with a Cold-Start Architecture
#
gpu
#
python
#
machinelearning
#
ai
Comments
Add Comment
5 min read
HiDream Skeleton Mode: Prompt Beats OpenPose Ref — 8 Patterns Benchmarked
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 22
HiDream Skeleton Mode: Prompt Beats OpenPose Ref — 8 Patterns Benchmarked
#
ai
#
python
#
machinelearning
#
gpu
Comments
Add Comment
11 min read
RTX 5090 Cooling, BeeLlama VRAM Opts, Resizable BAR Performance Gains
soy
soy
soy
Follow
May 22
RTX 5090 Cooling, BeeLlama VRAM Opts, Resizable BAR Performance Gains
#
gpu
#
nvidia
#
hardware
1
 reaction
Comments
Add Comment
4 min read
Five Years Later, I Finally Have 96GB VRAM — What It Actually Unlocks for Agent Loops
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 22
Five Years Later, I Finally Have 96GB VRAM — What It Actually Unlocks for Agent Loops
#
gpu
#
ai
#
machinelearning
#
python
Comments
Add Comment
8 min read
HiDream-O1-Image 3–8x Faster: Benchmarking Steps, CFG, and Resolution
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 22
HiDream-O1-Image 3–8x Faster: Benchmarking Steps, CFG, and Resolution
#
ai
#
machinelearning
#
gpu
#
python
Comments
Add Comment
5 min read
Turning a 1-Line Idea Into a 40-Second Short with a 10-Beat Local Video Pipeline
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 22
Turning a 1-Line Idea Into a 40-Second Short with a 10-Beat Local Video Pipeline
#
python
#
ai
#
machinelearning
#
gpu
Comments
Add Comment
7 min read
Cutting LTX-2 22B Peak VRAM by 40% with fp8_cast — and Why optimum-quanto Was a Trap
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 22
Cutting LTX-2 22B Peak VRAM by 40% with fp8_cast — and Why optimum-quanto Was a Trap
#
ai
#
machinelearning
#
gpu
#
python
Comments
Add Comment
7 min read
Profiling a CUDA Python Program with GPUFlight
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
May 22
Profiling a CUDA Python Program with GPUFlight
#
performance
#
python
#
cuda
#
gpu
Comments
Add Comment
10 min read
LLM Compilers, GGUF Quantization, & Radeon RX 9060 Benchmarks
soy
soy
soy
Follow
May 20
LLM Compilers, GGUF Quantization, & Radeon RX 9060 Benchmarks
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account