Forem

# gpu

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
I Tried Speculative Decoding on RTX 4060 8GB — Every Config Was Slower Than Baseline

I Tried Speculative Decoding on RTX 4060 8GB — Every Config Was Slower Than Baseline

1
Comments
8 min read
Local LLM Security Criticals, Rust on GPU, & Deep Dive into PTX Optimization

Local LLM Security Criticals, Rust on GPU, & Deep Dive into PTX Optimization

Comments
3 min read
Building a Cost-Effective Local AI Server in 2026: Proxmox, PCIe Passthrough, and Surviving the GPU Shortage

Building a Cost-Effective Local AI Server in 2026: Proxmox, PCIe Passthrough, and Surviving the GPU Shortage

Comments
4 min read
Introducing vMetal: Run Your GPU Data Center Like a Hyperscaler

Introducing vMetal: Run Your GPU Data Center Like a Hyperscaler

Comments
4 min read
I Rented Out My GPU for Passive Income — Here’s What Happened After My First Week

I Rented Out My GPU for Passive Income — Here’s What Happened After My First Week

Comments
4 min read
Running a 4-Agent AI Fleet on a Single NVIDIA RTX 3060 Ti

Running a 4-Agent AI Fleet on a Single NVIDIA RTX 3060 Ti

1
Comments
6 min read
Scaling multi-node GPU data pipelines using Dask on Kubernetes
Cover image for Scaling multi-node GPU data pipelines using Dask on Kubernetes

Scaling multi-node GPU data pipelines using Dask on Kubernetes

1
Comments
10 min read
Unleash Large AI Models: Extend GPU VRAM with System RAM (Nvidia Greenboost)

Unleash Large AI Models: Extend GPU VRAM with System RAM (Nvidia Greenboost)

Comments
17 min read
Running Qwen2.5-32B on RTX 4060 8GB — Beating M4 at 10.8 t/s with llama.cpp

Running Qwen2.5-32B on RTX 4060 8GB — Beating M4 at 10.8 t/s with llama.cpp

1
Comments
7 min read
NVIDIA GTC 2026: What Vera Rubin and the Groq Partnership Mean for Your Inference Stack

NVIDIA GTC 2026: What Vera Rubin and the Groq Partnership Mean for Your Inference Stack

1
Comments
3 min read
Best GPU Rental for AI Training in India

Best GPU Rental for AI Training in India

3
Comments
2 min read
EVAL #005: GPU Cloud Showdown — Lambda Labs vs CoreWeave vs RunPod vs Vast.ai vs Modal vs AWS/GCP/Azure

EVAL #005: GPU Cloud Showdown — Lambda Labs vs CoreWeave vs RunPod vs Vast.ai vs Modal vs AWS/GCP/Azure

1
Comments
8 min read
Why Most AI Infrastructure Fails in Production
Cover image for Why Most AI Infrastructure Fails in Production

Why Most AI Infrastructure Fails in Production

Comments
3 min read
Migrating from DAS to DRA in OpenShift: The Pragmatic Guide
Cover image for Migrating from DAS to DRA in OpenShift: The Pragmatic Guide

Migrating from DAS to DRA in OpenShift: The Pragmatic Guide

Comments
5 min read
AutoKernel: Autoresearch for GPU Kernels!

AutoKernel: Autoresearch for GPU Kernels!

Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.