Forem

# cuda

Posts

Build an AI‑Ready Linux Workstation Under $800 in 2024 – Step‑by‑Step Guide

Comments
5 min read
Part 4: The Setback and Heart Break

Comments
6 min read
Part 5: The Comeback

Comments
8 min read
NVIDIA Unleashes CUDA 13.1: CUDA Tile Takes Computing to the Next Level

Comments
2 min read
I Made A Fish Schooling Sim And Honestly It Was Fun As Hell

3
Comments
2 min read
Part 3: Accelerating Calculations using GPU

1
Comments 2
5 min read
GPU-Powered Networking: The Future of Blazing-Fast Model Training by Arvind Sundararajan

Comments
2 min read
Modeling Epidemic Spread on Large Graphs Using CUDA

Comments 1
29 min read
Verifying CUDA Kernels in Coq with Rust MIR (Introducing cuq)

3
Comments 2
1 min read
ROCm RX 6700 XT Installation Guide on Ubuntu 24.04

Comments
2 min read
CUDA Kernel Execution Debugging Journey

1
Comments
3 min read
Evolution of GPU Programming

Comments
26 min read
Building a CUDA-Accelerated Neural Network Library in Rust

1
Comments
7 min read
Custom CUDA Kernels Outperforming cuBLAS: Deep Dive into GPU Memory Optimization for Small-Batch ML Workloads

Comments
9 min read
Single bash script to install CUDA 12.8 on Ubuntu

Comments
2 min read
Just finished my GGUF-Shard

Comments
1 min read
Demystifying GPUs: From Core Architecture to Scalable Systems

81
Comments 2
12 min read
"A wild goose never laid a tame egg" - I rebuild the Xerxes DDoS Tool

1
Comments 2
6 min read
#Day1 of My Journey to Google

Comments
1 min read
WSL2 TensorFlow GPU Setup – RTX 4060 + Ubuntu 22.04 + CUDA 12.2 + cuDNN

Comments
2 min read
CUDA Deep Dive: Demystifying Kernels, Thread Hierarchies, and the GPU Execution Model: P-1

Comments 2
8 min read
NVIDIA CUDA Toolkit 12.8

2
Comments
2 min read
Building a JS pytorch clone: Performance investigation

Comments
9 min read
CUDA Series (2/3)

Comments
5 min read
CUDA Series (1/3)

5
Comments
11 min read