Forem

# cuda

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
CUDA Deep Dive: Demystifying Kernels, Thread Hierarchies, and the GPU Execution Model: P-1
Cover image for CUDA Deep Dive: Demystifying Kernels, Thread Hierarchies, and the GPU Execution Model: P-1

CUDA Deep Dive: Demystifying Kernels, Thread Hierarchies, and the GPU Execution Model: P-1

Comments 2
8 min read
NVIDIA CUDA Toolkit 12.8

NVIDIA CUDA Toolkit 12.8

2
Comments
2 min read
Building a JS pytorch clone: Performance investigation

Building a JS pytorch clone: Performance investigation

Comments
9 min read
CUDA Series (2/3)
Cover image for CUDA Series (2/3)

CUDA Series (2/3)

Comments
5 min read
CUDA Series (1/3)
Cover image for CUDA Series (1/3)

CUDA Series (1/3)

5
Comments
11 min read
Implementing DeepSeek-R1 Tool Calls with OpenWebUI and Llama.cpp for Local AI Workflows

Implementing DeepSeek-R1 Tool Calls with OpenWebUI and Llama.cpp for Local AI Workflows

Comments
2 min read
Accelerating OpenCV with CUDA on Jetson Orin NX: A Complete Build Guide
Cover image for Accelerating OpenCV with CUDA on Jetson Orin NX: A Complete Build Guide

Accelerating OpenCV with CUDA on Jetson Orin NX: A Complete Build Guide

1
Comments
4 min read
Running Nvidia COSMOS on A100 80Gb
Cover image for Running Nvidia COSMOS on A100 80Gb

Running Nvidia COSMOS on A100 80Gb

2
Comments
2 min read
Global vs Static in C++

Global vs Static in C++

Comments
1 min read
OpenMP Data-Sharing Clauses: Differences Explained

OpenMP Data-Sharing Clauses: Differences Explained

2
Comments
2 min read
"Learn HPC with me" kickoff

"Learn HPC with me" kickoff

Comments
1 min read
Snooping on your GPU: Using eBPF to Build Zero-instrumentation CUDA Monitoring
Cover image for Snooping on your GPU: Using eBPF to Build Zero-instrumentation CUDA Monitoring

Snooping on your GPU: Using eBPF to Build Zero-instrumentation CUDA Monitoring

7
Comments 1
15 min read
Qt error when opening ncu-ui
Cover image for Qt error when opening ncu-ui

Qt error when opening ncu-ui

Comments
1 min read
Using Polars/Tensorflow with NVIDIA GPU (CUDA), on Windows using WSL2

Using Polars/Tensorflow with NVIDIA GPU (CUDA), on Windows using WSL2

3
Comments
4 min read
Lattice Generation using GPU computing in realtime

Lattice Generation using GPU computing in realtime

Comments
1 min read
Tensorman: TensorFlow with CUDA made easy
Cover image for Tensorman: TensorFlow with CUDA made easy

Tensorman: TensorFlow with CUDA made easy

1
Comments
2 min read
Simplifying PyTorch Installation: Introducing Install.PyTorch
Cover image for Simplifying PyTorch Installation: Introducing Install.PyTorch

Simplifying PyTorch Installation: Introducing Install.PyTorch

2
Comments
1 min read
Setup Nx lib and EXLA to run NX/AXON with CUDA

Setup Nx lib and EXLA to run NX/AXON with CUDA

1
Comments
1 min read
NVIDIA GPU & CUDA

NVIDIA GPU & CUDA

1
Comments
6 min read
Deep Learning with “AWS Graviton2 + NVIDIA Tensor T4G” for as low as free* with CUDA 12.2
Cover image for Deep Learning with “AWS Graviton2 + NVIDIA Tensor T4G” for as low as free* with CUDA 12.2

Deep Learning with “AWS Graviton2 + NVIDIA Tensor T4G” for as low as free* with CUDA 12.2

2
Comments
11 min read
My Experience Running HeadJobs: Generative AI at Home
Cover image for My Experience Running HeadJobs: Generative AI at Home

My Experience Running HeadJobs: Generative AI at Home

Comments
3 min read
Why Your AWS Deep Learning AMI is Holding You Back and How to Fix
Cover image for Why Your AWS Deep Learning AMI is Holding You Back and How to Fix

Why Your AWS Deep Learning AMI is Holding You Back and How to Fix

7
Comments 2
3 min read
NVIDIA's $200B Overnight Gain: Trending CUDA Repos Revealed! ⚡️
Cover image for NVIDIA's $200B Overnight Gain: Trending CUDA Repos Revealed! ⚡️

NVIDIA's $200B Overnight Gain: Trending CUDA Repos Revealed! ⚡️

30
Comments 2
5 min read
Trending CUDA repos of the week 📈
Cover image for Trending CUDA repos of the week 📈

Trending CUDA repos of the week 📈

13
Comments 1
2 min read
Dockerize CUDA-Accelerated Applications
Cover image for Dockerize CUDA-Accelerated Applications

Dockerize CUDA-Accelerated Applications

2
Comments
4 min read
loading...