Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
cuda
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
The Microsecond Lie: Why your Go timers are lying about the GPU
Eitamos Ring
Eitamos Ring
Eitamos Ring
Follow
May 23
The Microsecond Lie: Why your Go timers are lying about the GPU
#
ai
#
programming
#
go
#
cuda
Comments
Add Comment
3 min read
Profiling a CUDA Python Program with GPUFlight
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
May 22
Profiling a CUDA Python Program with GPUFlight
#
performance
#
python
#
cuda
#
gpu
Comments
Add Comment
10 min read
TensorRT `trt.Dims` SIGSEGV inside a GStreamer Python plugin — root cause and fix
Michał Warian
Michał Warian
Michał Warian
Follow
May 20
TensorRT `trt.Dims` SIGSEGV inside a GStreamer Python plugin — root cause and fix
#
tensorrt
#
gstreamer
#
python
#
cuda
Comments
Add Comment
4 min read
Calling CUDA from Go without cgo
Eitamos Ring
Eitamos Ring
Eitamos Ring
Follow
May 16
Calling CUDA from Go without cgo
#
ai
#
softwareengineering
#
go
#
cuda
1
 reaction
Comments
Add Comment
2 min read
Why CUDA kernels silently corrupt memory and how to catch the bug
Alan West
Alan West
Alan West
Follow
May 12
Why CUDA kernels silently corrupt memory and how to catch the bug
#
cuda
#
rust
#
debugging
#
gpu
Comments
Add Comment
5 min read
CUDA Out of Memory at 60% Utilization: Tracing PyTorch GPU Memory Fragmentation
Ingero Team
Ingero Team
Ingero Team
Follow
May 4
CUDA Out of Memory at 60% Utilization: Tracing PyTorch GPU Memory Fragmentation
#
gpu
#
cuda
#
pytorch
#
debugging
Comments
Add Comment
4 min read
How I optimized a Solana vanity address grinder to 44M keys/sec on GPU
Anton
Anton
Anton
Follow
Apr 29
How I optimized a Solana vanity address grinder to 44M keys/sec on GPU
#
cuda
#
solana
#
gpu
#
cryptocurrency
Comments
Add Comment
2 min read
From Black Magic to Science: The Evolution of the CUDA Optimization Skill
aa24aa
aa24aa
aa24aa
Follow
Apr 22
From Black Magic to Science: The Evolution of the CUDA Optimization Skill
#
cuda
#
agents
#
cutlass
#
triton
Comments
Add Comment
11 min read
Learning Resources Tech
cookie
cookie
cookie
Follow
Apr 22
Learning Resources Tech
#
webdev
#
cuda
#
programming
#
beginners
Comments
Add Comment
1 min read
512MiB 512MB — the silent trtexec bug
Tushar Thokdar
Tushar Thokdar
Tushar Thokdar
Follow
Apr 12
512MiB 512MB — the silent trtexec bug
#
tensorrt
#
jetson
#
cuda
#
debugging
Comments
Add Comment
2 min read
Memory Coalescing: Same computation, 6x Performance Difference
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Apr 9
Memory Coalescing: Same computation, 6x Performance Difference
#
cuda
#
gpu
#
aiops
#
cpp
Comments
Add Comment
6 min read
Setting Up NVIDIA Drivers and CUDA for ML/DL on Ubuntu 22.04
Abraham Audu
Abraham Audu
Abraham Audu
Follow
Apr 6
Setting Up NVIDIA Drivers and CUDA for ML/DL on Ubuntu 22.04
#
nvidia
#
cuda
#
ubuntu
#
machinelearning
1
 reaction
Comments
Add Comment
3 min read
Achieving Neuro‑Sama‑Tier Speech‑to‑Text for Your Local AI Companion (Whisper + CUDA + LivinGrimoire)
owly
owly
owly
Follow
Apr 7
Achieving Neuro‑Sama‑Tier Speech‑to‑Text for Your Local AI Companion (Whisper + CUDA + LivinGrimoire)
#
whisper
#
designpatterns
#
python
#
cuda
Comments
Add Comment
5 min read
CUDA Graphs: The 8-Year Overnight Success and the Observability Gap
Ingero Team
Ingero Team
Ingero Team
Follow
Apr 8
CUDA Graphs: The 8-Year Overnight Success and the Observability Gap
#
cuda
#
gpu
#
ebpf
#
ai
Comments
Add Comment
9 min read
124x Slower: What PyTorch DataLoader Actually Does at the Kernel Level
Ingero Team
Ingero Team
Ingero Team
Follow
Apr 1
124x Slower: What PyTorch DataLoader Actually Does at the Kernel Level
#
pytorch
#
gpu
#
python
#
cuda
1
 reaction
Comments
Add Comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account