Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Revisiting the Causal Mechanisms Behind Policy Gradients

Revisiting the Causal Mechanisms Behind Policy Gradients

Comments
5 min read
The Pervasive Role and Hidden Limitations of Softmax

The Pervasive Role and Hidden Limitations of Softmax

Comments
6 min read
Mamba-3 and AttnRes: AI Architecture Research Is Finally Building for Inference, Not Just Training

Mamba-3 and AttnRes: AI Architecture Research Is Finally Building for Inference, Not Just Training

Comments
7 min read
Defending Vibe Coding: Why Syntax Might Not Be the Bottleneck Anymore

Defending Vibe Coding: Why Syntax Might Not Be the Bottleneck Anymore

1
Comments
2 min read
Why I built a Rust deep learning framework (and what I got wrong twice first)

Why I built a Rust deep learning framework (and what I got wrong twice first)

Comments
5 min read
The Mathematics That Make 1.58-bit Weights Work: How BitNet b1.58 Survives Its Own Quantization

The Mathematics That Make 1.58-bit Weights Work: How BitNet b1.58 Survives Its Own Quantization

1
Comments
7 min read
I Tried to Run VGG19 on a CPU… It Failed. So I Fixed It."
Cover image for I Tried to Run VGG19 on a CPU… It Failed. So I Fixed It."

I Tried to Run VGG19 on a CPU… It Failed. So I Fixed It."

1
Comments
3 min read
Implementing ✨ Bayesian Belief Tracking in LLM Agents 🤖
Cover image for Implementing ✨ Bayesian Belief Tracking in LLM Agents 🤖

Implementing ✨ Bayesian Belief Tracking in LLM Agents 🤖

Comments
4 min read
I built a real-time Drivable Area Segmentation model for Indian roads (Here is how it runs at 55 FPS)

I built a real-time Drivable Area Segmentation model for Indian roads (Here is how it runs at 55 FPS)

7
Comments
2 min read
My Study Notes on Convolutional Neural Networks (CNN)
Cover image for My Study Notes on Convolutional Neural Networks (CNN)

My Study Notes on Convolutional Neural Networks (CNN)

1
Comments
3 min read
स्पीकर डायराइज़ेशन SYSTEM In Hindi

स्पीकर डायराइज़ेशन SYSTEM In Hindi

1
Comments
19 min read
RTX 4090 vs RTX 3090 for AI/ML: Is the Upgrade Worth It?

RTX 4090 vs RTX 3090 for AI/ML: Is the Upgrade Worth It?

Comments
3 min read
arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference

arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference

1
Comments
6 min read
Recurrent Neural Networks: Giving Networks Memory

Recurrent Neural Networks: Giving Networks Memory

Comments
4 min read
Beyond ReconVLA: Annotation-Free Visual Grounding via Language-Attention Masked Reconstruction
Cover image for Beyond ReconVLA: Annotation-Free Visual Grounding via Language-Attention Masked Reconstruction

Beyond ReconVLA: Annotation-Free Visual Grounding via Language-Attention Masked Reconstruction

1
Comments 2
9 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.