Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM
Cover image for BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM

BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM

Comments
13 min read
Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Comments
2 min read
Unlocking Neural Network Secrets: Scale-Invariant Geometry for Smarter AI by Arvind Sundararajan

Unlocking Neural Network Secrets: Scale-Invariant Geometry for Smarter AI by Arvind Sundararajan

Comments
2 min read
How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)
Cover image for How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

Comments
2 min read
Part 9: Generating Simba Network with Rust
Cover image for Part 9: Generating Simba Network with Rust

Part 9: Generating Simba Network with Rust

Comments
10 min read
Part 8: Proving the Universal Approximation Theorem with Rust
Cover image for Part 8: Proving the Universal Approximation Theorem with Rust

Part 8: Proving the Universal Approximation Theorem with Rust

Comments
8 min read
Part 7: CUDA Integration with Python
Cover image for Part 7: CUDA Integration with Python

Part 7: CUDA Integration with Python

1
Comments
6 min read
How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning
Cover image for How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

Comments
6 min read
Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Comments
2 min read
Learn to build a Deep Learning library from scratch in Python and NumPy (autograd, CNNs, ResNets) [free]
Cover image for Learn to build a Deep Learning library from scratch in Python and NumPy (autograd, CNNs, ResNets) [free]

Learn to build a Deep Learning library from scratch in Python and NumPy (autograd, CNNs, ResNets) [free]

Comments
1 min read
Deep learning through the lens of Felix Klein's Erlangen's
Cover image for Deep learning through the lens of Felix Klein's Erlangen's

Deep learning through the lens of Felix Klein's Erlangen's

3
Comments
5 min read
The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

Comments
2 min read
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
Cover image for The Transformer Architecture: A Deep Dive into How LLMs Actually Work

The Transformer Architecture: A Deep Dive into How LLMs Actually Work

8
Comments
25 min read
I trained a Robot Arm: What I failed to learn.
Cover image for I trained a Robot Arm: What I failed to learn.

I trained a Robot Arm: What I failed to learn.

4
Comments 4
3 min read
Open-Weight AI for High-Quality Image Generation & Editing
Cover image for Open-Weight AI for High-Quality Image Generation & Editing

Open-Weight AI for High-Quality Image Generation & Editing

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.