Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Serverless Deep Learning: From Notebook to Production with AWS Lambda
Cover image for Serverless Deep Learning: From Notebook to Production with AWS Lambda

Serverless Deep Learning: From Notebook to Production with AWS Lambda

Comments
1 min read
BIG STEPS TO TRANSFORMER (PART 2): BUILDING THE TRANSFORMER
Cover image for BIG STEPS TO TRANSFORMER (PART 2): BUILDING THE TRANSFORMER

BIG STEPS TO TRANSFORMER (PART 2): BUILDING THE TRANSFORMER

Comments
12 min read
I Took a 255MB BERT Model and SHRANK it by 74.8% using ONNX (It Now Runs OFFLINE on ANY Phone!)
Cover image for I Took a 255MB BERT Model and SHRANK it by 74.8% using ONNX (It Now Runs OFFLINE on ANY Phone!)

I Took a 255MB BERT Model and SHRANK it by 74.8% using ONNX (It Now Runs OFFLINE on ANY Phone!)

1
Comments 1
1 min read
Apple Unveils STARFlow-V: First End-to-End Normalizing Flow Model forHigh-Quality Video Generation
Cover image for Apple Unveils STARFlow-V: First End-to-End Normalizing Flow Model forHigh-Quality Video Generation

Apple Unveils STARFlow-V: First End-to-End Normalizing Flow Model forHigh-Quality Video Generation

Comments
2 min read
Part 6: Building the First Neural Network

Part 6: Building the First Neural Network

Comments
4 min read
Demystifying loss.backward(): How PyTorch Autograd Actually Works

Demystifying loss.backward(): How PyTorch Autograd Actually Works

Comments 1
10 min read
Shrink Your LLMs: FAIRY2I Makes Tiny AI a Reality

Shrink Your LLMs: FAIRY2I Makes Tiny AI a Reality

Comments
2 min read
I Skipped My Birthday to Give Go Its First Real ML Framework
Cover image for I Skipped My Birthday to Give Go Its First Real ML Framework

I Skipped My Birthday to Give Go Its First Real ML Framework

Comments
4 min read
Fundamentals of Large Language Models: Understanding LLM Architectures
Cover image for Fundamentals of Large Language Models: Understanding LLM Architectures

Fundamentals of Large Language Models: Understanding LLM Architectures

1
Comments
5 min read
Introduction to Computer Vision: Teaching Machines to See
Cover image for Introduction to Computer Vision: Teaching Machines to See

Introduction to Computer Vision: Teaching Machines to See

Comments
3 min read
Neural Network — A Simple, Beginner-Friendly Overview
Cover image for Neural Network — A Simple, Beginner-Friendly Overview

Neural Network — A Simple, Beginner-Friendly Overview

Comments
3 min read
Introduction to PyTorch: The Deep Learning Framework You Need to Know
Cover image for Introduction to PyTorch: The Deep Learning Framework You Need to Know

Introduction to PyTorch: The Deep Learning Framework You Need to Know

Comments
3 min read
Introduction to Deep Learning: A Complete Beginner’s Guide
Cover image for Introduction to Deep Learning: A Complete Beginner’s Guide

Introduction to Deep Learning: A Complete Beginner’s Guide

Comments
3 min read
Why GPUs Ate the AI World

Why GPUs Ate the AI World

Comments
8 min read
BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM
Cover image for BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM

BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM

Comments
13 min read
Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Comments
2 min read
Unlocking Neural Network Secrets: Scale-Invariant Geometry for Smarter AI by Arvind Sundararajan

Unlocking Neural Network Secrets: Scale-Invariant Geometry for Smarter AI by Arvind Sundararajan

Comments
2 min read
How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)
Cover image for How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

Comments
2 min read
🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost
Cover image for 🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

Comments
5 min read
How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning
Cover image for How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

Comments
6 min read
Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Comments
2 min read
The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

Comments
2 min read
My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie
Cover image for My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

Comments
3 min read
I trained a Robot Arm: What I failed to learn.
Cover image for I trained a Robot Arm: What I failed to learn.

I trained a Robot Arm: What I failed to learn.

4
Comments 4
3 min read
Open-Weight AI for High-Quality Image Generation & Editing
Cover image for Open-Weight AI for High-Quality Image Generation & Editing

Open-Weight AI for High-Quality Image Generation & Editing

Comments
4 min read
loading...