Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Fundamentals of Large Language Models: Understanding LLM Architectures
Cover image for Fundamentals of Large Language Models: Understanding LLM Architectures

Fundamentals of Large Language Models: Understanding LLM Architectures

Comments
5 min read
Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

Comments
4 min read
My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie
Cover image for My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

Comments
3 min read
🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost
Cover image for 🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

Comments
5 min read
Daily Artificial Intelligence Digest - Oct 23, 2025

Daily Artificial Intelligence Digest - Oct 23, 2025

Comments
3 min read
Developing a Variational Autoencoder in JAX using Antigravity

Developing a Variational Autoencoder in JAX using Antigravity

Comments
6 min read
Majestic Labs vs. the Memory Wall
Cover image for Majestic Labs vs. the Memory Wall

Majestic Labs vs. the Memory Wall

6
Comments
5 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data
Cover image for Tokenization in NLP: The Foundational Step That Turns Language Into Data

Tokenization in NLP: The Foundational Step That Turns Language Into Data

Comments
3 min read
Linear Algebra for AI — Part 1
Cover image for Linear Algebra for AI — Part 1

Linear Algebra for AI — Part 1

1
Comments
2 min read
🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks
Cover image for 🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

Comments 1
5 min read
Transformers and Attention: How LLMs Actually Process Text

Transformers and Attention: How LLMs Actually Process Text

4
Comments
19 min read
Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

1
Comments
4 min read
DragonMemory: Neural Sequence Compression for Production RAG

DragonMemory: Neural Sequence Compression for Production RAG

2
Comments
8 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)
Cover image for How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments 1
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.