Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

๐Ÿ‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
AI News This Week: Breaking Down the Latest Developments in Multimodal Large Language Models

AI News This Week: Breaking Down the Latest Developments in Multimodal Large Language Models

1
Comments 1
5 min read
Part 6 โ€” From Zero to ChatGPT: The 4 Learning Types That Built Modern AI

Part 6 โ€” From Zero to ChatGPT: The 4 Learning Types That Built Modern AI

3
Comments
11 min read
Introduction to ML Compilers + Roadmap (MLIR, TVM, GPU Kernels)
Cover image for Introduction to ML Compilers + Roadmap (MLIR, TVM, GPU Kernels)

Introduction to ML Compilers + Roadmap (MLIR, TVM, GPU Kernels)

Comments
1 min read
Benchmark Shadows Study: Data Alignment Limits LLM Generalization

Benchmark Shadows Study: Data Alignment Limits LLM Generalization

Comments
6 min read
This Week in AI: Top News and Trends to Watch (April 11, 2026)

This Week in AI: Top News and Trends to Watch (April 11, 2026)

Comments
4 min read
AI News Update: April 10, 2026 - A Week of Breakthroughs and Concerns

AI News Update: April 10, 2026 - A Week of Breakthroughs and Concerns

Comments
5 min read
ฤฦฐa World Model Tแปซ Bแบฃn Demo ฤแบนp Mแบฏt Thร nh Trแบฃi Nghiแป‡m Tฦฐฦกng Tรกc Thแปฑc Sแปฑ Trรชn GPU Phแป• Thรดng

ฤฦฐa World Model Tแปซ Bแบฃn Demo ฤแบนp Mแบฏt Thร nh Trแบฃi Nghiแป‡m Tฦฐฦกng Tรกc Thแปฑc Sแปฑ Trรชn GPU Phแป• Thรดng

Comments
15 min read
Why Reasoning Models Changed Everything
Cover image for Why Reasoning Models Changed Everything

Why Reasoning Models Changed Everything

Comments
8 min read
How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster
Cover image for How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster

How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster

Comments 1
5 min read
๐—ช๐—ต๐—ฎ๐˜ ๐—œ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ฒ๐—ฑ ๐—ณ๐—ฟ๐—ผ๐—บ ๐—–๐—ต๐—ฎ๐—ฝ๐˜๐—ฒ๐—ฟ ๐Ÿฎ ๐—ผ๐—ณ ๐—”๐—œ ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ๐—ถ๐—ป๐—ด: ๐—ช๐—ต๐˜† ๐—ฆ๐—ฎ๐—บ๐—ฝ๐—น๐—ถ๐—ป๐—ด ๐—–๐—ต๐—ฎ๐—ป๐—ด๐—ฒ๐˜€ ๐—˜๐˜ƒ๐—ฒ๐—ฟ๐˜†๐˜๐—ต๐—ถ๐—ป๐—ด

๐—ช๐—ต๐—ฎ๐˜ ๐—œ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ฒ๐—ฑ ๐—ณ๐—ฟ๐—ผ๐—บ ๐—–๐—ต๐—ฎ๐—ฝ๐˜๐—ฒ๐—ฟ ๐Ÿฎ ๐—ผ๐—ณ ๐—”๐—œ ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ๐—ถ๐—ป๐—ด: ๐—ช๐—ต๐˜† ๐—ฆ๐—ฎ๐—บ๐—ฝ๐—น๐—ถ๐—ป๐—ด ๐—–๐—ต๐—ฎ๐—ป๐—ด๐—ฒ๐˜€ ๐—˜๐˜ƒ๐—ฒ๐—ฟ๐˜†๐˜๐—ต๐—ถ๐—ป๐—ด

1
Comments
4 min read
Blog 2: Momentum-Based Optimizers

Blog 2: Momentum-Based Optimizers

Comments
6 min read
Blog 1: Foundations of Gradient Descent

Blog 1: Foundations of Gradient Descent

Comments
5 min read
Empirical Research in Machine Learning Ended Mathโ€™s Monopoly

Empirical Research in Machine Learning Ended Mathโ€™s Monopoly

Comments
9 min read
I Built the World's First AI Knowledge Arena โ€” Battle Other Devs on ML & Deep Learning

I Built the World's First AI Knowledge Arena โ€” Battle Other Devs on ML & Deep Learning

Comments
3 min read
Policy Gradients: REINFORCE from Scratch with NumPy
Cover image for Policy Gradients: REINFORCE from Scratch with NumPy

Policy Gradients: REINFORCE from Scratch with NumPy

Comments
16 min read
๐Ÿ‘‹ Sign in for the ability to sort posts by relevant, latest, or top.