Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
TPU vs GPU: Real-World Performance Testing for LLM Training on Google Cloud
Cover image for TPU vs GPU: Real-World Performance Testing for LLM Training on Google Cloud

TPU vs GPU: Real-World Performance Testing for LLM Training on Google Cloud

Comments
7 min read
Tame Your LLMs: A New Optimizer for Robust Deep Learning

Tame Your LLMs: A New Optimizer for Robust Deep Learning

Comments
2 min read
🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware
Cover image for 🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware

🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware

1
Comments
3 min read
How AI Sees Images: A Gentle Introduction to Convolutional Neural Networks
Cover image for How AI Sees Images: A Gentle Introduction to Convolutional Neural Networks

How AI Sees Images: A Gentle Introduction to Convolutional Neural Networks

5
Comments
3 min read
Surgical Precision with AI: A New Era in Lung Cancer Staging

Surgical Precision with AI: A New Era in Lung Cancer Staging

Comments
2 min read
Deep Maze Solver

Deep Maze Solver

Comments
3 min read
Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Comments
2 min read
Understanding Transformer Model Types: The Evolution from RNN to Modern AI
Cover image for Understanding Transformer Model Types: The Evolution from RNN to Modern AI

Understanding Transformer Model Types: The Evolution from RNN to Modern AI

1
Comments
6 min read
Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Comments
2 min read
Developing a Variational Autoencoder in JAX using Antigravity

Developing a Variational Autoencoder in JAX using Antigravity

Comments
6 min read
Introducing PQNT — A New Power-Law Quantization Method
Cover image for Introducing PQNT — A New Power-Law Quantization Method

Introducing PQNT — A New Power-Law Quantization Method

Comments
1 min read
Unveiling the Hidden Geometry That Supercharges Neural Nets

Unveiling the Hidden Geometry That Supercharges Neural Nets

Comments
2 min read
How Search Engines Actually Answer Your Questions
Cover image for How Search Engines Actually Answer Your Questions

How Search Engines Actually Answer Your Questions

Comments
11 min read
BATCHNORM IN LANGUAGE MODELS
Cover image for BATCHNORM IN LANGUAGE MODELS

BATCHNORM IN LANGUAGE MODELS

Comments
16 min read
Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.