Forem

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

Open-Weight AI for High-Quality Image Generation & Editing

Comments
4 min read
Surgical Precision with AI: A New Era in Lung Cancer Staging

Comments
2 min read
Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Comments
2 min read
Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Comments
2 min read
Introducing PQNT — A New Power-Law Quantization Method

Comments
1 min read
How Search Engines Actually Answer Your Questions

Comments
11 min read
Giving AI Eyes: Multi-Modal LLMs

Comments
9 min read
BATCHNORM IN LANGUAGE MODELS

Comments
16 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data

Comments
3 min read
Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Comments
2 min read
Linear Algebra for AI

1
Comments
2 min read
Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Comments
1 min read
AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

1
Comments 2
3 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments
2 min read
Attention Mechanism in Transformers: The Core Idea Behind Modern AI

5
Comments
2 min read
Transformers and Attention: How LLMs Actually Process Text

4
Comments
18 min read
Star Multi-Class Classification Neural Network With Pytorch

Comments
12 min read
A cleaner, safer, plug-and-play NanoGPT

Comments
1 min read
LANGUAGE MODELS USING MLP (Part 1)

Comments
15 min read
Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

5
Comments
5 min read
Decoder-Only Transformers: The Architecture Behind GPT Models

Comments
5 min read
Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Comments
4 min read