Forem

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Impact of Depth on Compositional Generalization in Transformer Language Models

The Impact of Depth on Compositional Generalization in Transformer Language Models

5
Comments
4 min read
The Expressive Power of Transformers with Chain of Thought

The Expressive Power of Transformers with Chain of Thought

5
Comments
4 min read
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications
Cover image for GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

6
Comments
4 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

5
Comments
4 min read
ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past
Cover image for ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

5
Comments
4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Cover image for JetMoE: Reaching Llama2 Performance with 0.1M Dollars

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies
Cover image for Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

5
Comments
3 min read
Vision Transformers Need Registers

Vision Transformers Need Registers

5
Comments
4 min read
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Cover image for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

5
Comments
3 min read
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Cover image for InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

5
Comments
3 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Cover image for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

Generalization in diffusion models arises from geometry-adaptive harmonic representations

5
Comments
4 min read
Rho-1: Not All Tokens Are What You Need
Cover image for Rho-1: Not All Tokens Are What You Need

Rho-1: Not All Tokens Are What You Need

5
Comments
4 min read
CodecLM: Aligning Language Models with Tailored Synthetic Data
Cover image for CodecLM: Aligning Language Models with Tailored Synthetic Data

CodecLM: Aligning Language Models with Tailored Synthetic Data

6
Comments
4 min read
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists
Cover image for Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

6
Comments
4 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping
Cover image for Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

5
Comments
4 min read
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
Cover image for SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

5
Comments
4 min read
Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8
Cover image for Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

Comments
6 min read
Getting Started with Gemma Models
Cover image for Getting Started with Gemma Models

Getting Started with Gemma Models

22
Comments
5 min read
Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality
Cover image for Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality

Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality

1
Comments
4 min read
CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss
Cover image for CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

1
Comments
9 min read
Exploring the Top AI Blogs of 2024: Illuminating Insights into Artificial Intelligence
Cover image for Exploring the Top AI Blogs of 2024: Illuminating Insights into Artificial Intelligence

Exploring the Top AI Blogs of 2024: Illuminating Insights into Artificial Intelligence

Comments 1
3 min read
Developer’s Guide : Modular, Flexible, Scalable Prod ready RAG
Cover image for Developer’s Guide : Modular, Flexible, Scalable Prod ready RAG

Developer’s Guide : Modular, Flexible, Scalable Prod ready RAG

Comments 1
2 min read
Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis
Cover image for Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis

Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis

7
Comments 1
3 min read
การแสดง Multiple Linear Regression เป็นกราฟ 3 มิติ โดยใช้ Python

การแสดง Multiple Linear Regression เป็นกราฟ 3 มิติ โดยใช้ Python

Comments
2 min read
loading...