Forem

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Cover image for Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

1
Comments
4 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs
Cover image for CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

Comments
3 min read
Manipulating Large Language Models to Increase Product Visibility

Manipulating Large Language Models to Increase Product Visibility

Comments
3 min read
Dataset Reset Policy Optimization for RLHF
Cover image for Dataset Reset Policy Optimization for RLHF

Dataset Reset Policy Optimization for RLHF

Comments
4 min read
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Comments
4 min read
H2O-Danube-1.8B Technical Report

H2O-Danube-1.8B Technical Report

Comments
4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Comments
4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Cover image for Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
TransformerFAM: Feedback attention is working memory
Cover image for TransformerFAM: Feedback attention is working memory

TransformerFAM: Feedback attention is working memory

Comments
4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
Beginner's Guide to Math's for Machine Learning

Beginner's Guide to Math's for Machine Learning

2
Comments 1
2 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation
Cover image for SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
Insights from the Leading EDJE Round Table Discussion on AI: Bridging the Gap Between Virtual and Physical Spaces

Insights from the Leading EDJE Round Table Discussion on AI: Bridging the Gap Between Virtual and Physical Spaces

2
Comments
3 min read
Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals
Cover image for Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals

Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals

1
Comments
3 min read
Meeting Minutes Made Easy: Summarize Key Points and Send Emails using Lyzr-Automata
Cover image for Meeting Minutes Made Easy: Summarize Key Points and Send Emails using Lyzr-Automata

Meeting Minutes Made Easy: Summarize Key Points and Send Emails using Lyzr-Automata

Comments
3 min read
Zero-Shot Prediction Plugin for FiftyOne
Cover image for Zero-Shot Prediction Plugin for FiftyOne

Zero-Shot Prediction Plugin for FiftyOne

Comments
8 min read
Can Facial Recognition Be Trusted for Immigration Control?
Cover image for Can Facial Recognition Be Trusted for Immigration Control?

Can Facial Recognition Be Trusted for Immigration Control?

1
Comments
1 min read
Cognita: An Open-Source Framework for Enhanced RAG Applications

Cognita: An Open-Source Framework for Enhanced RAG Applications

2
Comments
3 min read
RAG Redefined : Ready-to-Deploy RAG for Organizations at Scale.
Cover image for RAG Redefined : Ready-to-Deploy RAG for Organizations at Scale.

RAG Redefined : Ready-to-Deploy RAG for Organizations at Scale.

1
Comments 2
1 min read
Day 1 of 30 : Machine Learning

Day 1 of 30 : Machine Learning

11
Comments 6
2 min read
AI is Not Going to Steal Your Keyboard (Unless You Let Them Write the Code)
Cover image for AI is Not Going to Steal Your Keyboard (Unless You Let Them Write the Code)

AI is Not Going to Steal Your Keyboard (Unless You Let Them Write the Code)

1
Comments
2 min read
Get Hired Faster: How to use Lyzr-Automata to draft personalised cold emails
Cover image for Get Hired Faster: How to use Lyzr-Automata to draft personalised cold emails

Get Hired Faster: How to use Lyzr-Automata to draft personalised cold emails

1
Comments
4 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

5
Comments
4 min read
loading...