Forem

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Computer Vision Meetup: Lessons Learned fine-tuning Llama2 for Autonomous Agents
Cover image for Computer Vision Meetup: Lessons Learned fine-tuning Llama2 for Autonomous Agents
31:03

Computer Vision Meetup: Lessons Learned fine-tuning Llama2 for Autonomous Agents

2
Comments
1 min read
S-LoRA: Serving Thousands of Concurrent LoRA Adapters

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Comments
4 min read
WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning
Cover image for WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning

WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning

1
Comments
3 min read
China's new Sora rival is here
Cover image for China's new Sora rival is here

China's new Sora rival is here

1
Comments
1 min read
From sticks and levers to worlds and chasms

From sticks and levers to worlds and chasms

1
Comments
2 min read
Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Comments
4 min read
Gated Linear Attention Transformers with Hardware-Efficient Training
Cover image for Gated Linear Attention Transformers with Hardware-Efficient Training

Gated Linear Attention Transformers with Hardware-Efficient Training

Comments
4 min read
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification
Cover image for InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

Comments
3 min read
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
Cover image for Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Comments
3 min read
Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne
Cover image for Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne
09:25

Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne

Comments
1 min read
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

Comments
5 min read
To Believe or Not to Believe Your LLM
Cover image for To Believe or Not to Believe Your LLM

To Believe or Not to Believe Your LLM

2
Comments
4 min read
ReGAL: Refactoring Programs to Discover Generalizable Abstractions
Cover image for ReGAL: Refactoring Programs to Discover Generalizable Abstractions

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Comments
4 min read
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Cover image for GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

1
Comments
4 min read
LLMs cannot find reasoning errors, but can correct them given the error location

LLMs cannot find reasoning errors, but can correct them given the error location

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.