Forem

Machine Learning

A branch of artificial intelligence (AI) and computer science that uses data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Poisoning Web-Scale Training Datasets is Practical

3 min read
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

4 min read
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

4 min read
Prompt Design and Engineering: Introduction and Advanced Methods

4 min read
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

3 min read
A Simple and Effective Pruning Approach for Large Language Models

3 min read
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

4 min read
Circuit Component Reuse Across Tasks in Transformer Language Models

4 min read
R-Tuning: Instructing Large Language Models to Say 'I Don't Know'

4 min read
PopulAtion Parameter Averaging (PAPA)

3 min read
FLAME: Factuality-Aware Alignment for Large Language Models

4 min read
SAR image matching algorithm based on multi-class features

4 min read
Porting HPC Applications to AMD Instinct™ MI300A Using Unified Memory and OpenMP

4 min read
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

4 min read
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding

4 min read