Forem

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

Comments
4 min read
LLMs achieve adult human performance on higher-order theory of mind tasks
Cover image for LLMs achieve adult human performance on higher-order theory of mind tasks

LLMs achieve adult human performance on higher-order theory of mind tasks

Comments
4 min read
Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach
Cover image for Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Comments
5 min read
Large Language Models Can Self-Improve At Web Agent Tasks

Large Language Models Can Self-Improve At Web Agent Tasks

Comments
4 min read
Executable Code Actions Elicit Better LLM Agents
Cover image for Executable Code Actions Elicit Better LLM Agents

Executable Code Actions Elicit Better LLM Agents

Comments
4 min read
Sparse maximal update parameterization: A holistic approach to sparse training dynamics

Sparse maximal update parameterization: A holistic approach to sparse training dynamics

Comments
5 min read
Training-Free Long-Context Scaling of Large Language Models
Cover image for Training-Free Long-Context Scaling of Large Language Models

Training-Free Long-Context Scaling of Large Language Models

Comments
4 min read
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

1
Comments
4 min read
Grokfast: Accelerated Grokking by Amplifying Slow Gradients
Cover image for Grokfast: Accelerated Grokking by Amplifying Slow Gradients

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

1
Comments
4 min read
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations
Cover image for Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations

Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations

Comments
4 min read
The Road Less Scheduled
Cover image for The Road Less Scheduled

The Road Less Scheduled

Comments
4 min read
You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism
Cover image for You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

Comments
4 min read
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

Comments
4 min read
NPGA: Neural Parametric Gaussian Avatars
Cover image for NPGA: Neural Parametric Gaussian Avatars

NPGA: Neural Parametric Gaussian Avatars

2
Comments
3 min read
gzip Predicts Data-dependent Scaling Laws
Cover image for gzip Predicts Data-dependent Scaling Laws

gzip Predicts Data-dependent Scaling Laws

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.