Forem

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Comments
4 min read
Ephemeral Rollups are All you Need

Ephemeral Rollups are All you Need

Comments
3 min read
Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks
Cover image for Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

1
Comments
3 min read
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Cover image for Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Comments
4 min read
Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM
Cover image for Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM

Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM

Comments
4 min read
Attention as an RNN

Attention as an RNN

Comments
4 min read
Thermodynamic Natural Gradient Descent

Thermodynamic Natural Gradient Descent

Comments
5 min read
Transformers Can Do Arithmetic with the Right Embeddings
Cover image for Transformers Can Do Arithmetic with the Right Embeddings

Transformers Can Do Arithmetic with the Right Embeddings

Comments
4 min read
Pareto Optimal Learning for Estimating Large Language Model Errors
Cover image for Pareto Optimal Learning for Estimating Large Language Model Errors

Pareto Optimal Learning for Estimating Large Language Model Errors

Comments
4 min read
ColorFoil: Investigating Color Blindness in Large Vision and Language Models
Cover image for ColorFoil: Investigating Color Blindness in Large Vision and Language Models

ColorFoil: Investigating Color Blindness in Large Vision and Language Models

Comments
4 min read
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Cover image for Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Comments
3 min read
The CAP Principle for LLM Serving
Cover image for The CAP Principle for LLM Serving

The CAP Principle for LLM Serving

Comments
4 min read
Why are Sensitive Functions Hard for Transformers?

Why are Sensitive Functions Hard for Transformers?

Comments
4 min read
InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

Comments
4 min read
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation
Cover image for ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

Comments
4 min read
Training Language Models to Generate Text with Citations via Fine-grained Rewards
Cover image for Training Language Models to Generate Text with Citations via Fine-grained Rewards

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Comments
3 min read
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness
Cover image for Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Comments
4 min read
Self-playing Adversarial Language Game Enhances LLM Reasoning
Cover image for Self-playing Adversarial Language Game Enhances LLM Reasoning

Self-playing Adversarial Language Game Enhances LLM Reasoning

Comments
4 min read
Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!
Cover image for Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!

Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!

Comments
5 min read
A Declarative System for Optimizing AI Workloads

A Declarative System for Optimizing AI Workloads

Comments
4 min read
Representation noising effectively prevents harmful fine-tuning on LLMs

Representation noising effectively prevents harmful fine-tuning on LLMs

Comments
5 min read
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

Comments
4 min read
Fractal Patterns May Illuminate the Success of Next-Token Prediction
Cover image for Fractal Patterns May Illuminate the Success of Next-Token Prediction

Fractal Patterns May Illuminate the Success of Next-Token Prediction

Comments
5 min read
As an AI Language Model, Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

As an AI Language Model, Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

Comments
4 min read
TimeGPT-1

TimeGPT-1

Comments
4 min read
loading...