Forem

Artificial Intelligence

Artificial intelligence leverages computers and machines to mimic the problem-solving and decision-making capabilities found in humans and in nature.

Posts

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
5 min read
LLMs cannot find reasoning errors, but can correct them given the error location
5 min read
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
4 min read
Deep Learning for Camera Calibration and Beyond: A Survey
4 min read
The Geometry of Categorical and Hierarchical Concepts in Large Language Models
4 min read
REBUS: A Robust Evaluation Benchmark of Understanding Symbols
4 min read
CSS Animations Made EZ
1 min read
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings
4 min read
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?
4 min read
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
5 min read
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
3 min read
Weekly Updates - June 7, 2024
1 min read
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
4 min read
To Believe or Not to Believe Your LLM
4 min read
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
3 min read
Do Llamas Work in English? On the Latent Language of Multilingual Transformers
4 min read
LLark: A Multimodal Instruction-Following Language Model for Music
4 min read
Empirical influence functions to understand the logic of fine-tuning
5 min read
Evaluating Quantized Large Language Models
5 min read
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
4 min read
ChatDev: Communicative Agents for Software Development
3 min read
SqueezeLLM: Dense-and-Sparse Quantization
4 min read
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
4 min read
Harvard Undergraduate Survey on Generative AI
3 min read
RAFT: Adapting Language Model to Domain Specific RAG
4 min read