Forem

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
How Far Are We From AGI
Cover image for How Far Are We From AGI

How Far Are We From AGI

1
Comments
4 min read
HMT: Hierarchical Memory Transformer for Long Context Language Processing
Cover image for HMT: Hierarchical Memory Transformer for Long Context Language Processing

HMT: Hierarchical Memory Transformer for Long Context Language Processing

Comments
4 min read
MarkLLM: An Open-Source Toolkit for LLM Watermarking
Cover image for MarkLLM: An Open-Source Toolkit for LLM Watermarking

MarkLLM: An Open-Source Toolkit for LLM Watermarking

2
Comments
5 min read
GDPR: Is it worth it? Perceptions of workers who have experienced its implementation

GDPR: Is it worth it? Perceptions of workers who have experienced its implementation

Comments
4 min read
Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

Comments
4 min read
Distinguishing Tor From Other Encrypted Network Traffic Through Character Analysis
Cover image for Distinguishing Tor From Other Encrypted Network Traffic Through Character Analysis

Distinguishing Tor From Other Encrypted Network Traffic Through Character Analysis

Comments
3 min read
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Comments
4 min read
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Cover image for Chameleon: Mixed-Modal Early-Fusion Foundation Models

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Comments
4 min read
Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption
Cover image for Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption

Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption

Comments
4 min read
VILA: On Pre-training for Visual Language Models
Cover image for VILA: On Pre-training for Visual Language Models

VILA: On Pre-training for Visual Language Models

Comments
4 min read
MOMENT: A Family of Open Time-series Foundation Models

MOMENT: A Family of Open Time-series Foundation Models

Comments
4 min read
NASU -- Novel Actuating Screw Unit: Origami-inspired Screw-based Propulsion on Mobile Ground Robots

NASU -- Novel Actuating Screw Unit: Origami-inspired Screw-based Propulsion on Mobile Ground Robots

Comments
4 min read
A Spectral Condition for Feature Learning

A Spectral Condition for Feature Learning

Comments
4 min read
Sporthesia: Augmenting Sports Videos Using Natural Language

Sporthesia: Augmenting Sports Videos Using Natural Language

Comments
4 min read
SqueezeSAM: User friendly mobile interactive segmentation
Cover image for SqueezeSAM: User friendly mobile interactive segmentation

SqueezeSAM: User friendly mobile interactive segmentation

Comments
3 min read
Multimodal Chain-of-Thought Reasoning in Language Models

Multimodal Chain-of-Thought Reasoning in Language Models

Comments
4 min read
On the Security Vulnerabilities of Text-to-SQL Models

On the Security Vulnerabilities of Text-to-SQL Models

Comments
3 min read
Thinking Tokens for Language Modeling

Thinking Tokens for Language Modeling

Comments
3 min read
GPT-4 passes most of the 297 written Polish Board Certification Examinations

GPT-4 passes most of the 297 written Polish Board Certification Examinations

Comments
3 min read
Observational Scaling Laws and the Predictability of Language Model Performance
Cover image for Observational Scaling Laws and the Predictability of Language Model Performance

Observational Scaling Laws and the Predictability of Language Model Performance

1
Comments
4 min read
An Analysis of Quantile Temporal-Difference Learning

An Analysis of Quantile Temporal-Difference Learning

1
Comments
4 min read
Training-Free Consistent Text-to-Image Generation

Training-Free Consistent Text-to-Image Generation

Comments
4 min read
Zero-Shot Tokenizer Transfer

Zero-Shot Tokenizer Transfer

Comments
4 min read
Hydragen: High-Throughput LLM Inference with Shared Prefixes

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Comments
4 min read
LoRA Learns Less and Forgets Less
Cover image for LoRA Learns Less and Forgets Less

LoRA Learns Less and Forgets Less

Comments
4 min read
loading...