Forem

# llm

Posts

I Tried Building a Whiteboard App with Claude 4.5 Sonnet
Comments
2 min read
Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.
Comments
5 min read
AI-Powered Resume & Job Description Matching with RAG
Comments
1 min read
LLPY-14: Evaluation and Quality Metrics - Measuring RAG Success
Comments
12 min read
Train it or feed it? Teaching LLMs your data the smart way
Comments
4 min read
Agent Optimization: Why Context Engineering Isn’t Enough
Comments
5 min read
Understanding RAG: How AI Models Learn to Search Before They Speak
1
Comments
3 min read
AI Security Tools Find Critical curl Vulnerabilities
Comments
9 min read
How to Connect Salesforce to OpenAI Agent Builder
9
Comments 1
7 min read
Why Claude Code's Unix Philosophy Beats Other AI Assistants
Comments
8 min read
AutoAgents – a Rust-Based Multi-Agent Framework for LLM-Powered Intelligence
8
Comments
1 min read
Step-by-Step: Manual vLLM Setup on Google Cloud L4 (Debian)
Comments
2 min read
🧩 Runtime Snapshots #3 — QA That Speaks JSON
7
Comments
1 min read
Gemini 2.5 Flash-Lite: Speed > Scale — 887 TPS, 50% Less Verbosity, Real-World Wins
Comments
1 min read
Amazon Bedrock AgentCore Runtime - Part 7 Using AgentCore long-term Memory with Strands Agents SDK
3
Comments
13 min read
How to Build Developer Trust in AI‑Powered Code Generation Through Data‑Driven Feedback and Evaluation
1
Comments 1
8 min read
How to Improve Cross-Lingual Retrieval Accuracy in Bilingual RAG Chatbots
1
Comments 1
9 min read
Granite 4: IBM introduces a line of small but fast LLMs
Comments
2 min read
From Query Understanding to Retrieval: Evaluating Rewriting, Filters, and Routing With Online Evals
4
Comments
12 min read
OpenAI's SORA 2 Release Pattern: What It Means for AI Video
Comments
9 min read
Ten Failure Modes of RAG Nobody Talks About (And How to Detect Them Systematically)
5
Comments
10 min read
The RAG Debugging Playbook: A Step-by-Step Guide to Trace-Level Failures and Fixes
1
Comments
10 min read
LLMs & Agents Every Developer Should Know
4
Comments
5 min read
Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025
2
Comments
10 min read
It's Time to Write Docs for Machines First, Then for Humans.
1
Comments
3 min read