Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
OTEL Observability with Langfuse for Strands Agents
Cover image for OTEL Observability with Langfuse for Strands Agents

OTEL Observability with Langfuse for Strands Agents

8
Comments
3 min read
How To Use LLMs: Retrieval-Augmented Generation (RAG Systems)

How To Use LLMs: Retrieval-Augmented Generation (RAG Systems)

2
Comments 2
5 min read
Introduction to MCP Servers and writing one in Python
Cover image for Introduction to MCP Servers and writing one in Python

Introduction to MCP Servers and writing one in Python

1
Comments
6 min read
Optimizing technical documentations for LLMs
Cover image for Optimizing technical documentations for LLMs

Optimizing technical documentations for LLMs

1
Comments
7 min read
Paper Notes - From Mind to Machine: The Rise of Manus AI as a Fully Autonomous Digital Agent

Paper Notes - From Mind to Machine: The Rise of Manus AI as a Fully Autonomous Digital Agent

Comments
3 min read
Ranger-mini: Open Model for Agentic tool use evaluation

Ranger-mini: Open Model for Agentic tool use evaluation

3
Comments
3 min read
Code Generation with ‘Graph RAG’, AstraDB and gpt-oss

Code Generation with ‘Graph RAG’, AstraDB and gpt-oss

2
Comments 1
18 min read
Benchmarking LLM Search APIs: Tavily vs Web Search Plus vs OpenAI Web Search
Cover image for Benchmarking LLM Search APIs: Tavily vs Web Search Plus vs OpenAI Web Search

Benchmarking LLM Search APIs: Tavily vs Web Search Plus vs OpenAI Web Search

Comments
4 min read
Online softmax by hand
Cover image for Online softmax by hand

Online softmax by hand

Comments
20 min read
Incompetence as a Virtue

Incompetence as a Virtue

Comments
2 min read
LLM Act
Cover image for LLM Act

LLM Act

Comments
20 min read
The Great LLM Benchmark Illusion: Why Your Enterprise AI Strategy Needs Real-World Testing
Cover image for The Great LLM Benchmark Illusion: Why Your Enterprise AI Strategy Needs Real-World Testing

The Great LLM Benchmark Illusion: Why Your Enterprise AI Strategy Needs Real-World Testing

Comments
4 min read
🛠️ Setting Up Fusio as an API Gateway for CRM & LLM Microservices

🛠️ Setting Up Fusio as an API Gateway for CRM & LLM Microservices

Comments
2 min read
The Lazy Genius Inside Your Chatbot: Meet MoD, the Art of Thinking Less but Smarter
Cover image for The Lazy Genius Inside Your Chatbot: Meet MoD, the Art of Thinking Less but Smarter

The Lazy Genius Inside Your Chatbot: Meet MoD, the Art of Thinking Less but Smarter

1
Comments
11 min read
Your First AI Agent: A Clear, Practical Path
Cover image for Your First AI Agent: A Clear, Practical Path

Your First AI Agent: A Clear, Practical Path

Comments 2
5 min read
Continuous AI: A Simple Introduction
Cover image for Continuous AI: A Simple Introduction

Continuous AI: A Simple Introduction

6
Comments 2
4 min read
Youtube Downloader - My first MCP Server

Youtube Downloader - My first MCP Server

Comments
3 min read
Building Production RAG in 2025: Lessons from 50+ Deployments
Cover image for Building Production RAG in 2025: Lessons from 50+ Deployments

Building Production RAG in 2025: Lessons from 50+ Deployments

1
Comments
2 min read
NLP vs LLM for Content Moderation: How to Choose the Right AI Approach
Cover image for NLP vs LLM for Content Moderation: How to Choose the Right AI Approach

NLP vs LLM for Content Moderation: How to Choose the Right AI Approach

10
Comments
8 min read
SwiGLU: The FFN Upgrade I Use to Get Free Performance

SwiGLU: The FFN Upgrade I Use to Get Free Performance

Comments
5 min read
Reproducible LLM Benchmarking: GPT-5 vs Grok-4 with Promptfoo
Cover image for Reproducible LLM Benchmarking: GPT-5 vs Grok-4 with Promptfoo

Reproducible LLM Benchmarking: GPT-5 vs Grok-4 with Promptfoo

7
Comments
19 min read
How to Summarize Huge Documents with LLMs: Beyond Token Limits and Basic Prompts

How to Summarize Huge Documents with LLMs: Beyond Token Limits and Basic Prompts

1
Comments
6 min read
Build context-aware AI apps using MCP
Cover image for Build context-aware AI apps using MCP

Build context-aware AI apps using MCP

Comments
1 min read
72% Faster AI Workflows: How Hybrid Prompt Chaining with Qwen Code and Gemini CLI Boosts Efficiency
Cover image for 72% Faster AI Workflows: How Hybrid Prompt Chaining with Qwen Code and Gemini CLI Boosts Efficiency

72% Faster AI Workflows: How Hybrid Prompt Chaining with Qwen Code and Gemini CLI Boosts Efficiency

Comments
7 min read
RAG Chatbot - MoviesGPT
Cover image for RAG Chatbot - MoviesGPT

RAG Chatbot - MoviesGPT

Comments
2 min read
loading...