Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Local LLM Hosting: Complete 2025 Guide - Ollama, vLLM, LocalAI, Jan, LM Studio & More

Local LLM Hosting: Complete 2025 Guide - Ollama, vLLM, LocalAI, Jan, LM Studio & More

1
Comments
19 min read
A Financial MCP server with multi-provider orchestration (Open Source)

A Financial MCP server with multi-provider orchestration (Open Source)

Comments
1 min read
How to Cut Your AI API Costs: Six Proven Strategies
Cover image for How to Cut Your AI API Costs: Six Proven Strategies

How to Cut Your AI API Costs: Six Proven Strategies

1
Comments
5 min read
Utilizing RAG Techniques for Improved AI Agent Performance

Utilizing RAG Techniques for Improved AI Agent Performance

Comments
8 min read
🧑‍🚀 Mission Accomplished: How an Engineer-Astronaut Prepared Meta’s CRAG Benchmark for Launch in Docker
Cover image for 🧑‍🚀 Mission Accomplished: How an Engineer-Astronaut Prepared Meta’s CRAG Benchmark for Launch in Docker

🧑‍🚀 Mission Accomplished: How an Engineer-Astronaut Prepared Meta’s CRAG Benchmark for Launch in Docker

Comments
3 min read
Rethinking Team Development in the Age of LLMs
Cover image for Rethinking Team Development in the Age of LLMs

Rethinking Team Development in the Age of LLMs

1
Comments 1
12 min read
AWSChallenge - Week 1
Cover image for AWSChallenge - Week 1

AWSChallenge - Week 1

3
Comments
4 min read
LLMs as Unreliable Narrators: Dealing with UUID Hallucination

LLMs as Unreliable Narrators: Dealing with UUID Hallucination

9
Comments
5 min read
How to Build Production-Ready RAG Systems (at Scale, with Low Latency & High Accuracy)
Cover image for How to Build Production-Ready RAG Systems (at Scale, with Low Latency & High Accuracy)

How to Build Production-Ready RAG Systems (at Scale, with Low Latency & High Accuracy)

Comments
5 min read
Supercharge Your LLMs: Turn Basic APIs into 3D AI Desktop Companions with Zero Code Change

Supercharge Your LLMs: Turn Basic APIs into 3D AI Desktop Companions with Zero Code Change

Comments
3 min read
Reverse Engineering the Response to "best smartphones" in Chatgpt 5.0

Reverse Engineering the Response to "best smartphones" in Chatgpt 5.0

Comments
4 min read
Title: LLMs for Your Business: Is it Better to Retrain the Brain or Give it an Open Book? (RAG vs. Fine-Tuning)
Cover image for Title: LLMs for Your Business: Is it Better to Retrain the Brain or Give it an Open Book? (RAG vs. Fine-Tuning)

Title: LLMs for Your Business: Is it Better to Retrain the Brain or Give it an Open Book? (RAG vs. Fine-Tuning)

Comments
4 min read
Understanding AI Language Models: Base, Chat, and Reasoning — A Beginners Guide
Cover image for Understanding AI Language Models: Base, Chat, and Reasoning — A Beginners Guide

Understanding AI Language Models: Base, Chat, and Reasoning — A Beginners Guide

Comments
4 min read
Development Trends and Architecture Evolution of AI Agents

Development Trends and Architecture Evolution of AI Agents

Comments
16 min read
Practical Guide to MCP (Model Context Protocol) in Python

Practical Guide to MCP (Model Context Protocol) in Python

Comments
5 min read
Synthetic Data Generation for AI Agent Testing: A Practical, Governance‑Aligned Playbook

Synthetic Data Generation for AI Agent Testing: A Practical, Governance‑Aligned Playbook

Comments
8 min read
Story Agent — Turning Simple Phrases into Powerful Mini-Stories

Story Agent — Turning Simple Phrases into Powerful Mini-Stories

Comments
1 min read
Agent Skeleton Framework: Building Domain-Specific AI Agents Through Configuration, Not Code
Cover image for Agent Skeleton Framework: Building Domain-Specific AI Agents Through Configuration, Not Code

Agent Skeleton Framework: Building Domain-Specific AI Agents Through Configuration, Not Code

1
Comments 2
8 min read
Development Musical Chairs
Cover image for Development Musical Chairs

Development Musical Chairs

1
Comments
3 min read
Building a Production-Ready Enterprise AI Assistant with RAG and Security Guardrails

Building a Production-Ready Enterprise AI Assistant with RAG and Security Guardrails

Comments
10 min read
The generative AI revolution promises productivity gains, but is it making us smarter or simply outsourcing our thinking?
Cover image for The generative AI revolution promises productivity gains, but is it making us smarter or simply outsourcing our thinking?

The generative AI revolution promises productivity gains, but is it making us smarter or simply outsourcing our thinking?

3
Comments 2
9 min read
Spring AI RAG, Demystified: From Toy Demos to Production-Grade Retrieval

Spring AI RAG, Demystified: From Toy Demos to Production-Grade Retrieval

Comments 1
12 min read
Part II : Building My First Large Language Model from Scratch

Part II : Building My First Large Language Model from Scratch

Comments
4 min read
Deep Dive into G-Eval: How LLMs Evaluate Themselves
Cover image for Deep Dive into G-Eval: How LLMs Evaluate Themselves

Deep Dive into G-Eval: How LLMs Evaluate Themselves

Comments 1
2 min read
2025 Complete Guide: How Alibaba Tongyi's UI-Ins Model Revolutionizes GUI Grounding and Automation

2025 Complete Guide: How Alibaba Tongyi's UI-Ins Model Revolutionizes GUI Grounding and Automation

Comments
8 min read
loading...