
# llm

Posts

Building a Provider-Agnostic LLM Abstraction Layer: Benchmarking OpenAI, Gemini, Groq, DeepSeek and Ollama
Comments · 6 min read
The 24GB AI Lab: A Survival Guide to Full-Stack Local AI on Consumer Hardware
Comments · 4 min read
I Built an Entity Consistency Audit Pipeline for GEO — Here's What I Found
Comments · 5 min read
🧠 Stop Letting Your AI Forget: MemPalace is a Wake-Up Call
Comments · 2 min read
Type-safe LLM prompts in Rust: catching prompt bugs before they happen
Comments · 3 min read
Query Rewrite in RAG Systems: Why It Matters and How It Works
5 comments · 4 min read
Re-evaluating the ROI of GLM-5.1 Pro After a Massive Price Hike to $680
2 comments · 1 min read
AI Can Lie. And You Can't Tell.
Comments · 4 min read
From Single-Agent to Multi-Agent: Designing and Deploying an Enterprise-Grade Intelligent Customer Service System with LangGraph
Comments · 10 min read
Engineering GraphRAG for Production: API Design, Query Optimization, and Service Reliability
Comments · 6 min read
Reducing LLM Cost and Latency Using Semantic Caching
3 comments · 5 min read
I caught Claude Sonnet 4 inventing facts about a fake tool
Comments · 9 min read
Claude Designed Its Own Rule System — A Public Experiment
1 comment · 4 min read
Qwen3.5 Running Locally: Super Fast and with Great Quality
Comments · 2 min read
The Great LLM Inference Engine Showdown: vLLM vs TGI vs TensorRT-LLM vs SGLang vs llama.cpp vs Ollama
Comments · 10 min read