Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Why I Built USPL: Helping GPT Understand Multi-Table SQL

Why I Built USPL: Helping GPT Understand Multi-Table SQL

Comments 1
2 min read
A Deep Dive into Large Language Models (LLMs): Unpacking the Magic of AI Text Generation!

A Deep Dive into Large Language Models (LLMs): Unpacking the Magic of AI Text Generation!

Comments
5 min read
Bringing Function Calling to DeepSeek Models on SGLang

Bringing Function Calling to DeepSeek Models on SGLang

1
Comments
3 min read
A little Rust proxy for Ollama

A little Rust proxy for Ollama

Comments
2 min read
RAG Search with AWS Lambda and Bedrock

RAG Search with AWS Lambda and Bedrock

9
Comments 1
4 min read
Next-Gen AI Multi-Modal RAG with Text and Image Integration

Next-Gen AI Multi-Modal RAG with Text and Image Integration

Comments
5 min read
Build the Smartest AI Bot You’ve Ever Seen — A 7B Model + Web Search, Right on Your Laptop

Build the Smartest AI Bot You’ve Ever Seen — A 7B Model + Web Search, Right on Your Laptop

Comments
5 min read
From Zero to GenAI Cluster: Scalable Local LLMs with Docker, Kubernetes, and GPU Scheduling

From Zero to GenAI Cluster: Scalable Local LLMs with Docker, Kubernetes, and GPU Scheduling

1
Comments
4 min read
Word Counter – Analyze Your Text Instantly

Word Counter – Analyze Your Text Instantly

Comments
1 min read
Running llama3 in WSL2 using Docker in your PC 🐧🦙🐋

Running llama3 in WSL2 using Docker in your PC 🐧🦙🐋

1
Comments
3 min read
How to Deploy a LLM Locally and Make It Accessible from the Internet

How to Deploy a LLM Locally and Make It Accessible from the Internet

1
Comments
5 min read
Active MCP: Integrating Model Context Protocol with Rails

Active MCP: Integrating Model Context Protocol with Rails

Comments
4 min read
Synonymic Query Expansion for Smarter Search

Synonymic Query Expansion for Smarter Search

1
Comments
3 min read
Stop Wasting Time Formatting Resumes – Automate It! 🚀

Stop Wasting Time Formatting Resumes – Automate It! 🚀

1
Comments
1 min read
DeepSeek-V3 vs Claude 3.5 Sonnet: Which AI Model Actually Delivers?

DeepSeek-V3 vs Claude 3.5 Sonnet: Which AI Model Actually Delivers?

Comments 1
3 min read
Comparing OCR Capabilities in Amazon Bedrock LLMs: Claude 3.7 Sonnet vs. Amazon Nova Pro

Comparing OCR Capabilities in Amazon Bedrock LLMs: Claude 3.7 Sonnet vs. Amazon Nova Pro

2
Comments
7 min read
🚀 Understanding ML Ops, LLM Ops, and Agent Ops: Key Differences and Why They Matter

🚀 Understanding ML Ops, LLM Ops, and Agent Ops: Key Differences and Why They Matter

Comments
3 min read
Revolutionizing AI Agents: Letta – The Open-Source Framework You Need!

Revolutionizing AI Agents: Letta – The Open-Source Framework You Need!

2
Comments 2
3 min read
Power up your RAG chatbot with Snowflake Cortex Search Boosts and Decays

Power up your RAG chatbot with Snowflake Cortex Search Boosts and Decays

2
Comments
7 min read
🚀 Publishing to Notion with MCP: A Seamless Integration Guide

🚀 Publishing to Notion with MCP: A Seamless Integration Guide

1
Comments
2 min read
Forget the Hype: Agents are Loops

Forget the Hype: Agents are Loops

5
Comments
5 min read
Vector Databases: their utility and functioning (RAG usage)

Vector Databases: their utility and functioning (RAG usage)

Comments
12 min read
Handling rate limits of OpenAI models in Java using Guava, JTokkit

Handling rate limits of OpenAI models in Java using Guava, JTokkit

2
Comments
4 min read
Eww Sigma Devs Don’t Write Commits

Eww Sigma Devs Don’t Write Commits

1
Comments
2 min read
AI Agents: how they work and how to build them

AI Agents: how they work and how to build them

46
Comments 7
26 min read
loading...