Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Refact.ai AI Agent is the new #1 on Aider’s polyglot benchmark with a score of 76.4%(a better measure than SWE bench?)
Cover image for Refact.ai AI Agent is the new #1 on Aider’s polyglot benchmark with a score of 76.4%(a better measure than SWE bench?)

Refact.ai AI Agent is the new #1 on Aider’s polyglot benchmark with a score of 76.4%(a better measure than SWE bench?)

2
Comments
4 min read
Polish Large Language Model (PLLuM) on Google Cloud
Cover image for Polish Large Language Model (PLLuM) on Google Cloud

Polish Large Language Model (PLLuM) on Google Cloud

Comments
1 min read
AI, LocalLLM and DeepSeek. What is it all about and how to dive into the world of LLM?

AI, LocalLLM and DeepSeek. What is it all about and how to dive into the world of LLM?

Comments
6 min read
LLM Inference GPU Video RAM Calculator
Cover image for LLM Inference GPU Video RAM Calculator

LLM Inference GPU Video RAM Calculator

2
Comments
2 min read
I developed a local Salesforce LLM Assistant that runs on your computer
Cover image for I developed a local Salesforce LLM Assistant that runs on your computer

I developed a local Salesforce LLM Assistant that runs on your computer

Comments
7 min read
What is Alpaca LLM?

What is Alpaca LLM?

4
Comments
4 min read
Understand data science before using LLM into your AI agents

Understand data science before using LLM into your AI agents

Comments
1 min read
LCM vs. LLM

LCM vs. LLM

11
Comments
3 min read
Overview: "Understanding LLMs: From Training to Inference"

Overview: "Understanding LLMs: From Training to Inference"

1
Comments
4 min read
What is MCP 💬 ? (Model Context Protocol) - A Primer ✅
Cover image for What is MCP 💬 ? (Model Context Protocol) - A Primer ✅

What is MCP 💬 ? (Model Context Protocol) - A Primer ✅

8
Comments 3
4 min read
Gemini 2.0 Flash: Unleashing Native Image Generation - A Tech Deep Dive
Cover image for Gemini 2.0 Flash: Unleashing Native Image Generation - A Tech Deep Dive

Gemini 2.0 Flash: Unleashing Native Image Generation - A Tech Deep Dive

2
Comments
10 min read
Born from the Quantization of Generative AI: I Released a Python Library That Quantizes Progress Bars [Zero Practicality]
Cover image for Born from the Quantization of Generative AI: I Released a Python Library That Quantizes Progress Bars [Zero Practicality]

Born from the Quantization of Generative AI: I Released a Python Library That Quantizes Progress Bars [Zero Practicality]

Comments
4 min read
LLM Model Selection Made Easy: The Most Useful Leaderboards for Real-World Applications

LLM Model Selection Made Easy: The Most Useful Leaderboards for Real-World Applications

10
Comments
4 min read
Google Gemma 3 Unlocked: The 128K-Token Multimodal AI Breakthrough Every Developer Must Explore
Cover image for Google Gemma 3 Unlocked: The 128K-Token Multimodal AI Breakthrough Every Developer Must Explore

Google Gemma 3 Unlocked: The 128K-Token Multimodal AI Breakthrough Every Developer Must Explore

9
Comments
5 min read
Understanding CAG (Cache Augmented Generation): AI's Conversation Memory With APIpie.ai

Understanding CAG (Cache Augmented Generation): AI's Conversation Memory With APIpie.ai

1
Comments
8 min read
Build Your Own AI Chatbot: A Complete Guide to Local Deployment with ServBay, Python, and ChromaDB

Build Your Own AI Chatbot: A Complete Guide to Local Deployment with ServBay, Python, and ChromaDB

6
Comments
9 min read
Model Context Protocol (MCP): Bridging LLM applications with external data sources and tools
Cover image for Model Context Protocol (MCP): Bridging LLM applications with external data sources and tools

Model Context Protocol (MCP): Bridging LLM applications with external data sources and tools

7
Comments
3 min read
Top Open-Source LLMs for Web Developers 🚀💡

Top Open-Source LLMs for Web Developers 🚀💡

Comments
1 min read
Best Open Source LLM

Best Open Source LLM

3
Comments
3 min read
Deploy and Use Open-Source AI Models Locally with Ollama: No Payment and Dev Skills Required

Deploy and Use Open-Source AI Models Locally with Ollama: No Payment and Dev Skills Required

6
Comments
4 min read
LLM vs NLP

LLM vs NLP

2
Comments
3 min read
GPT for Word. Use Gemma 3 (27B) for Summarization in Microsoft Word (100% Private).

GPT for Word. Use Gemma 3 (27B) for Summarization in Microsoft Word (100% Private).

1
Comments
1 min read
Alibaba Cloud AI Search Solution Explained: Intelligent Search Driven by Large Language Models, Helping Enterprises in Digital
Cover image for Alibaba Cloud AI Search Solution Explained: Intelligent Search Driven by Large Language Models, Helping Enterprises in Digital

Alibaba Cloud AI Search Solution Explained: Intelligent Search Driven by Large Language Models, Helping Enterprises in Digital

Comments
9 min read
Multi-Agent Hybrid Knowledge Base Retrieval: Building a High-Precision Legal Case Analysis Platform

Multi-Agent Hybrid Knowledge Base Retrieval: Building a High-Precision Legal Case Analysis Platform

7
Comments 1
7 min read
SGLang: A Deep Dive into Efficient LLM Program Execution

SGLang: A Deep Dive into Efficient LLM Program Execution

5
Comments
3 min read
loading...