Forem

# llm

Posts

Submitting a Fine-Tuning Job: Organising the Workforce

5
Comments
2 min read
How to access Agent interface in watsonx.ai (Beta)

Comments
3 min read
Integrating Large Language Models in Production Applications

Comments
12 min read
How to call Bee Agent framework locally with a pre-built UI?

Comments
8 min read
Building Websites with Cursor and AWS.

Comments
5 min read
Some books I'm almost done reading and I consider them as good resources (Jan 2025)

Comments
2 min read
Unlock the Power of Docusaurus with AI

Comments
2 min read
Building Smarter, Smaller and Efficient AI Models: NVIDIA’s Minitron Approach

1
Comments
3 min read
Building a Multi-LLM Profanity Detector in C# using StepWise

Comments
3 min read
All You Need to Know About AI Agents: A Full Guide

Comments
8 min read
Try Multimodal Search with ColQwen2!

Comments
4 min read
How to chat with Local LLM in Obsidian

Comments
2 min read
Building a Coding Agent from Scratch with Llama 70B: Lessons Learned

Comments
3 min read
OpenAI Assistants with Structured Outputs

Comments
1 min read
LLM Evals—The Trap No One’s Telling You 🐔

14
Comments 1
1 min read
How to set simply all “sampling parameters” or “generation parameters” for applications using watsonx?

Comments
3 min read
Implementing Enterprise LLM Solutions: A Step-by-Step Guide

Comments
4 min read
Understanding RAG Workflow: Retrieval-Augmented Generation in Python

Comments
3 min read
Building a React.dev RAG chatbot using Vercel AI SDK

1
Comments
4 min read
Getting started with LLM APIs

9
Comments
16 min read
Moly: An Open-Source LLM Client Implemented in Pure Rust

Comments
5 min read
Why You Should Try a Local LLM Model—and How to Get Started

Comments
2 min read
2024 - Ultimate guide to LLM analysis using NLP standalone

Comments
4 min read
Using LangChain to Search Your Own PDF Documents

Comments
7 min read
Data Splitting: Breaking Down the Problem

Comments
3 min read
Why LLMs Fall Short: Why Large Language Models Aren't Ideal for AI Agent Applications

Comments
3 min read
Uploading Files to OpenAI: Passing the Baton

5
Comments
2 min read
Browser extension to summarize HN comments

Comments
2 min read
Why Function-Calling GenAI Must Be Built by AI, Not Manually Coded

Comments
5 min read
Understanding LLM Errors and Their Impact on AI-driven Applications

Comments
4 min read
The Dev Tools Evolution: LLMs, Wasm, and What's Next for 2025

Comments
3 min read
My Journey into Novel Creation Using Generative AI: Day 1

Comments
2 min read
Unlocking AI-Powered Conversations: Building a Retrieval-Augmented Generation (RAG) Chatbot

Comments
4 min read
Using ASTs to merge LLM generated snippets in to existing code files with surgical precision.

Comments
1 min read
Counting Tokens: Sorting Through the Details

Comments
1 min read
How I Used LLMs to Make IoT Devices Understand Any Language

Comments
1 min read
Counting the number of Tokens sent to a LLM in Go (part 2)

Comments
6 min read
To Fine-Tune or Not To Fine-Tune?

Comments
4 min read
Counting the number of Tokens sent to a LLM in Go (part 1)

1
Comments
5 min read
Getting Responses from Local LLM Models with Python

3
Comments
3 min read
I made wut – a CLI that explains your last command with an LLM

1
Comments
1 min read
Charting Your Unique Path in Generative AI: A Fresh Perspective for Beginners

Comments
3 min read
📉 Why Improving Your AI Model Is Killing Your Project’s Success

12
Comments
4 min read
AI Agents Architecture, Actors and Microservices: Let's Try LangGraph Command

1
Comments
4 min read
The Quest to Minimize False Positives Reaches Another Significant Milestone

Comments
4 min read
Calling LangChain from Go (Part 1)

Comments
5 min read
💬 How Intent-Driven Interfaces Will Transform the Way Users Interact with Software

12
Comments
4 min read
Building a Local AI Code Reviewer with ClientAI and Ollama - Part 2

Comments
5 min read
How Machines Hear and Understand Us

Comments
4 min read
Day 51: Containerization of LLM Applications

Comments
2 min read
Day 50: Building a REST API for LLM Inference

Comments
2 min read
DeepMind at Google: Denny Zhou

Comments
2 min read
Self-Correcting AI Agents: How to Build AI That Learns From Its Mistakes

Comments
5 min read
Mastering Real-Time AI: A Developer’s Guide to Building Streaming LLMs with FastAPI and Transformers

Comments
5 min read
Integrating LangChain with FastAPI for Asynchronous Streaming

Comments
3 min read
Building a Local AI Task Planner with ClientAI and Ollama

1
Comments
4 min read
Building an AI-powered Docker Solution with Llama and k8sGPT

4
Comments
3 min read
Browsing the web with AI - Perplexity vs Copilot vs AI agents

2
Comments
4 min read
DevFest Bandung 2024 Session 2: Large Language Model, Potentials and Limitations

Comments
5 min read
Introducing Humiris MoAI Basic : A New Way to Build Hybrid AI Models

Comments
5 min read