Forem

# vllm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
- vLLM Request Lifecycle (Where TTFT is measured) (2 min read)
- I Pushed Local LLMs Harder. Here's What Two Models Actually Did. (8 min read)
- The Ghost in the Batch: How vLLM Silently Switches Algorithms (5 min read)
- Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs (11 min read)
- vLLM — Session 2: The Engine Layer — Request Management (13 min read)
- Session 1: vLLM Overview and the User API (12 min read)
- Stop Playing with Local LLMs: Take Open-Source GenAI to Production on Magalu Cloud (3 comments, 22 min read)
- Running Claude Code with Local LLMs via vLLM and LiteLLM (6 min read)
- The Hidden Switchboard Behind vLLM Attention (10 min read)
- The Ultimate LLM Inference Battle: vLLM vs. Ollama vs. ZML (6 min read)
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.