Forem

# mlops

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Quantising event-camera networks to run under 1MB on a Cortex-M7

Quantising event-camera networks to run under 1MB on a Cortex-M7

Comments
4 min read
Building a Production-Grade MLOps Home Lab on Windows — K8s, LLM, RAG & GitLab CI

Building a Production-Grade MLOps Home Lab on Windows — K8s, LLM, RAG & GitLab CI

Comments
8 min read
llm-nano-vm v0.8.0 — deterministic FSM runtime for LLM pipelines, now with output validation and per-step timeouts

llm-nano-vm v0.8.0 — deterministic FSM runtime for LLM pipelines, now with output validation and per-step timeouts

1
Comments
4 min read
DevOps vs MLOps vs AIOps: What Changes, What Stays, and a Simple Roadmap to Get Started
Cover image for DevOps vs MLOps vs AIOps: What Changes, What Stays, and a Simple Roadmap to Get Started

DevOps vs MLOps vs AIOps: What Changes, What Stays, and a Simple Roadmap to Get Started

5
Comments
6 min read
Why Your LLM Eval Harness Is Lying to You (And How to Fix It)

Why Your LLM Eval Harness Is Lying to You (And How to Fix It)

Comments
4 min read
Stop paying for idle GPUs in your CI: batching LLM eval jobs

Stop paying for idle GPUs in your CI: batching LLM eval jobs

Comments
4 min read
Routing Event-Camera Pipelines Through an LLM Gateway: A Field Report

Routing Event-Camera Pipelines Through an LLM Gateway: A Field Report

Comments
4 min read
Measuring AI Gateway Failover: 30 Days of Production Data

Measuring AI Gateway Failover: 30 Days of Production Data

Comments
3 min read
Routing diffusion inference traffic across three providers

Routing diffusion inference traffic across three providers

Comments
4 min read
Inference Routing Is Becoming an Infrastructure Placement Problem

Inference Routing Is Becoming an Infrastructure Placement Problem

Comments
9 min read
The Request Is the Wrong Unit of Scale for LLMs on Kubernetes
Cover image for The Request Is the Wrong Unit of Scale for LLMs on Kubernetes

The Request Is the Wrong Unit of Scale for LLMs on Kubernetes

Comments
12 min read
Detecting Silent Model Failure: Drift Monitoring That Actually Works

Detecting Silent Model Failure: Drift Monitoring That Actually Works

1
Comments
4 min read
The Synthetic Data Trap: When It Helps, When It Lies

The Synthetic Data Trap: When It Helps, When It Lies

Comments
4 min read
Detecting Silent Model Failure: Drift Monitoring That Actually Works

Detecting Silent Model Failure: Drift Monitoring That Actually Works

Comments
4 min read
Why Your LLM Evals Are Lying to You

Why Your LLM Evals Are Lying to You

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.