Forem

Jangwook Kim profile picture

Jangwook Kim

404 bio not found

Joined Joined on  Personal website https://effloow.com
Warp 2.0: The Terminal That Became an Agentic Development Environment

Warp 2.0: The Terminal That Became an Agentic Development Environment

Comments
12 min read
Anthropic Message Batches API Production Guide — Cut LLM Costs 50% at Scale

Anthropic Message Batches API Production Guide — Cut LLM Costs 50% at Scale

Comments
8 min read
vLLM 0.8: Native Llama 4 MoE Routing Explained

vLLM 0.8: Native Llama 4 MoE Routing Explained

Comments
10 min read
Claude Opus 4.7: Effort Controls and Migration Guide

Claude Opus 4.7: Effort Controls and Migration Guide

Comments
9 min read
AI Distiller: Extract LLM-Ready Code Context in Seconds

AI Distiller: Extract LLM-Ready Code Context in Seconds

1
Comments
9 min read
Claude API Prompt Caching in Practice — 4 Patterns That Cut LLM Costs by 70%

Claude API Prompt Caching in Practice — 4 Patterns That Cut LLM Costs by 70%

Comments
7 min read
markitdown: Convert Any Document to Markdown for LLMs

markitdown: Convert Any Document to Markdown for LLMs

Comments
8 min read
On-Device AI 2026: Developer Guide to NPUs and Edge Inference

On-Device AI 2026: Developer Guide to NPUs and Edge Inference

Comments
12 min read
Cursor 3 vs Claude Code vs Windsurf — Which AI Coding Tool Should You Use in 2026?

Cursor 3 vs Claude Code vs Windsurf — Which AI Coding Tool Should You Use in 2026?

Comments
10 min read
ChatGPT Workspace Agents: OpenAI's Enterprise Agent Platform

ChatGPT Workspace Agents: OpenAI's Enterprise Agent Platform

Comments
10 min read
Google Gemini Enterprise Agent Platform: Build and Deploy A2A Agents

Google Gemini Enterprise Agent Platform: Build and Deploy A2A Agents

Comments
10 min read
nanobot: Build AI Agents in 4,000 Lines You Can Actually Read

nanobot: Build AI Agents in 4,000 Lines You Can Actually Read

Comments
9 min read
MCP vs A2A vs Open Responses — AI Agent Communication Protocols in 2026: What to Actually Use

MCP vs A2A vs Open Responses — AI Agent Communication Protocols in 2026: What to Actually Use

Comments
6 min read
Meta Llama Stack: Deploy Llama 4 With OpenAI-Compatible API

Meta Llama Stack: Deploy Llama 4 With OpenAI-Compatible API

Comments
9 min read
DeepSeek V4-Pro and V4-Flash: Migration Guide and API Setup

DeepSeek V4-Pro and V4-Flash: Migration Guide and API Setup

Comments
11 min read
GPT-5.5 Spud: Unified Multimodal API — Developer Integration Guide

GPT-5.5 Spud: Unified Multimodal API — Developer Integration Guide

Comments
10 min read
>-

>-

Comments
9 min read
Cursor 2.0: 8 Parallel AI Agents and Visual Editor Bridge

Cursor 2.0: 8 Parallel AI Agents and Visual Editor Bridge

Comments
10 min read
Llama 4 Maverick: 400B MoE Model — Self-Hosting and API Guide

Llama 4 Maverick: 400B MoE Model — Self-Hosting and API Guide

Comments
8 min read
Databricks Unity AI Gateway: MCP Agent Governance Guide

Databricks Unity AI Gateway: MCP Agent Governance Guide

Comments
11 min read
Building a Claude Streaming Agent with Vercel AI SDK

Building a Claude Streaming Agent with Vercel AI SDK

Comments
9 min read
GitLab 18.11: Agentic AI for Security, CI, and Analytics

GitLab 18.11: Agentic AI for Security, CI, and Analytics

Comments
10 min read
Kimi Code K2.6: Moonshot AI's Coding Model vs Claude Code

Kimi Code K2.6: Moonshot AI's Coding Model vs Claude Code

Comments
11 min read
Qwen3.6-Plus: 1M Token Context and Claude-Level Performance

Qwen3.6-Plus: 1M Token Context and Claude-Level Performance

Comments
10 min read
Claude Code Routines Practical Guide — How to Automate AI Tasks 24/7 with Schedules, APIs, and GitHub Events

Claude Code Routines Practical Guide — How to Automate AI Tasks 24/7 with Schedules, APIs, and GitHub Events

Comments
8 min read
LLM Inference Engines Compared 2026: vLLM vs SGLang vs TGI vs MAX

LLM Inference Engines Compared 2026: vLLM vs SGLang vs TGI vs MAX

Comments
10 min read
smolagents: Build Code Agents with HF in Under 100 Lines

smolagents: Build Code Agents with HF in Under 100 Lines

Comments
11 min read
OpenClaw: Self-Hosted AI Gateway for WhatsApp, Telegram & Discord

OpenClaw: Self-Hosted AI Gateway for WhatsApp, Telegram & Discord

Comments
11 min read
MCP Server Kubernetes Deployment — Surviving the 52% Death Rate

MCP Server Kubernetes Deployment — Surviving the 52% Death Rate

Comments
8 min read
Hermes Agent Review: Self-Improving Open-Source AI Agent

Hermes Agent Review: Self-Improving Open-Source AI Agent

Comments
10 min read
Claude Sonnet 4.6: 1M Context, 300K Output, Agentic Coding

Claude Sonnet 4.6: 1M Context, 300K Output, Agentic Coding

Comments
10 min read
Meta Muse Spark Developer Guide 2026: Benchmarks, Modes, API

Meta Muse Spark Developer Guide 2026: Benchmarks, Modes, API

Comments
11 min read
GPT-6 Developer Guide: Symphony Architecture and 2M Context

GPT-6 Developer Guide: Symphony Architecture and 2M Context

Comments
9 min read
Python AI Agent Library Comparison 2026 — Pydantic AI vs Instructor vs Smolagents Practical Guide

Python AI Agent Library Comparison 2026 — Pydantic AI vs Instructor vs Smolagents Practical Guide

Comments
7 min read
>-

>-

Comments
9 min read
Llama 4 Scout: Run Meta's Vision Model on One GPU

Llama 4 Scout: Run Meta's Vision Model on One GPU

Comments
10 min read
The AI Context Window Race: What 1M Tokens Means for Devs

The AI Context Window Race: What 1M Tokens Means for Devs

Comments
10 min read
>-

>-

Comments
9 min read
Microsoft MAI: Three New Foundational Models for Developers

Microsoft MAI: Three New Foundational Models for Developers

Comments
10 min read
OpenAI Agents SDK: Sandbox, Memory, and MCP in 2026

OpenAI Agents SDK: Sandbox, Memory, and MCP in 2026

Comments
10 min read
LLM Structured Outputs in Production: Stop Parsing JSON with Regex

LLM Structured Outputs in Production: Stop Parsing JSON with Regex

Comments
10 min read
The Anthropic Claude Performance Decline Controversy — What Actually Happened

The Anthropic Claude Performance Decline Controversy — What Actually Happened

Comments
9 min read
Vector Database Comparison 2026: Qdrant vs Pinecone vs Chroma

Vector Database Comparison 2026: Qdrant vs Pinecone vs Chroma

Comments
11 min read
Fine-Tune LLMs with LoRA and QLoRA: 2026 Guide

Fine-Tune LLMs with LoRA and QLoRA: 2026 Guide

2
Comments
11 min read
LiteLLM: One Proxy for 140+ LLMs — Setup & Cost Guide

LiteLLM: One Proxy for 140+ LLMs — Setup & Cost Guide

Comments
9 min read
>-

>-

Comments
4 min read
Langfuse: Self-Host LLM Observability for Free — 2026 Guide

Langfuse: Self-Host LLM Observability for Free — 2026 Guide

Comments
10 min read
Gemini 3.1 Ultra: Long Context and Multimodal Dev Guide

Gemini 3.1 Ultra: Long Context and Multimodal Dev Guide

Comments
10 min read
5 Claude Code Agentic Workflow Patterns — Which One Fits Your Work?

5 Claude Code Agentic Workflow Patterns — Which One Fits Your Work?

Comments
5 min read
Microsoft Agent Framework 1.0: Build AI Agents in .NET and Python

Microsoft Agent Framework 1.0: Build AI Agents in .NET and Python

1
Comments
8 min read
OpenAI Codex CLI: Terminal Coding Agent Setup Guide 2026

OpenAI Codex CLI: Terminal Coding Agent Setup Guide 2026

Comments
9 min read
Building a Private MCP Server with Local LLM — Gemma 4 + FastMCP Fully Offline AI Tool Guide

Building a Private MCP Server with Local LLM — Gemma 4 + FastMCP Fully Offline AI Tool Guide

Comments
5 min read
Google ADK: Build Multi-Agent Systems with Python

Google ADK: Build Multi-Agent Systems with Python

Comments
10 min read
Claude Mythos Preview: Developer Guide for 2026

Claude Mythos Preview: Developer Guide for 2026

Comments
9 min read
AI Coding Market Share 2026: Who's Winning?

AI Coding Market Share 2026: Who's Winning?

Comments
9 min read
Running Claude Code in Parallel — Git Worktree Guide for Simultaneous Tasks

Running Claude Code in Parallel — Git Worktree Guide for Simultaneous Tasks

Comments
5 min read
Grok 4 Multi-Agent Architecture: Dev Guide 2026

Grok 4 Multi-Agent Architecture: Dev Guide 2026

Comments
9 min read
Qwen3 Review: Hybrid Thinking Modes and MoE Architecture Explained

Qwen3 Review: Hybrid Thinking Modes and MoE Architecture Explained

Comments
9 min read
Building Your Own MCP Server — Implementing Real AI Tools with Streamable HTTP Transport

Building Your Own MCP Server — Implementing Real AI Tools with Streamable HTTP Transport

Comments
6 min read
Gemini 3.1 Pro Developer Guide: Benchmarks, API, and Pricing

Gemini 3.1 Pro Developer Guide: Benchmarks, API, and Pricing

Comments
11 min read
loading...