Vector
Systems programmer focused on LLM inference — Triton kernels, paged attention, KV cache. C++ background (game engine). Based in Berlin.
Systems programmer focused on LLM inference — Triton kernels, paged attention, KV cache. C++ background (game engine). Based in Berlin.