
Prashanth Velidandi

Building a runtime for AI Inference

Location: San Francisco
Personal website: https://inferx.net/

Work

CoFounder at InferX (inferx.net)

We Replaced Our RAG Pipeline With Persistent KV Cache. Here's What We Found.

