Dmitry Kan for Vector Podcast

Adding ML layer to Search: Hybrid Search Optimizer

If we were to look 10 years out, I think the ideal is that we are not doing hybrid search anymore: we just have a better approach. Something beyond vector + keyword, something better, that still supports the case where 0 results is (sometimes) the right answer. We would have that better approach rather than this slightly band-aidy one, but for now hybrid search is exciting!

It is fascinating and funny how things develop, but also turn around. In 2022–23 everyone was buzzing about hybrid search. In 2024 the conversation shifted to RAG, RAG, RAG. And now we are in 2025 and back to hybrid search — on a different level. Finally, there are strides and contributions towards making hybrid search parameters learnt with ML. How cool is that?

When I first looked at hybrid search, I instantly knew that fiddling with a and b in a*keyword + b*neural would be the crux of succeeding with this approach to search. I also knew that applying ML would be a better way than manual tweaking.
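
To make the a*keyword + b*neural idea concrete, here is a minimal sketch in Python. It assumes min-max normalization so that the two score scales are comparable. The function names, the sample scores, and the weights a=0.3, b=0.7 are all made up for illustration and are not taken from any particular search engine.

```python
from typing import Dict


def min_max_normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale scores to [0, 1]; if all scores are equal, map them to 1.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_scores(
    keyword: Dict[str, float], neural: Dict[str, float], a: float, b: float
) -> Dict[str, float]:
    """Compute a*keyword + b*neural per document on normalized scores.

    A document missing from one result list contributes 0.0 for that side.
    """
    kw = min_max_normalize(keyword)
    nn = min_max_normalize(neural)
    docs = set(kw) | set(nn)
    return {doc: a * kw.get(doc, 0.0) + b * nn.get(doc, 0.0) for doc in docs}


# Hypothetical raw scores: BM25-like for keyword, cosine-like for neural.
keyword_hits = {"doc1": 12.0, "doc2": 7.0, "doc3": 2.0}
neural_hits = {"doc2": 0.91, "doc3": 0.88, "doc4": 0.65}

combined = hybrid_scores(keyword_hits, neural_hits, a=0.3, b=0.7)
ranked = sorted(combined, key=lambda doc: combined[doc], reverse=True)
print(ranked)  # ['doc2', 'doc3', 'doc1', 'doc4']
```

Changing a and b reorders the results, which is exactly why picking them by hand gets tedious.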

I’m really happy someone clever did this. Daniel Wrigley and Eric Pugh, both from OpenSource Connections, decided to do exactly that: apply machine learning to the problem of computing these coefficients. In other words: what weight to give to the keyword match vs the neural search match. And what’s fascinating is that they experimented with a multitude of methods, from global to dynamic (per query), with different permutations, feature groups, combination methods, and query sampling. It sounds like an overwhelming study.
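
To illustrate the difference between a global weight and a dynamic (per-query) one, here is a toy sketch. It is not the method from the study: it assumes a plain grid search over a keyword weight w (with the neural weight fixed at 1 - w), reciprocal rank as the metric, and three made-up queries with already-normalized scores. In a learned setup, the per-query best weights would be the kind of target a model tries to predict from query features.

```python
from typing import Dict, List, Tuple

# (keyword scores, neural scores, id of the one relevant document)
Query = Tuple[Dict[str, float], Dict[str, float], str]

# Candidate keyword weights 0.0, 0.1, ..., 1.0
CANDIDATE_WEIGHTS = [i / 10 for i in range(11)]


def combine(keyword: Dict[str, float], neural: Dict[str, float], w: float) -> Dict[str, float]:
    """Weighted sum w*keyword + (1 - w)*neural over the union of documents."""
    docs = set(keyword) | set(neural)
    return {d: w * keyword.get(d, 0.0) + (1.0 - w) * neural.get(d, 0.0) for d in docs}


def reciprocal_rank(scores: Dict[str, float], relevant: str) -> float:
    """1 / rank of the relevant document (ties broken by document id)."""
    if relevant not in scores:
        return 0.0
    ranking = sorted(scores, key=lambda d: (-scores[d], d))
    return 1.0 / (ranking.index(relevant) + 1)


def best_global_weight(queries: List[Query]) -> float:
    """One weight for all queries: maximize mean reciprocal rank."""
    def mean_rr(w: float) -> float:
        return sum(reciprocal_rank(combine(k, n, w), rel) for k, n, rel in queries) / len(queries)
    return max(CANDIDATE_WEIGHTS, key=mean_rr)


def best_weight_per_query(queries: List[Query]) -> List[float]:
    """One weight per query: the first candidate that maximizes its reciprocal rank."""
    return [
        max(CANDIDATE_WEIGHTS, key=lambda w: reciprocal_rank(combine(k, n, w), rel))
        for k, n, rel in queries
    ]


# Hypothetical queries: the first and third favor keyword, the second favors neural.
queries: List[Query] = [
    ({"a": 0.5, "b": 0.0}, {"a": 0.0, "b": 1.0}, "a"),
    ({"c": 1.0, "d": 0.0}, {"c": 0.0, "d": 0.5}, "d"),
    ({"e": 0.5, "f": 0.0}, {"e": 0.0, "f": 1.0}, "e"),
]

print(best_global_weight(queries))     # 0.7
print(best_weight_per_query(queries))  # [0.7, 0.0, 0.7]
```

In this toy data no single weight ranks the relevant document first for all three queries, while a per-query weight does, which is the intuition behind going dynamic.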

What’s even cooler is that all of this is open source.

Check out this episode and let me (us) know what you think. And remember to subscribe to stay tuned for new episodes.

Design: Saurabh Rai, https://www.linkedin.com/in/srbhr/

The design of this episode is inspired by a scene in Blade Runner 2049. There’s a clear path leading to where people want to go, yet they’re searching for something.

As usual, you can find the episode in audio form on your favorite platform.

Top comments (2)

𝚂𝚊𝚞𝚛𝚊𝚋𝚑 𝚁𝚊𝚒

This episode is 🔥 🔥 🔥

Dmitry Kan

Thanks Saurabh!!
