What is BM25 and why is it still relevant?
LLM Engineer Interview Questions: RAG Pipeline Design, Chunking Strategies, Hybrid Retrieval
Audio flashcard · 0:19 · Nortren
BM25 is a classical sparse retrieval algorithm from the 1990s that ranks documents by term frequency and inverse document frequency, with term-frequency saturation and normalization for document length. Despite being decades old, it remains a strong baseline and is especially good at matching exact terms such as product codes, names, and rare identifiers, which dense embeddings can miss. Modern RAG systems typically combine BM25 with dense embeddings rather than replacing it.
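To make the scoring concrete, here is a minimal sketch of BM25 in plain Python. It is illustrative rather than a reference implementation: the whitespace tokenizer, the function names (`tokenize`, `bm25_scores`), the sample documents, and the defaults `k1=1.5` and `b=0.75` are all assumptions chosen for the example, and the IDF uses a commonly seen smoothed variant that stays non-negative.

```python
import math
from collections import Counter


def tokenize(text):
    # Deliberately naive (assumption): lowercase and split on whitespace.
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one BM25 score per document in `docs` for `query`.

    k1 controls term-frequency saturation; b controls how strongly
    scores are normalized by document length. Assumes `docs` is non-empty.
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs

    # Document frequency: how many documents contain each term.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    query_terms = tokenize(query)
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Longer-than-average documents get a larger normalizer.
        norm = 1 - b + b * len(tokens) / avg_len
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Smoothed IDF: rare terms weigh more than common ones.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Saturating term frequency: repeats help, with diminishing returns.
            score += idf * tf[term] * (k1 + 1) / (tf[term] + k1 * norm)
        scores.append(score)
    return scores


# Hypothetical mini-corpus for illustration.
docs = [
    "error code E4012 means the payment gateway timed out",
    "the payment was declined by the issuing bank",
    "gateway timeouts are usually transient network problems",
]
scores = bm25_scores("E4012 timeout", docs)
best = max(range(len(docs)), key=scores.__getitem__)
```

Only the first document contains the literal token `e4012`, so it is the only one with a non-zero score. The third document is topically related but gets nothing, because `timeouts` is not the same token as `timeout`. That is the trade-off in miniature: BM25 is precise on exact terms and blind to paraphrase, which is why it is usually paired with dense embeddings.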
en.wikipedia.org