Maximal Marginal Relevance for AI Agents

Search results can be relevant and still bad.

If the top five results all say the same thing, the agent gets less context than it appears to have.

Maximal Marginal Relevance, or MMR, helps solve that.

What MMR Does

MMR balances two goals:

The goal is not to return random variety.

The goal is to avoid redundant context.

AI agents often need to compare sources:

Five near-duplicate snippets from the same source family can hide important edge cases.

MMR can help the retrieval layer include broader evidence.

For a query like:

FastMCP streamable HTTP auth error

A pure relevance ranking might return five similar docs pages.

An MMR-aware result set might include:

That is more useful for debugging.

Agents do not need more results.

They need better coverage.

MMR helps prevent the context window from being filled with duplicates, which improves reasoning and reduces wasted tokens.