How to Diagnose Bad Search Quality

Before tuning, establish baselines. Use exact KNN as ground truth, compare against approximate HNSW. Target >95% recall@K for production.

Don't Know What's Wrong Yet

Use when: results are irrelevant or missing expected matches and you need to isolate the cause.

Test with exact=true to bypass HNSW approximation Search API
Exact search bad = model or search pipeline problem. Exact good, approximate bad = tune HNSW.
Check if quantization degrades quality (compare with and without)
Check if filters are too restrictive (then you might need to use ACORN)
If duplicate results from chunked documents, use Grouping API to deduplicate Grouping

Payload filtering and sparse vector search are different things. Metadata (dates, categories, tags) goes in payload for filtering. Text content goes in sparse vectors for search.

Approximate Search Worse Than Exact

Use when: exact search returns good results but HNSW approximation misses them.

qdrant-search-quality-diagnosis

How to Diagnose Bad Search Quality

Don't Know What's Wrong Yet

Approximate Search Worse Than Exact

More from qdrant/skills

qdrant-clients-sdk

qdrant-search-quality

qdrant-performance-optimization

qdrant-monitoring

qdrant-scaling

qdrant-model-migration