skills/eyadsibai/ltk/vector-databases

vector-databases

SKILL.md

Vector Databases

Store and search embeddings for RAG, semantic search, and similarity applications.

Comparison

Database Best For Filtering Scale Managed Option
Chroma Local dev, prototyping Yes < 1M No
FAISS Max speed, GPU, batch No Billions No
Qdrant Production, hybrid search Yes Millions Yes
Pinecone Fully managed Yes Billions Yes (only)
Weaviate Hybrid search, GraphQL Yes Millions Yes

Chroma

Embedded vector database for prototyping. No server needed.

Strengths: Zero-config, auto-embedding, metadata filtering, persistent storage Limitations: Not for production scale, single-node only

Key concept: Collections hold documents + embeddings + metadata. Auto-embeds text if no vectors provided.


FAISS (Facebook AI)

Pure vector similarity - no metadata, no filtering, maximum speed.

Index types:

  • Flat: Exact search, small datasets (< 10K)
  • IVF: Inverted file, medium datasets (10K - 1M)
  • HNSW: Graph-based, good recall/speed tradeoff
  • PQ: Product quantization, memory efficient for billions

Strengths: Fastest, GPU support, scales to billions Limitations: No filtering, no metadata, vectors only

Key concept: Choose index based on dataset size. Trade accuracy for speed with approximate search.


Qdrant

Production-ready with rich filtering and hybrid search.

Strengths: Payload filtering, horizontal scaling, cloud option, gRPC API Limitations: More complex setup than Chroma

Key concept: "Payloads" are metadata attached to vectors. Filter during search, not after.


Index Algorithm Concepts

Algorithm How It Works Trade-off
Flat Compare to every vector Perfect recall, slow
IVF Cluster vectors, search nearby clusters Good recall, fast
HNSW Graph of neighbors Best recall/speed ratio
PQ Compress vectors Memory efficient, lower recall

Decision Guide

Requirement Recommendation
Quick prototype Chroma
Metadata filtering Chroma, Qdrant, Pinecone
Billions of vectors FAISS
GPU acceleration FAISS
Production deployment Qdrant or Pinecone
Fully managed Pinecone
On-premise control Qdrant, Chroma

Resources

Weekly Installs
52
Repository
eyadsibai/ltk
First Seen
Jan 28, 2026
Installed on
gemini-cli44
opencode44
codex43
github-copilot42
cursor37
kimi-cli36