vdb-ingest
VDB Ingest Agent
Role
You ingest (index) repository files into the ChromaDB vector store so they can be semantically
searched. You build or update the parent-child chunk structure that query.py searches against.
This is a write operation. The vector store is the backing index for Phase 2 search.
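The parent-child structure mentioned above can be pictured with a minimal sketch. Everything here is an assumption for illustration: the `Chunk` type, the `build_parent_child` helper, the fixed character sizes, and the ID scheme are hypothetical and are not taken from ingest.py. The idea is that small child chunks are what gets matched during search, while each child points back to a larger parent chunk that supplies context.

```python
import hashlib
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Chunk:
    chunk_id: str
    parent_id: Optional[str]  # None marks a parent chunk
    text: str


def split_fixed(text: str, size: int) -> List[str]:
    """Split text into consecutive pieces of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def build_parent_child(path: str, text: str,
                       parent_size: int = 2000,
                       child_size: int = 400) -> List[Chunk]:
    """Hypothetical helper: emit each parent chunk followed by its children."""
    chunks: List[Chunk] = []
    for p_idx, parent_text in enumerate(split_fixed(text, parent_size)):
        # Assumed ID scheme: short hash of file path plus parent index.
        parent_id = hashlib.sha1(f"{path}:{p_idx}".encode()).hexdigest()[:12]
        chunks.append(Chunk(parent_id, None, parent_text))
        for c_idx, child_text in enumerate(split_fixed(parent_text, child_size)):
            chunks.append(Chunk(f"{parent_id}-{c_idx}", parent_id, child_text))
    return chunks
```

A real chunker would more likely split on headings or paragraphs than on fixed character counts; the fixed sizes only keep the sketch short.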
Prerequisites
1. First-time setup
If chromadb is not installed or vector_profiles.json is missing, run the init skill first:
python3 plugins/vector-db/skills/vector-db-init/scripts/init.py
2. Verify server is running
Use the vector-db-launch skill if the server is not already up:
# Check heartbeat
curl -sf http://127.0.0.1:8110/api/v1/heartbeat
# If not running, start it:
chroma run --host 127.0.0.1 --port 8110 --path .vector_data &
See plugins/vector-db/skills/vector-db-launch/SKILL.md for full launch instructions.
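If the heartbeat check needs to run from a script rather than from the shell, the curl call above can be approximated with the standard library. This is a sketch only: `server_is_up` is a hypothetical helper name, and the URL is simply the one used in the curl command.

```python
import urllib.error
import urllib.request

# Same endpoint as the curl heartbeat check above.
HEARTBEAT_URL = "http://127.0.0.1:8110/api/v1/heartbeat"


def server_is_up(url: str = HEARTBEAT_URL, timeout: float = 2.0) -> bool:
    """Return True if the heartbeat endpoint answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False
```

Calling `server_is_up()` before an ingest lets a wrapper script stop early and point to the vector-db-launch skill instead of failing partway through.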
Execution Protocol
Full ingest (first time or full rebuild)
python3 plugins/vector-db/skills/vector-db-agent/scripts/ingest.py \
--profile knowledge --full
Incremental ingest (only files added or changed in the last N hours)
python3 plugins/vector-db/skills/vector-db-agent/scripts/ingest.py \
--profile knowledge --since 24
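To preview which files a time window like `--since 24` might cover, a rough sketch is shown below. It assumes that "changed" means the file's modification time falls inside the window; how ingest.py actually detects changes is not stated here, and `changed_since` is a hypothetical helper.

```python
import time
from pathlib import Path
from typing import List, Optional


def changed_since(root: str, hours: float, now: Optional[float] = None) -> List[Path]:
    """List files under `root` whose mtime falls within the last `hours` hours."""
    reference = time.time() if now is None else now
    cutoff = reference - hours * 3600
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.stat().st_mtime >= cutoff
    )
```

For example, `changed_since(".", 24)` lists candidate files in the current directory tree without touching the vector store.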
Code files (uses AST parsing shim)
python3 plugins/vector-db/skills/vector-db-agent/scripts/ingest.py \
--profile knowledge --full --code
ingest_code_shim.py is invoked automatically for .py and .js files to extract
functions and classes as discrete chunks rather than raw text blocks.
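The idea of extracting functions and classes as discrete chunks can be sketched with Python's built-in `ast` module. This is not the code of ingest_code_shim.py; `extract_code_chunks` is a hypothetical name, it only handles Python source, and it only looks at top-level definitions. How .js files are parsed is not covered here.

```python
import ast
from typing import List, Tuple


def extract_code_chunks(source: str) -> List[Tuple[str, str, str]]:
    """Return (kind, name, source_text) for each top-level function and class."""
    tree = ast.parse(source)
    chunks: List[Tuple[str, str, str]] = []
    for node in tree.body:
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            kind = "function"
        elif isinstance(node, ast.ClassDef):
            kind = "class"
        else:
            continue  # imports, assignments, etc. are skipped in this sketch
        # Recover the exact source text spanned by the definition.
        text = ast.get_source_segment(source, node) or ""
        chunks.append((kind, node.name, text))
    return chunks
```

Each tuple keeps a definition's name alongside its full text, so a search hit can be attributed to a specific function or class rather than to an arbitrary slice of the file.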
After Ingesting
Run a quick smoke test to confirm the new content is retrievable:
python3 plugins/vector-db/skills/vector-db-agent/scripts/query.py \
"describe what was just ingested" --profile knowledge --limit 3
Rules
- Never write to .vector_data/ directly -- always use ingest.py.
- Never read .sqlite3 files with cat or sqlite3 -- will corrupt context.
- Source Transparency Declaration: state which profile was ingested, how many files, and any errors.