nanograph-industry-intel
nanograph industry intel — SPIKE schema bootstrap
Stand up a SPIKE knowledge-graph schema on nanograph. One worked ontology ships pre-built (AI industry intel); for anything else the user brings, design the ontology together using the AI set as a pattern. This skill is ontology-focused: data ingestion, research, and seed generation are handled by the nanograph-ops skill, not here.
- AI path — use the template as-is (schema + real 2026 seed data). Ready to query in under a minute.
- Custom path — collaborate with the user to design domain-specific enums (entity kinds, sub-fields, actor roles, source/artifact types), adapt `schema.pg`, and initialize an empty database. Works for Biotech, Fintech, Space, Crypto, Geopolitics, or any other domain.
- Ask the user for 2–3 example source types they already read — just enough to shape `SourceEntity.type` and `artifactType`. Not a full source list, not an ingestion plan.
- Two reference files: `references/domain-examples.md` explains the AI ontology as a worked pattern; `references/schema-adaptation.md` covers keep-vs-change rules and nanograph syntax gotchas.
Prerequisites
brew install nanograph # or see https://nanograph.io
nanograph version # needs 1.2.0+
An embedding API key in `.env.nano` is required before `nanograph load`, not just at query time. Every SPIKE node (Signal, Element, Pattern, Insight, KnowHow) has `embedding: Vector(3072)? @embed(brief) @index`, and nanograph embeds inline during load for any `@embed(...)` field whose source value is set and whose vector column is null. No API key → load fails with a cryptic `embedding initialization failed` error.
The template defaults to Gemini (`GEMINI_API_KEY`) — also covers multimodal embedding. `OPENAI_API_KEY` works too if you switch `[embedding].provider` in `nanograph.toml`. For CI or offline validation of just the schema/graph structure, set `provider = "mock"` temporarily.
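The provider switch above is a one-line change. A minimal sketch of the relevant `nanograph.toml` block, assuming only the table name `[embedding]` and key `provider` named in the text (any other keys the template ships are omitted here):

```toml
# nanograph.toml — template default shown. Switch to "openai" if you
# export OPENAI_API_KEY instead, or to "mock" for CI-only structural
# validation (mock produces no real vectors, so semantic queries
# will not return meaningful results).
[embedding]
provider = "gemini"
```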
Step 1: Ask which path
Do you want to:
- AI — use the AI industry template as-is (109 nodes, 154 real signals from early 2026)
- Custom — design a schema for a different domain (I'll walk through the ontology with you)
Path A — AI (pre-built)
cp -r <repo>/templates/spike-intel <destination>/spike-intel
cd <destination>/spike-intel
cp .env.nano.example .env.nano # add GEMINI_API_KEY (or OPENAI_API_KEY if you switched provider)
nanograph init
nanograph load --data seed.jsonl --mode overwrite # embeds inline — needs GEMINI_API_KEY
nanograph embed --only-null --reindex # no-op on fresh load (vectors already filled); build indexes
The embed --only-null step is a no-op on a fresh seed (load already populated every vector), but it's useful after you add records incrementally with @embed source fields still null. --reindex builds the vector indexes.
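To see why a fresh load leaves nothing for `embed --only-null` to do, consider a single seed record: nanograph embeds at load time whenever the `@embed` source field (`brief`) is set and the vector column is null. The line below is a sketch of one JSONL record — every key except `brief` is a guess at the template's shape, so check the real `seed.jsonl` for the actual field names:

```json
{"label": "Signal", "id": "sig-0001", "brief": "Hyperscalers repricing inference below cost", "embedding": null}
```

On load, `brief` is set and `embedding` is null, so the vector is filled inline; a later `embed --only-null` then finds no null vectors and only rebuilds indexes when `--reindex` is passed.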
<repo> is the nanograph-skills checkout (or the install path if the skill came from npx skills add).
Verify:
nanograph run patterns disruption # returns SaaSpocalypse, Sovereign AI
nanograph run pattern-signals pat-sovereign-ai
nanograph run similar-patterns "enterprises moving AI off public cloud" # semantic search
Done. For day-to-day ops, point the user at the nanograph-ops skill.
Path B — Custom Domain
No pre-baked ontology sets. The AI schema at templates/spike-intel/schema.pg is the pattern; the target-domain enums come from a conversation with the user. Read references/domain-examples.md first — it explains why each AI enum was chosen, which is the mental model you'll use to design the new one.
Step 1: Confirm domain + slug
Pick a short slug for the project folder and database directory (e.g. bio-intel, fin-intel, space-intel, crypto-intel, geo-intel, climate-intel). This becomes <slug>/ and <slug>.nano.
Step 2: Design the ontology with the user
Six enum families change per domain. Walk through them in order, proposing values and having the user confirm before moving on. Start from the AI set as a mental anchor — do not hand the user a ready-made list for their domain.
- `Element.kind` — the things signals are about. Ask: "What kinds of things do your signals talk about?" 4–7 types.
- `Signal.domain` + `Element.domain` — verticals or sub-fields. Ask: "What sub-fields or verticals matter?" 5–8 values. Must be identical on both Signal and Element (nanograph inlines enums).
- `Company.type` — ecosystem roles. Ask: "What roles do organizations play in your space?" 4–7 values. Each company picks one primary role.
- `SourceEntity.type` — derived from source examples (see Step 3).
- `InformationArtifact.artifactType` — derived from source examples (see Step 3).
- Kind-specific `Element` properties — Ask: "For each kind, what 2–4 properties matter most?" Swap AI's kind-specific fields (`repository`, `use_cases`, etc.) for the domain's.
Keep Pattern.kind (challenge, disruption, dynamic) unchanged — intentionally abstract, works everywhere.
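As a mental anchor only — not a ready-made list to hand the user — here is what the six families might come out as after a hypothetical Biotech conversation, written in the `enum(...)` style `schema.pg` uses (every value below is a placeholder to confirm with the user):

```
// Illustrative placeholders for a hypothetical Biotech pass — confirm each value with the user.
Element.kind: enum(drug, target, trial, platform, indication)
Signal.domain: enum(oncology, immunology, neuro, rare-disease, genomics)
Element.domain: enum(oncology, immunology, neuro, rare-disease, genomics) // identical to Signal.domain
Company.type: enum(pharma, biotech, cro, regulator, investor)
SourceEntity.type: enum(journal, regulatory-filing, trial-registry, newsletter)
InformationArtifact.artifactType: enum(paper, fda-filing, trial-registration, press-release)
// Kind-specific Element properties, e.g. for trial: phase, enrollment, sponsor
```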
Step 3: Ask for source-type examples (to shape types 4 and 5)
Ask for 2–3 example source types the user already reads. The goal is to choose SourceEntity.type and artifactType values — not to build a source list or plan ingestion.
To tune the schema, give me a few example source types you'd pull from — newsletters, publications, filings, podcasts, data feeds, whatever.
Their answers map directly to enum values. Examples of the mapping, not the answer:
- "FDA filings and clinicaltrials.gov" → `artifactType` includes `fda-filing` and `trial-registration`; `SourceEntity.type` gains `regulatory-filing`
- "FCC filings and launch reports" → `artifactType` includes `filing` and `launch-report`; `SourceEntity.type` gains `agency-release`; `Company.type` needs `agency`
- "Governance forum posts and on-chain data" → `SourceEntity.type` adds `governance-forum` and `on-chain-data`; `artifactType` adds `proposal` and `governance-vote`
Do not elicit a full source list or plan the ingestion pipeline — that's out of scope for this skill.
Step 4: Present proposed enums for confirmation
Before editing any files, echo the proposed enum block back to the user:
Element.kind: enum(...)
Signal.domain: enum(...)
Element.domain: enum(...) // same as above
Company.type: enum(...)
SourceEntity.type: enum(...)
InformationArtifact.artifactType: enum(...)
// Kind-specific Element properties:
...
Cheap to revise here; expensive once queries and data start flowing.
Step 5: Copy and adapt the template
cp -r <repo>/templates/spike-intel <slug>
rm <slug>/seed.jsonl <slug>/seed.md # user brings their own data via nanograph-ops
cd <slug>
Edit <slug>/schema.pg:
- Replace the six enum families with the confirmed values from Step 4
- Swap kind-specific Element properties
- Leave `Pattern.kind` alone
Update <slug>/nanograph.toml:
[project]
name = "<Domain> Intel — SPIKE"
[db]
default_path = "<slug>.nano"
See references/schema-adaptation.md for the full keep-vs-change rules and nanograph syntax gotchas (@embed, no edge cardinality, @key on every node, etc.).
Step 6: Lint + init
nanograph lint --schema schema.pg --query queries/signals.gq
cp .env.nano.example .env.nano # add an API key
nanograph init
The database is now empty with the adapted schema. The user's next step — loading data — happens via the nanograph-ops skill, using the mutation-alias pattern (nanograph run add-signal ... etc.). Avoid nanograph load --mode merge for operational adds — it breaks CDC continuity and Lance history. This skill stops here.
Step 7: Hand off
Tell the user:
- Schema designed for `<domain>` at `<slug>/schema.pg`
- Empty database at `<slug>/<slug>.nano`
- Aliases ready in `nanograph.toml` (they'll work once data is loaded)
- Next: use the `nanograph-ops` skill to add data via mutations or `load`
nanograph Syntax Quick Reference
| Concern | nanograph |
|---|---|
| `@embed(prop)` | Unquoted property name is canonical; `@embed("prop")` also parses |
| `@card(...)` | Tolerated but not enforced — don't rely on it |
| `@unique(src)` in edge body | Unsupported |
| `@key` on every node | Required — Chunk gets a synthetic slug `@key` |
| Property separators | Newlines only; commas rejected |
| Comments | `//`, not `#` |
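Putting the table's rules together, a node declaration in the template's style looks roughly like this — the exact declaration grammar lives in `templates/spike-intel/schema.pg`, and everything here beyond `brief`, `embedding`, and the annotations named above is a placeholder:

```
// Newline-separated properties, // comments, unquoted @embed target, @key on the node.
Signal @key(id) {
  id: String
  brief: String
  embedding: Vector(3072)? @embed(brief) @index
}
```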
References
| Reference | When to load |
|---|---|
| `references/domain-examples.md` | Reading the AI ontology as a worked pattern; designing for a new domain |
| `references/schema-adaptation.md` | Keep-vs-change rules, nanograph syntax gotchas, validation checklist |
For everything beyond schema setup — mutations, data loading, CDC, embeddings, maintenance — use the nanograph-ops skill.