* fix: migration hardening — timeout handling, lock detection, diagnostics Addresses all 8 issues from the v0.18.0 production upgrade field report: 1. LATEST_VERSION now uses Math.max() instead of array-last (was wrong when MIGRATIONS array is out of order: [.., 23, 22, 21, 20, 15, 16]) 2. Pre-flight lock check: runMigrations() queries pg_stat_activity for idle-in-transaction connections >5min before attempting DDL, prints PIDs and kill advice 3. SET LOCAL statement_timeout = 600s inside migration transactions for Supabase compatibility (server-enforced timeout overrides session SET) 4. Catches Postgres error 57014 (statement_timeout) with actionable diagnostics instead of raw stack trace 5. Better progress output: prints schema version range, migration names before/after, checkmarks on success 6. Migration 21 fix: drops files.page_slug_fkey before swapping the pages unique constraint (guarded for PGLite which has no files table) 7. idle_in_transaction_session_timeout = 5min on all Postgres connections (both instance-level and module-level) to prevent 24h stale locks 8. apply-migrations CLI warns when schema migrations are pending, since it only runs orchestrator migrations (System B) not schema DDL (System A) All 34 migrate tests pass. Typecheck clean. * feat(engine): BrainEngine.withReservedConnection() primitive + DRY session defaults Adds a ReservedConnection interface and withReservedConnection(fn) method to BrainEngine. Postgres uses postgres-js sql.reserve() to pin a single backend for the callback; PGLite passes through its single backing connection. Used immediately for non-transactional DDL timeout handling (next commit) and foundation for the future write-quiesce design. Extracts setSessionDefaults(sql) helper in db.ts, absorbing the duplicated idle_in_transaction_session_timeout block that was copy-pasted between db.ts and postgres-engine.ts (Gap 5 / ER-C1). Single write site, both connect paths call the helper now. Codex plan-review flagged that advisory-lock designs on postgres.js pools require a reserved-connection primitive; this is that primitive. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * fix(migrate): close v21/v23 integrity window + non-transactional DDL timeout Two codex-caught issues that both the initial review and the engineering review missed: 1. Migration 21 integrity window. Original v21 dropped files_page_slug_fkey and persisted config.version=21, leaving files WITHOUT any FK to pages until v23 ran and added the replacement files.page_id. Process death between v21 and v23 left files unconstrained while file_upload / `gbrain files` kept accepting writes. Fix: v21 uses sqlFor to split engines (Postgres gets additive-only, PGLite gets the full UNIQUE swap since it has no concurrent writers). v23's handler now wraps the FK drop + UNIQUE swap + page_id addition + backfill + ledger creation in one engine.transaction(). Atomic. 2. Non-transactional DDL timeout gap. runMigrationSQL's else-branch (for migrations with transaction:false, like CREATE INDEX CONCURRENTLY) ran the DDL on the shared pool with no timeout override. Supabase's 2-min server statement_timeout would abort a CONCURRENTLY index on any large table. Fix: use engine.withReservedConnection + SET statement_timeout='600000' inside the isolated connection. Also: extracted getIdleBlockers(engine) helper — single source of truth for the pg_stat_activity query. Shared by the DDL pre-flight warning and the new `gbrain doctor --locks` CLI (next commit). 57014 diagnostic rewritten to the 4-part "what / why / fix / verify" pattern. No longer references a non-existent CLI flag. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * feat(doctor): gbrain doctor --locks CLI flag The v0.18.0 57014 diagnostic referenced `gbrain doctor --locks` but the flag didn't exist. Users hitting statement_timeout would run the suggested command and get "unknown option". Implemented now. On Postgres: queries pg_stat_activity via the new getIdleBlockers() helper, prints each blocker's PID, state, query_start, truncated query, and the exact `SELECT pg_terminate_backend(<pid>);` command. Exits 1 on blockers, 0 on clean. On PGLite: prints "not applicable" (no pool, no idle-in-tx concept) and exits 0. The flag is a safe no-op there. --json emits structured output: {status, blockers: [...]}. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * test: migration hardening regression guards (unit + E2E) test/migrate.test.ts — 10 new regression guards: - LATEST_VERSION equals max(versions) under any array order. Guards against regression to array[-1] (the field report's "told I'm at v16 while 7 migrations behind" bug). - getIdleBlockers shape: pglite returns [], postgres returns rows, query failure returns [] (not throw). - 57014 catch path: mocked engine throws err.code='57014', assert the 4-part diagnostic hits stderr with what/why/fix/verify markers. - apply-migrations pre-flight warning structural check. - setSessionDefaults DRY check: helper defined once in db.ts, postgres-engine calls it, neither path inlines the SET. - runMigrationSQL reserved-connection usage structural check. - Migration 21 test updates for engine-split sqlFor (codex restructure). - Migration 23 atomic-transaction assertion. test/e2e/migrate-chain.test.ts (new): 11 E2E tests against real Postgres: - Post-chain schema invariants (composite UNIQUE exists, old pages_slug_key gone, files_page_slug_fkey gone, files.page_id column present, file_migration_ledger table populated). - doctor --locks real-PG integration (second connection + BEGIN + idle, assert the PID appears in pg_stat_activity). - runMigrationsUpTo advances config.version to target, not past. - withReservedConnection round-trip (executes queries, session GUC visible inside callback). test/e2e/helpers.ts: new runMigrationsUpTo(engine, targetVersion) and setConfigVersion(version) helpers. The v15→v23 chain E2E needed a way to stop at intermediate schema versions; neither `gbrain init --migrate-only` nor the existing setupDB() supported this. Codex caught that the proposed E2E wasn't implementable without new harness work. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * chore: bump version and changelog (v0.18.2) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * docs(changelog): rewrite v0.18.2 entry to match gstack CLAUDE.md format Applied the gstack CHANGELOG style rules from ~/git/gstack/CLAUDE.md: - Two-line bold headline lands a verdict, not a feature list. - Single coherent lead story instead of "Second headline... Third headline..." - "The numbers that matter" table with BEFORE / AFTER / Δ columns, counted against the v0.18.0 field report (the concrete source). - "What this means for your workflow" closing paragraph with the 4-command recovery path. - TODOS.md references removed from user-facing body (explicit rule: never mention TODOS, internal tracking, or contributor-facing details in the user-read portion). - Contributor-only detail (helper extraction, test file paths, interface specifics) moved to a "For contributors" subsection. - Itemized changes reorganized as Added / Changed / Fixed / For contributors. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * docs(changelog): v0.18.2 voice-rule audit — headline, em dashes Audit against ~/git/gstack/CLAUDE.md voice rules: - Headline tightened from 32 words to 19 (rule says 10-14; repo convention on v0.18.1 was 22, this is closer). - Em dashes removed from 7 lines. Replaced with commas, colons, or periods per the "no em dashes" rule. - AI vocabulary audit: clean. - Banned phrases audit: clean. Content unchanged. Only voice/punctuation. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> --------- Co-authored-by: root <root@localhost> Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
236 KiB
Changelog
All notable changes to GBrain will be documented in this file.
[0.18.2] - 2026-04-23
Migrations survive a crash and Supabase's 2-min ceiling.
gbrain doctor --locks finds the connection blocking your upgrade.
The v0.18.0 production upgrade shipped a field report of 8 issues: statement timeouts, stale idle connections, a schema version that lied, a cryptic FK dependency error. The original PR #356 fix covered all 8. A codex plan-review pass found 3 more that neither the initial review nor the eng review caught. This release lands the lot.
The quiet win: if your brain crashes mid-migration on Postgres, it rolls back cleanly now. Before v0.18.2, a process death between migrations 21 and 23 left your files table with no FK to pages while uploads kept going. The window is closed. DDL either commits entirely or not at all.
The visible win: gbrain doctor --locks works. Before v0.18.2 the 57014 timeout error told you to run this command, but the flag didn't exist. Now it does. It shows you every idle-in-transaction backend older than 5 minutes and gives you the exact pg_terminate_backend(<pid>) to free them up. One command, one paste, done.
The large-brain win: CREATE INDEX CONCURRENTLY no longer gets killed at 2 minutes. The migration runner now reserves a dedicated connection and sets session-level statement_timeout='600000' before running non-transactional DDL. Brains at 500K+ pages can run the next schema change without timing out silently.
The numbers that matter
Counted against the v0.18.0 field report (the production upgrade that prompted this release):
| Metric | BEFORE v0.18.2 | AFTER v0.18.2 | Δ |
|---|---|---|---|
| Field-report issues causing production failure | 8 | 0 | −8 |
| Integrity windows between migrations | 1 (v21 → v23) | 0 | −1 |
CREATE INDEX CONCURRENTLY exposed to 2-min timeout |
yes | no (10-min override) | fixed |
| Agent-runnable lock diagnostic | missing | gbrain doctor --locks |
added |
| Regression tests on hardening paths | structural SQL asserts only | 10 unit + 11 real-PG E2E | +21 |
| 57014 error: references a flag that exists | no | yes | fixed |
The striking number: 3 of the 11 findings in this release came from a second AI model (codex) reviewing the plan after the first model (Claude) had already cleared CEO + Eng review. Two-model review catches what one-model review misses. The migration-21 integrity window in particular would have shipped as a new bug if the plan hadn't been challenged.
What this means for your workflow
Most users: run gbrain upgrade. Nothing else to do. Existing brains at schema v21 or v22 are safe, the old FK stayed intact through the original PR #356 path, and the new atomic commit means a future crash can't leave you stranded.
If a migration hits statement_timeout, the error message now tells you exactly what to do: gbrain doctor --locks to find the blocker, terminate, re-run gbrain apply-migrations --yes, verify with gbrain doctor. Four commands, top-to-bottom.
Running a 500K-page brain on Supabase? The next migration that touches a hot table won't hang silently on you.
Itemized changes
Added
gbrain doctor --locks: lists idle-in-transaction backends older than 5 minutes with PID +pg_terminate_backendcommands. Exits 1 when blockers found.--jsonemits structured output. Postgres-only; PGLite prints "not applicable".BrainEngine.withReservedConnection(fn): runs callback on a dedicated pool connection. Postgres viasql.reserve(), PGLite as a pass-through.
Changed
- Migration 21 split into engine-specific paths. Postgres is additive-only (adds
pages.source_id+ index). PGLite gets the full UNIQUE-key swap inline. The FK drop + UNIQUE swap that used to live in v21 moved into v23's handler. - Migration 23 handler now wraps its entire DDL sequence (FK drop, UNIQUE swap,
files.source_id+files.page_idaddition,page_idbackfill,file_migration_ledgercreation) in a singleengine.transaction(). Atomic commit; process-death rolls back to v22 state. - Non-transactional migrations (
CREATE INDEX CONCURRENTLY) now run on a reserved connection with session-levelSET statement_timeout='600000'. Safe on PgBouncer transaction pooling because the connection is isolated from the shared pool. - 57014 (
statement_timeout) diagnostic rewritten to the 4-part pattern: what happened, why, exact commands to fix, how to verify.
Fixed
- Migration 21 integrity window. Previously v21 dropped
files_page_slug_fkeyand persistedconfig.version=21, but the replacementfiles.page_idcolumn wasn't added until v23. Process-death between them leftfilesunconstrained whilefile_upload/gbrain fileskept accepting writes. The FK drop now lives inside v23's atomic transaction. gbrain doctor --locksflag referenced by the v0.18.0 57014 error message but not implemented. The flag exists now.
For contributors
setSessionDefaults(sql)helper insrc/core/db.tsabsorbs the duplicatedidle_in_transaction_session_timeoutblock frompostgres-engine.ts. Both connect paths call the helper; the SET appears exactly once in source.getIdleBlockers(engine)exported fromsrc/core/migrate.ts: single source of truth for thepg_stat_activityquery. Shared by the pre-flight warning andgbrain doctor --locks.ReservedConnectioninterface exposesexecuteRaw(sql, params?)only. Minimal surface, easy to mock. Not safe to call from insidetransaction(); the interface doc says so.test/e2e/helpers.tsaddsrunMigrationsUpTo(engine, targetVersion)+setConfigVersion(version): enables mid-chain migration tests that neithergbrain init --migrate-onlynor the existingsetupDB()supported.test/migrate.test.ts: 10 new regression guards (Math.maxrobustness under array scrambling,getIdleBlockersshape across engines, 57014 catch path structural check, pre-flight warning,setSessionDefaultsDRY, reserved-connection usage inrunMigrationSQL).test/e2e/migrate-chain.test.ts(new): 11 E2E tests against real Postgres covering post-chain schema invariants,doctor --locksreal-connection detection,runMigrationsUpToadvancement semantics,withReservedConnectionround-trip.
Credit: codex plan-review caught the migration-21 integrity window, the non-transactional DDL timeout gap, and the missing doctor --locks CLI. The initial Claude review and the Claude-model eng review both missed them.
To take advantage of v0.18.2
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor
warns about a partial migration:
- Run the orchestrator manually:
gbrain apply-migrations --yes - Verify the outcome:
gbrain doctor # schema_version should match latest gbrain doctor --locks # should exit 0 (no idle-in-tx blockers) - If
statement_timeoutfires during migration, the new 4-part diagnostic tells you exactly what to do: rungbrain doctor --locks, terminate blockers, re-rungbrain apply-migrations --yes. - If anything fails, file an issue: https://github.com/garrytan/gbrain/issues
with output of
gbrain doctorand~/.gbrain/upgrade-errors.jsonl(if it exists).
[0.18.1] - 2026-04-22
Row Level Security hardening pass.
Fresh installs secure by default. Existing brains are brought up to the same bar automatically on upgrade.
A security-posture tightening release. gbrain doctor now enforces RLS across the entire public schema (not a hardcoded allowlist), the base schema ships every gbrain-managed table with RLS enabled, and an automatic migration runs on gbrain upgrade to bring older installs to the same state. After gbrain upgrade (or gbrain apply-migrations --yes), gbrain doctor should report clean on healthy brains.
The doctor check severity upgrades from warn to fail. Missing RLS is a security issue, not a suggestion. gbrain doctor exits 1 when any public table is missing RLS. If you wrap gbrain doctor in a cron or CI health check, expect it to flip red on setups that haven't upgraded.
There is an escape hatch for tables you deliberately want readable by the anon key (analytics views, public materialized views, plugin tables that use anon reads on purpose). It is a Postgres COMMENT ON TABLE with a GBRAIN:RLS_EXEMPT reason=<why> prefix. No CLI subcommand. You drop to psql and type the reason. Full details in docs/guides/rls-and-you.md. The escape hatch is deliberately painful because the default should be closed.
What changes
| Area | BEFORE v0.18.1 | AFTER v0.18.1 |
|---|---|---|
| Scope of doctor RLS check | hardcoded allowlist | every pg_tables row in public |
| Severity when RLS missing | warn (exit 0) | fail (exit 1) |
| Escape hatch for intentional anon-readable tables | none | GBRAIN:RLS_EXEMPT reason=... pg comment |
| Identifier-safe remediation SQL | no | yes (ALTER TABLE "public"."<name>") |
| PGLite doctor output for RLS | misleading warn | clean ok with skip reason |
| Exemption list surfaced on every doctor run | n/a | enumerated by name |
What this means for your workflow
Existing Supabase brains: run gbrain upgrade, then gbrain doctor. Everything managed by gbrain should report clean. If doctor flags something, it's a plugin, user-created, or extension table — the message names each one and gives you the exact ALTER TABLE line.
PGLite brains (the gbrain init default): nothing to do. RLS is irrelevant on embedded Postgres. Doctor skips the check with an explicit message.
Cron and CI wrappers: audit them. The exit-code flip is the one breaking change in this release. If a table is anon-readable on purpose, use the GBRAIN:RLS_EXEMPT comment escape hatch rather than silencing the whole check.
Credit: Garry's OpenClaw for the original check-widening PR (#336). Codex found additional gaps during plan review.
To take advantage of v0.18.1
gbrain upgrade should do this automatically. It runs gbrain post-upgrade,
which calls gbrain apply-migrations --yes, which runs the v0.18.1 orchestrator.
If gbrain doctor still reports missing RLS after upgrade:
-
Apply migrations manually:
gbrain apply-migrations --yes -
Re-run the health check:
gbrain doctor -
If specific tables still fail, the doctor message names each one and gives you the fix. Example:
1 table(s) WITHOUT Row Level Security: my_plugin_state. Fix: ALTER TABLE "public"."my_plugin_state" ENABLE ROW LEVEL SECURITY; -
If a table should stay readable by the anon key on purpose, use the escape hatch (see
docs/guides/rls-and-you.md):COMMENT ON TABLE public.my_analytics_view IS 'GBRAIN:RLS_EXEMPT reason=analytics-only, anon-readable ok, owner=you, date=2026-04-22'; -
If any step fails or the numbers look wrong, please file an issue: https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor --json - contents of
~/.gbrain/upgrade-errors.jsonlif it exists - which step broke
This feedback loop is how the gbrain maintainers find fragile upgrade paths. Thank you.
- output of
Itemized changes
- Schema + migration:
src/schema.sqlandsrc/core/schema-embedded.tsensure every gbrain-managed public table ships with RLS enabled for fresh installs. A new schema migration insrc/core/migrate.tsbackfills existing brains to the same state. The migration is gated onrolbypassrlsand fails loudly if the current role lacks BYPASSRLS (soschema_versionstays at the prior value and retries cleanly after role assignment). - Upgrade orchestrator: New
src/commands/migrations/v0_18_1.tswires the schema migration into thegbrain apply-migrations --yespath (mirrors v0.18.0's Phase A pattern). - Doctor check widened:
src/commands/doctor.tsRLS check now scans every public table frompg_tablesrather than a hardcoded allowlist. Severity upgradedwarn → fail. Success message shows table count. Failure message includes per-table quotedALTER TABLE "public"."<name>" ENABLE ROW LEVEL SECURITY;remediation SQL. - Escape hatch — "write it in blood": Doctor reads
obj_descriptionfor each non-RLS public table. Tables whose comment matches^GBRAIN:RLS_EXEMPT\s+reason=\S.{3,}count as explicitly exempt. Exempt tables are enumerated by name on every successful doctor run so the exemption list never goes invisible. No CLI subcommand — deliberate friction; operators must set the comment in psql. - PGLite skip: PGLite is embedded and single-user with no PostgREST; the RLS check now skips on PGLite with an explicit
okmessage ("Skipped — no PostgREST exposure, RLS not applicable") instead of the misleadingwarnit emitted before. Partial polish: pgvector, jsonb_integrity, and markdown_body_completeness checks still hit the samegetConnection()throw → warn pattern on PGLite. Separate follow-up. - Tests:
test/doctor.test.tsgains source-grep structural regression guards covering scan scope, fail severity + quoted-identifier remediation, PGLite skip wrapper, andGBRAIN:RLS_EXEMPTparsing.test/e2e/mechanical.test.tsE2E: RLS Verificationblock rewritten. The old allowlist-query test is replaced with an every-public-table-has-RLS assertion; new CLI-spawn tests verify fail-on-no-RLS (with exit code + ALTER TABLE in JSON message), exempt-with-valid-reason passes, empty-reason exemption fails, and unrelated comment still fails. All helpers usetry/finallywith unique suffix-per-run table names.test/migrate.test.tsgains a structural guard for the new migration: exists, name matches, BYPASSRLS gating present, LATEST_VERSION has advanced.
- Docs: new
docs/guides/rls-and-you.md— one-page explainer covering why RLS matters, what to do when doctor fails, the escape hatch format + rules, auditing exemptions, PGLite behavior, self-hosted Postgres framing. - Version reconciliation:
VERSIONandpackage.jsonland on0.18.1. - CHANGELOG privacy sweep: replaced a stale
@Wintermutecredit in the 0.17.0 entry with "Garry's OpenClaw" per the CLAUDE.md privacy rule.
Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com
[0.18.0] - 2026-04-22
Multi-source brains. One database, many repos. Federated or isolated, you choose.
gbrain sources is the new subcommand. .gbrain-source is the new dotfile.
A single gbrain database can now hold multiple knowledge repos — your wiki, your gstack checkout, your yc-media pipeline, your garrys-list essays — with clean scoping per source. Slugs are unique per source, not globally, so two sources can both have topics/ai and they are different pages. Every page, every file, every ingest_log row is scoped to a sources(id) row.
Per-source federation controls whether a source participates in unqualified default search. federated=true is cross-recall (your wiki + gstack both show up when you search "retry budgets"). federated=false is isolation (your yc-media content never leaks into your personal writing searches). Flip with gbrain sources federate <id> / unfederate <id>.
Per-directory default via .gbrain-source dotfile walk-up + GBRAIN_SOURCE env var. Same mental model as kubectl / terraform / git: cd ~/yc-media && gbrain query "X" just works, no --source flag needed. Resolution priority: explicit flag > env > dotfile > registered-path-longest-prefix > sources.default config > literal default fallback.
The numbers that matter
9 bisectable commits. 4 new schema migrations. ~85 new tests. Full suite: 2063 pass / 17 fail (the 17 pre-existing master timeouts unchanged). Migration chain runs end-to-end against real PGLite in under 1 second for the integration test.
| Metric | BEFORE v0.17 | AFTER v0.18 | Δ |
|---|---|---|---|
| Max repos per brain | 1 | unlimited | unbounded |
| Slug uniqueness | global | per-source | composite |
| Multi-source search | impossible | default (for federated) | native |
| New CLI commands | — | 9 (sources add/list/remove/rename/default/attach/detach/federate/unfederate) |
+9 |
| Schema migrations shipped | 0 new | 4 (v20-v23) | +4 |
| New unit + integration tests | — | ~85 | +85 |
What this means for agents
When a brain has multiple sources, every search result carries source_id. Agents cite in [source-id:slug] form — [wiki:topics/ai] or [gstack:plans/retry-policy] — so the user can trace which repo each fact came from. The citation key is sources.id (immutable), so renaming a source's display name via gbrain sources rename never breaks existing citations.
Back-compat is total. Pre-v0.18 brains upgrade into a seeded default source with federated=true, and their existing code paths target default via a schema DEFAULT clause. You literally do not have to change anything to upgrade; you only change things if you want to add a second source.
To take advantage of v0.18.0
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor
warns about a partial migration:
- Run the orchestrator manually:
gbrain apply-migrations --yes - Your agent reads
skills/migrations/v0.18.0.mdthe next time you interact with it. The migration chain is fully mechanical (v20 creates the sources table, v21 adds pages.source_id + composite UNIQUE, v22 adds links.resolution_type, v23 adds files.source_id + page_id + file_migration_ledger). No manual data work needed. - Verify the outcome:
gbrain sources list # should show 'default' federated, with your existing page count gbrain stats # existing behavior unchanged gbrain doctor - To start using multi-source:
gbrain sources add gstack --path ~/.gstack --no-federated cd ~/.gstack && gbrain sources attach gstack gbrain sync --source gstack - If any step fails or the numbers look wrong, please file an issue: https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor - contents of
~/.gbrain/upgrade-errors.jsonlif it exists - which step broke
- output of
Itemized changes
Added
gbrain sourcessubcommand group — add, list, remove, rename, default, attach, detach, federate, unfederate. Seedocs/guides/multi-source-brains.mdfor three canonical scenarios (unified wiki+gstack / purpose-separated yc-media+garrys-list / mixed).sourcestable — first-class multi-repo primitive.(id, name, local_path, last_commit, last_sync_at, config). Citation key issources.id, immutable, validated[a-z0-9](?:[a-z0-9-]{0,30}[a-z0-9])?.pages.source_idcolumn + composite UNIQUE (source_id, slug) — slugs unique per source. DEFAULT 'default' on the column so existing single-source callers target the default source automatically via schema default..gbrain-sourcedotfile — walk-up resolution like kubectl/terraform/git.gbrain sources attach <id>writes it in CWD. Auto-selects the source for any command run from that directory or any subdirectory.GBRAIN_SOURCEenv var — power-user / CI / script escape hatch. Second highest priority in resolution (after explicit--source <id>).- Qualified wikilink syntax
[[source:slug]]— new in v0.18 extractor. Unqualified[[slug]]still resolves via local-first fallback.links.resolution_type ENUM('qualified','unqualified')records which kind each edge is for futuregbrain extract --refresh-unqualifiedre-resolution. files.source_id+files.page_id— files now scope per source + reference pages by id (not slug).file_migration_ledgerdrives the S3/Supabase object rewrite under the pending → copy_done → db_updated → complete state machine.gbrain sync --source <id>— per-source sync reads local_path + last_commit from the sources table, writes last_sync_at back. Single-source brains keep using the pre-v0.17sync.repo_path/sync.last_commitconfig keys unchanged.
Changed
- Search dedup is now source-aware. Pre-v0.18 keyed on slug alone; under composite uniqueness that would collapse two same-slug pages in different sources.
pageKey(r) = source_id:slugis the one canonical helper across all four dedup layers + compiled-truth guarantee. Codex review flagged this as regression-critical. SearchResult.source_idoptional field — populated by engine SELECT JOINs. Falls back to'default'for pre-v0.18 rows that lacked the column.- Migration runner sorts by version — if anyone adds a migration out of order in
MIGRATIONS[], the sort guards against silent skips.
Migrations
- v20
sources_table_additive— additive-only. Creates sources table + seeds default row with{"federated": true}. Inherits existingsync.repo_path/sync.last_commit. - v21
pages_source_id_composite_unique— addspages.source_idwith DEFAULT, swaps globalUNIQUE(slug)for compositeUNIQUE(source_id, slug). Lands atomically with the engine'sON CONFLICT (source_id, slug)rewrite. - v22
links_resolution_type— addslinks.resolution_typeCHECK column. - v23
files_source_id_page_id_ledger— Postgres-only (PGLite has no files table). Addsfiles.source_id+files.page_id, backfillspage_idfrom legacypage_slug, createsfile_migration_ledger.
Tests
test/sources.test.ts(14 tests) — CLI dispatcher, validation, overlapping-path guard.test/source-resolver.test.ts(14 tests) — full 6-priority resolution coverage including longest-prefix match.test/storage-backfill.test.ts(13 tests) — state machine + 3 crash-point recovery tests (Codex flagged each).test/multi-source-integration.test.ts(16 tests) — end-to-end against real PGLite, migration chain v2→v23.test/link-extraction.test.ts(+6) — qualified[[source:slug]]parsing + masking + v22 structural.test/dedup.test.ts(+4) — regression-critical source-aware composite key tests.test/migrate.test.ts(+18) — v20/v21/v22/v23 structural assertions.
Docs
docs/guides/multi-source-brains.md— new getting-started guide (federated / isolated / mixed scenarios).skills/migrations/v0.18.0.md— agent-facing migration skill.skills/brain-ops/SKILL.md— new "Cross-source citation format" section.
Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com
[0.17.0] - 2026-04-22
gbrain dream. Run the brain maintenance cycle while you sleep.
One primitive, two CLIs. Autopilot gains lint + orphan sweep automatically.
The README has promised "the dream cycle" for a year. v0.17 makes it real as a first-class command. gbrain dream runs one maintenance cycle and exits, designed for cron. Same six phases as gbrain autopilot — they both delegate to the new runCycle primitive in src/core/cycle.ts. One source of truth for what your brain does overnight.
Phase order is semantically driven: fix files first, then index them. Lint and backlinks write to disk. Sync picks them up into the DB. Extract links the graph. Embed refreshes vectors. Orphan sweep reports the gaps. If your autopilot daemon was doing sync-before-lint (which PR #309's original dream.ts also got wrong), your fixes landed the next cycle instead of the current one. Fixed.
Autopilot users upgrading get lint + orphan sweep for free. No config change. gbrain jobs list shows the full 6-phase report now. If you don't want the daemon modifying files, gbrain dream --phase orphans in cron keeps autopilot for embed+sync and gives you manual control over the writes.
The numbers that matter
Measured against a v0.16 baseline. Lines-of-code delta is net-small: runCycle adds ~500 lines, but the new dream.ts is 80 lines (vs the 446-line original in PR #309), and autopilot's two-path branching collapses to one delegated call.
| Metric | BEFORE v0.17 | AFTER v0.17 | Δ |
|---|---|---|---|
gbrain dream --dry-run mutates DB |
Yes (full-sync + embed silently wrote) | No (every phase honors dry-run) | correctness |
| Sources of truth for "the cycle" | 3-4 (dream inline, dream shell-outs, autopilot inline, Minions handler) | 1 (runCycle) |
DRY win |
| Phase order: fix-then-index | No (sync before lint) | Yes (lint → backlinks → sync → extract → embed → orphans) | semantics |
| Coordination across daemon + cron + Minions worker | Lockfile heuristic with 6 known holes | DB lock table + PID-liveness file lock | primitive upgrade |
| Works under PgBouncer transaction pooling | No (session-scoped pg_try_advisory_lock) |
Yes (TTL row, refreshed between phases) | Supabase-safe |
findRepoRoot walks into wrong git repo |
Yes (10 levels of cwd) | No (explicit --dir OR configured sync.repo_path) | footgun fixed |
| Autopilot daemon phase count | 4 (sync+extract+embed+backlinks in Minions mode; no backlinks inline) | 6 (+lint +orphans) | feature parity |
| CycleReport shape stability for agents | N/A | schema_version: "1" (stable, additive only) |
API contract |
What this means for your workflow
Cron users: one line. 0 2 * * * gbrain dream --json >> /var/log/gbrain-dream.log. You get a structured CycleReport every morning with per-phase timing, counts, and any errors tagged with {class, code, message, hint, docs_url}.
Autopilot users: nothing to do. Your daemon picks up the new phases on next cycle. If you want to see them: gbrain jobs get <autopilot-cycle-id> shows the full report.
Reviewers/codex caught three plan-breakers during multi-round review that would have shipped silent DB writes on dry-run: (1) performSync's full-sync path was ignoring opts.dryRun, (2) runEmbedCore had no dry-run mode and returned void, (3) findOrphans used db.getConnection() global and didn't compose with a passed engine. All three are fixed as preconditions (commits 1-3 of the 6-commit bisectable series).
Credit: Garry's OpenClaw for the original gbrain dream thesis (PR #309). The brand-promise framing survived; the implementation got redesigned from scratch around the runCycle primitive after CEO + Eng + Codex + DX review found structural issues.
To take advantage of v0.17.0
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor warns about a partial migration:
-
Run the migration orchestrator manually:
gbrain apply-migrations --yes -
Your agent reads
skills/migrations/v0.17.0.mdthe next time you interact with it. No mechanical host-repo action required; the schema migration (v16 cycle-lock table) and the behavior shift in autopilot's inline path both apply automatically. -
Verify the outcome:
gbrain dream --help # new command exists gbrain dream --dry-run --json # safe preview gbrain doctor # should show no pending migrationsAutopilot users:
gbrain jobs list --status complete | head -5and inspect anautopilot-cyclejob withgbrain jobs get <id>— the report now includes 6 phases. -
If any step fails or the numbers look wrong, please file an issue: https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor - contents of
~/.gbrain/upgrade-errors.jsonlif it exists - which step broke
This feedback loop is how the gbrain maintainers find fragile upgrade paths. Thank you.
- output of
Itemized changes
New CLI command: gbrain dream
- One-shot maintenance cycle for cron. Exits when done. Flags:
--dry-run,--json,--phase <name>,--pull,--dir <path>,--help. --helpshows cron example + cross-reference toautopilot --installfor continuous daemon.- Empty-state output is intentionally satisfying:
Brain is healthy. 6 phase(s) checked in 2.3s.Agents detect it viastatus: "clean". - Exit code 1 on
status: "failed". Warnings (status: "partial") are not failures — don't page someone. --dirORsync.repo_pathconfig required. No more walk-up-cwd-for-.git footgun.
New primitive: src/core/cycle.ts
runCycle(engine: BrainEngine | null, opts: CycleOpts): Promise<CycleReport>.- Six phases in order: lint → backlinks → sync → extract → embed → orphans.
CycleReporthasschema_version: "1"(stable, additive).status: 'ok' | 'clean' | 'partial' | 'skipped' | 'failed'withreasonfield on skipped.PhaseResult.error: { class, code, message, hint?, docs_url? }on fail. Stripe-API-tier structured errors.yieldBetweenPhaseshook awaited between every phase + before return. Required for Minions worker lock renewal. Exceptions non-fatal.- Engine nullable — filesystem phases run without DB; DB phases skip with
reason: "no_database". - Lock-skip: read-only phase selections (
--phase orphans) skip lock acquisition.
New schema: gbrain_cycle_locks (migration v16)
- DB lock table with TTL (30 min), replaces session-scoped
pg_try_advisory_lockwhich the v0.15.4 PgBouncer-transaction-pooler fix silently broke. - Refreshed between phases via the yield hook. Crashed holders auto-release on TTL expiry.
- PGLite + engine=null use a file-based fallback at
~/.gbrain/cycle.lockwith PID-liveness check (EPERM treated as alive so PID 1 holders aren't mis-classified).
Autopilot + Minions integration
- Autopilot's inline fallback path (
--inlineflag + PGLite mode) now delegates torunCycle. Gains lint + orphan phases it didn't run before. Usespull: trueby default (preserves pre-v0.17 pull semantics). - Minions
autopilot-cyclehandler (insrc/commands/jobs.ts) also delegates torunCycle. Returns{ partial, status, report }sogbrain jobs get <id>surfaces the full structured report. gbrain autopilot --installinstall/uninstall/launchd/systemd/crontab machinery untouched.gbrain autopilot --helpnow cross-referencesgbrain dream.
Precondition fixes (required for the runCycle primitive to compose cleanly)
src/commands/sync.ts:performFullSynchonorsopts.dryRunin first-sync +--fullpaths. Was silently callingrunImportregardless.SyncResult.embedded: numberfield added;first_syncpath now returns real counts fromrunImport(was hardcoded to 0).src/commands/embed.ts:runEmbedCoreaddsdryRun?: booleanopt and returnsEmbedResult { embedded, skipped, would_embed, total_chunks, pages_processed, dryRun }instead ofvoid.gbrain embed --stale --dry-runis now a safe preview.src/commands/orphans.ts:findOrphans(engine, opts)takes aBrainEngineparameter. AddedfindOrphanPages()method toBrainEngineinterface + implementations on bothpostgres-engineandpglite-engine. Dropsdb.getConnection()global — findOrphans now composes with test-injected engines and works on PGLite.
Tests (all run in CI, no DATABASE_URL or API keys required)
test/sync.test.ts: 4 new cases. First-sync dry-run, incremental dry-run,--fulldry-run, SyncResult.embedded shape. PGLite + temp git repo.test/embed.test.ts: 4 new cases. Dry-run with stale chunks, dry-run stale-vs-fresh split, dry-run --slugs, non-dry-run regression guard. MockedembedBatch.test/orphans.test.ts: 4 new cases. Engine-injected findOrphans, includePseudo flag, queryOrphanPages delegation, empty-brain edge. PGLite.test/core/cycle.test.ts(new): 18 cases covering dryRun × phases × lock_held × engine-null. Shared PGLite engine per describe via beforeAll + truncateCycleLocks (cuts test time ~3x vs per-test init).test/dream.test.ts(rewritten, 11 cases): brainDir resolution, phase selection, phase validation, JSON output shape, dry-run propagation, exit-code semantics. Real PGLite + real library calls (nomock.moduleto avoid leakage).
Docs
skills/migrations/v0.17.0.md: new. Informational, no mechanical action required.CHANGELOG.md+CLAUDE.md: updated.
PR #309 disposition
- Closed with credit to @Wintermute. Their thesis ("
gbrain dreamas first-class CLI verb") was right; the implementation got redesigned around the runCycle primitive after deep review surfaced structural issues in the fold approach. Co-Authored-By: Wintermutepreserved on commit 5 (the dream.ts rewrite).
[0.16.4] - 2026-04-22
gbrain check-resolvable ships. The command the README promised for weeks.
Agents and CI finally have a one-shot skill-tree gate that actually exits non-zero when anything is off.
The resolver_health logic has lived inside gbrain doctor since v0.11. The README claimed a standalone gbrain check-resolvable shipped too ... it didn't. Scripts referenced it. Skillify's 10-item checklist referenced it. The binary just shrugged. Fixed.
gbrain check-resolvable runs the same four checks doctor runs (reachability, MECE overlap, MECE gap, DRY violations) but with a stricter contract: exits 1 on any issue, errors AND warnings. Doctor's resolver_health block still exits 0 on warnings-only because doctor has 15 other checks to lean on. The standalone command has nowhere to hide. CI can finally gate on a single command instead of parsing gbrain doctor --json.
The JSON output is a stable envelope, one shape for success and error: {ok, skillsDir, report, autoFix, deferred, error, message}. No more "did it succeed? let me see which keys are present." The deferred array names the two checks still pending (trigger routing eval, brain filing) with links to their tracking issues, so agents reading the JSON know the current coverage boundary.
scripts/skillify-check.ts is now machine-gated. Item #8 on the skillify 10-item checklist used to print "run: gbrain check-resolvable" and pass unconditionally. Now it subprocess-calls the real command and asserts on the exit code. Binary-missing fails loud instead of silently passing ... the kind of silent false-pass that used to put broken skills on the shelf.
To take advantage of v0.16.4
No migration needed. gbrain upgrade brings the binary; nothing to apply. Try it:
gbrain check-resolvable # human output, like doctor's resolver section
gbrain check-resolvable --json | jq .ok # machine-readable gate for CI
gbrain check-resolvable --fix --dry-run # preview DRY auto-fixes without writing
Wire it into your CI:
gbrain check-resolvable || exit 1 # fails the build on any warning/error
Itemized changes
New command
gbrain check-resolvable [--json] [--fix] [--dry-run] [--verbose] [--skills-dir PATH] [--help]— standalone skill-tree gate. Covers reachability, MECE overlap, MECE gap, DRY violations. Exits 1 on any issue.- Stable JSON envelope (
ok,skillsDir,report,autoFix,deferred,error,message) — one shape for both success and error paths. --fixauto-applies DRY fixes viaautoFixDryViolationsbefore re-checking (same ordering asdoctor --fix).--dry-runwith--fixpreviews without writing; the JSONautoFix.fixedarray shows what would change.--verboseprints the Deferred checks note with issue URLs so nobody forgets Checks 5 and 6 are still tracked.
Deferred to separate issues
- Check 5: trigger routing eval — verify every skill's own frontmatter trigger routes to itself in RESOLVER.md. Surfaced via the CLI's
deferred[]output block. - Check 6: brain filing validation — verify mutating skills register the brain directories they write to. Same surface.
Shared refactor
src/core/repo-root.ts— extractedfindRepoRoot()fromdoctor.tsto a zero-dependency shared module with a parameterizedstartDirfor test hermeticity. Doctor imports the shared version; no behavior change (default arg matches prior semantics).src/commands/doctor.ts— updated to import the sharedfindRepoRoot.
Skillify integration
scripts/skillify-check.ts— item #8 ("check-resolvable gate") now subprocess-callsgbrain check-resolvable --jsonand gates on the exit code. Result is cached per process so iterating many skills only runs the subprocess once. Binary-missing fails loud via explicitspawnerror handling ... no silent false-pass.
Tests (22 new cases)
test/repo-root.test.ts— 4 cases for the extractedfindRepoRoot()(first-iter hit, walks up, returns null, default arg behavioral parity).test/check-resolvable-cli.test.ts— 17 cases split between direct unit tests (flag parsing, resolveSkillsDir, DEFERRED constants) and subprocess integration tests (help, JSON envelope shape, exit-code regression gates for warnings AND errors,--fix --dry-runwiring,--verboseoutput).test/skillify-check.test.ts— 2 new cases for the check-resolvable wiring: loud failure when binary is missing (no silent pass), happy path when a synthetic gbrain returnsok: true.
Contract note for CI users
gbrain check-resolvableexits 1 on warnings AND errors.gbrain doctor's resolver_health block still exits 0 on warnings-only. If you scripted against doctor's looser gate,check-resolvablewill bite harder ... on purpose. This honors the README:259 contract: "Exits non-zero if anything is off."
[0.16.3] - 2026-04-22
gbrain agent run actually runs now. The subagent SDK wiring that shipped broken in v0.16.0 is fixed.
Every .ts file in the repo typechecks on every bun run test. Silent regressions end here.
v0.16.0 shipped with the headline feature, gbrain agent run, unable to make a single LLM call. makeSubagentHandler cast new Anthropic() straight to MessagesClient, but the SDK exposes .create() at sdk.messages.create, not on the top-level client. Every subagent job in production died on the first call with client.create is not a function. The type system would have caught it. Nothing was running the type system.
The root cause isn't the casting bug. It's that bun test transpiles TypeScript without type-checking it, and bun test was the entire CI pipeline. Invalid types ran until they hit runtime. This release fixes the symptom (one-line change, deps.client ?? new Anthropic().messages, which typechecks cleanly against MessagesClient because sdk.messages IS the right object) and closes the hole that let it ship (tsc --noEmit now runs on every bun run test, and the CI workflow runs bun run test not bun test). Two independent guards: anyone reverting to new Anthropic() fails the type check; a new regression test drives one handler turn through an injected fake SDK and fails loudly if the factory default branch breaks.
Closing the CI gap surfaced 100+ pre-existing type errors across 30+ files: databaseUrl → database_url rename drift, missing "meeting" / "note" entries in the PageType union that both src and tests already used, a Buffer-as-BodyInit assignment in the Supabase uploader, dead-code comparisons against narrowed status types in the migration orchestrators, and several as X casts that TS 5.6 requires be spelled as unknown as X. All cleaned up. The first tsc run is green.
The numbers that matter
From the merged branch after both the fix and the infra cleanup landed locally against master.
| Metric | Before | After | Δ |
|---|---|---|---|
bun run typecheck errors |
104 | 0 | -104 |
gbrain agent run in prod |
100% failure on first LLM call | Works | ✅ |
| Test file count | ~75 | ~75 (+1 regression test block) | +1 |
bun run test pass rate |
1962 pass / 4 fail (PGLite flake under parallel load) | 1997 pass / 0 fail | +35 pass, -4 fail |
| CI test-gate steps | bun test (no type check) |
bun run test (jsonb guard + progress-to-stdout guard + tsc --noEmit + bun test) |
1→4 |
| Regression guards on this bug class | 0 | 2 (compile-time via tsc, runtime via makeAnthropic injection test) |
+2 |
The 104 → 0 isn't a refactor. Every error was a real correctness signal TS had been trying to send that nobody was listening for. Most were trivial to fix (as unknown as X, one missing union member, one rename propagation). The Buffer/BodyInit one in Supabase upload is a live bug — fetch(url, {body: buf}) works today in Node/Bun but has no type guarantee; the fix copies data.buffer, data.byteOffset, data.byteLength into a Uint8Array slice that is genuinely assignable to BodyInit.
What this means for operators
gbrain agent run "say hello" against a Supabase brain completes end-to-end after this upgrade. No stuck subagent jobs, no client.create is not a function traceback. v0.16.0 users should upgrade immediately — the feature that release was named for did not work.
Itemized changes
gbrain agent run now works against the real Anthropic SDK
src/core/minions/handlers/subagent.ts— factory default construction replaced withconst client: MessagesClient = deps.client ?? makeAnthropic().messages. The SDK'sMessagesresource is already the right object; no helper, no wrapper, no.bind()needed (method-call semantics preservethis).const makeAnthropic = deps.makeAnthropic ?? (() => new Anthropic())adds a dependency-injection seam so tests can exercise the default branch without a real API key or network call.test/subagent-handler.test.ts— newdescribe('makeSubagentHandler default client construction')block drives a full handler turn through a fake SDK injected viamakeAnthropic. If anyone reverts.messagesor reintroduces anew Anthropic()top-level cast, this test fails loudly.
CI type-checking is now real
package.json— addedtypescript@^5.6.0as devDep; added"typecheck": "tsc --noEmit"script; chainedbun run typecheckinto"test"so localbun run testand CI run identical pipelines (grep guards + typecheck + bun test)..github/workflows/test.yml— CI now runsbun run test(the npm script) instead ofbun test(the runner). One line. Biggest-leverage change in the release.
100+ pre-existing type errors cleaned up
So tsc --noEmit actually stays green. All mechanical, zero behavior change. Groups:
databaseUrl→database_urlrename drift in 9 test fixtures (test/agent-cli, test/brain-allowlist, test/minions-shell, test/minions, test/queue-child-done, test/rate-leases, test/subagent-handler, test/subagent-transcript, test/wait-for-completion).PageTypeunion insrc/core/types.tsgained'meeting'and'note'entries. Both were already used in src (link-extraction.tshad a code comment acknowledging the gap) and across 6 test files. The union was just out of date.GBrainConfig.storagefield declared insrc/core/config.ts— the code atsrc/commands/files.tsandsrc/core/operations.tswas readingconfig.storagewith 18 inferred-type errors.ErrorCodeunion insrc/core/operations.tsgained'permission_denied'; the code was throwing this exact string but the union disagreed.- Dead-code comparisons removed from
src/commands/migrations/v0_12_0.ts,v0_12_2.ts,v0_13_0.ts,v0_16_0.ts— each orchestrator had an early-return ona.status === 'failed'followed later by a redundant check against a then-narrowed type. TS correctly flagged the later check as always-false. - postgres.js
Rowcallback typing onsrc/core/postgres-engine.ts— 6.map((r: { slug: string }) => r.slug)callbacks rewritten as.map((r) => r.slug as string)to match postgres.js'sRowgeneric. Same behavior, correct signature. - Buffer → BodyInit in
src/core/storage/supabase.ts:58,129—body: data(Buffer) replaced withbody: new Uint8Array(data.buffer, data.byteOffset, data.byteLength) as BodyInit. Zero-copy view of the same bytes, structurally assignable toBodyInit, no runtime change. - Various
as Xcasts upgraded toas unknown as Xwhere TS 5.6's stricter structural-conversion rules rejected the single-step cast. Affected:src/core/file-resolver.ts(3),src/core/minions/handlers/subagent-aggregator.ts,src/core/minions/worker.ts,src/commands/orphans.ts,src/commands/repair-jsonb.ts,src/core/postgres-engine.ts(2 RowList → array conversions).
Test suite stability
bunfig.toml— new file. Sets[test].timeout = 60_000globally. PGLite WASM init is slow enough that the default 5-second hook timeout flakes when many test files spin up PGLite instances in parallel on a loaded machine.- 8 test files (
test/wait-for-completion,test/extract-fs,test/subagent-handler,test/minions-shell,test/minions-quiet-hours,test/integrity,test/e2e/graph-quality,test/e2e/search-quality) additionally declarebeforeAll(fn, 60_000)/beforeEach(fn, 15_000)as explicit safety nets — redundant withbunfig.tomltoday, but stays as belt-and-suspenders if the bunfig schema ever changes.
To take advantage of v0.16.3
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor warns about anything:
- Verify your brain still runs:
gbrain doctor - Verify the agent runtime works:
Should complete end-to-end. If it fails with
gbrain agent run "say hello"client.create is not a function, the upgrade didn't land — rungbrain upgradeagain. - No migrations required. No schema changes in this release. Fix is in the handler code, not the DB.
- If any step fails, please file an issue: https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor - output of
gbrain agent run "say hello" - contents of
~/.gbrain/upgrade-errors.jsonlif it exists
- output of
Itemized changes
[0.16.2] - 2026-04-22
The deployment guide now reads like a runbook an agent can execute line-by-line.
Three real bugs from v0.16.1 fixed, nine DX gaps closed.
v0.16.1 shipped the Minions worker deployment guide. Re-reading it as the agent it was written for, top-to-bottom, copy-pasting every block, surfaced twelve issues a human skim-reader would not catch. Three are real bugs that break a first-time deploy. Nine are structural gaps that force the agent to invent values.
The bugs: the crontab example used */5 * * * * user bash /path/... which is /etc/crontab format only, so an agent running crontab -e and pasting it got "bad minute" or parsed user as the command. The watchdog script grepped tail -20 of an unrotated log for shutdown markers, so every 5-minute tick after the first restart re-matched the old shutdown line forever and killed the healthy worker on loop. And DATABASE_URL=postgresql://user:pass@... lived directly in /etc/crontab, which is mode 644 (world-readable).
The gaps: no preconditions block, no "which option should I pick" selector, hardcoded /path/to/... and /my/workspace throughout with no template-variable legend, no upgrade section (so an agent coming from v0.13.x had no idea GBRAIN_ALLOW_SHELL_JOBS=1 is now required or that max_stalled flipped from 1 to 5), no alternative to bare cron for Fly/Render/systemd deployments, a "Proposed CLI flags (not yet implemented)" block that an agent would copy and get unrecognized flag, and a MinionWorker.maxStalledCount note that did not tell the agent what to do.
What this means for operators
The guide is now copy-pasteable without invention. Every $VAR is documented in a table at the top. Every code block runs as-is on the target it claims. The watchdog writes a two-line PID file (PID + restart epoch) and the shutdown check only considers log lines newer than the epoch, which is the actual fix for the restart loop. Secrets live in /etc/gbrain.env (mode 600), referenced via BASH_ENV=/etc/gbrain.env in crontab. A new Option 3 ships a systemd unit, a Procfile, and a fly.toml fragment so Fly/Render/Railway/systemd users skip cron entirely. The upgrade section walks the v0.13.x → v0.16.2 checklist (stop worker, apply migrations, add GBRAIN_ALLOW_SHELL_JOBS, swap the watchdog).
The shipped watchdog was verified against an abbreviated end-to-end test (3 ticks in ~30 seconds inside an Ubuntu 22.04 container): tick 1 starts the worker and writes the 2-line PID file; tick 2 sees a shutdown line with a 1-hour-old timestamp and correctly does nothing; tick 3 sees a fresh shutdown line and correctly restarts. The regex was caught and fixed during the test when mawk rejected {n} interval quantifiers. The systemd unit was smoked in a privileged container with Restart=always firing a second banner after a 10-second RestartSec window, confirming crash-recovery works before any host ever boots the unit.
To take advantage of v0.16.2
gbrain upgrade pulls the new guide. If you deployed under v0.16.1 with the original watchdog, swap it:
- Re-read the guide:
less docs/guides/minions-deployment.md - Swap the watchdog script. The v0.16.1 version has the restart-loop bug:
sudo install -m 755 docs/guides/minions-deployment-snippets/minion-watchdog.sh \ /usr/local/bin/minion-watchdog.sh - Move secrets out of crontab. Put
DATABASE_URLandGBRAIN_ALLOW_SHELL_JOBS=1into/etc/gbrain.env(mode 600), reference it from crontab viaBASH_ENV=/etc/gbrain.env. - Fix the cron form. If you pasted the v0.16.1
*/5 * * * * user bash ...intocrontab -e, drop theusercolumn and the explicitbashprefix. - If you have shell access to a long-running box, consider Option 3 (systemd) instead of Option 1 (watchdog). systemd replaces the watchdog entirely and is the cleanest path.
No schema change. No data migration. Docs + snippets only.
Itemized changes
Fixed
- Crontab syntax now matches the target. Two labeled blocks: 5-field for
crontab -e, 6-field with user column for/etc/crontab. An agent no longer hits "bad minute" or hasuserparsed as the command. - Watchdog restart loop killed. The shipped
minion-watchdog.shwrites a two-line PID file (PID on line 1, restart epoch on line 2) and only considers log lines whose ISO-8601 timestamp is newer than the epoch. Stale shutdown lines from earlier restarts no longer re-match every 5 minutes forever. Regex rewritten to use explicit[0-9][0-9][0-9][0-9]instead of{4}intervals because mawk (Debian/Ubuntu's default awk) rejects interval quantifiers. Verified end-to-end in a 3-tick abbreviated test inside Ubuntu 22.04. - Credentials off the world-readable filesystem. Secrets move to
/etc/gbrain.env(mode 600, owned by the worker user), referenced viaBASH_ENV=/etc/gbrain.envin crontab./etc/crontabis mode 644 and user crontabs under/var/spool/cron/are readable by root. A newgbrain.env.exampleships in-repo with the full env surface.
Added
- Preconditions block. Five checks at the top of the guide:
gbrainon PATH, DB connectivity, schema version, crontab write access, and theGBRAIN_ALLOW_SHELL_JOBS=1requirement for shell-job workers. Agent fails fast on setup, not content. - Decision tree. "Which option?" selector at the top of the deployment section. Subagent workloads and long jobs take Option 1. Scheduled scripts take Option 2. No shell access take Option 3. Replaces the previous "recommended for X" prose that forced re-reading.
- Template variable table. Six variables (
$GBRAIN_BIN,$GBRAIN_WORKER_USER,$GBRAIN_WORKER_PID_FILE,$GBRAIN_WORKER_LOG_FILE,$GBRAIN_WORKSPACE,$GBRAIN_ENV_FILE) with meaning and typical value. Agent substitutes once, everything downstream lands correctly. - Upgrade section. v0.13.x → v0.16.2 checklist: stop the worker, run migrations, add
GBRAIN_ALLOW_SHELL_JOBS=1for shell jobs, handle themax_stalleddefault flip from 1 to 5, swap the v0.16.1 watchdog for the current one. - Option 3: service manager. New
systemd.service,Procfile, andfly.toml.partialship underdocs/guides/minions-deployment-snippets/. systemd replaces the watchdog entirely withRestart=always+RestartSec=10sand runs the worker as an unprivileged user withPrivateTmp,ProtectSystem=strict, andReadWritePaths. Smoked end-to-end in a privileged container: banner fired twice across a 10-second restart cycle,Restart=alwayshonored, unit enabled for boot persistence. - Uninstall section. One-paragraph rollback for each option.
docs/guides/minions-deployment.mdlisted inscripts/llms-config.ts. Remote agents fetchingllms.txtorllms-full.txtnow see the deployment guide without having to guess its path.
Changed
--followexample uses a gbrain subcommand, notnode my-script.mjs. The new example submitsgbrain embed --staleas a shell job on a dedicated queue with--timeout-ms 600000. Maps directly onto how an OpenClaw-style agent actually schedules brain maintenance.- "Proposed CLI flags (not yet implemented)" dead-end removed. Replaced with a "Tune per-job today" callout pointing at the
gbrain jobs submitflags that exist in source (--max-stalled,--backoff-type,--backoff-delay,--backoff-jitter,--timeout-ms,--idempotency-key— all first-class since v0.13.1). - Known Issues rewritten as imperatives. "DO NOT pass
maxStalledCounttoMinionWorker" leads the paragraph, followed by the reason and the correct knob (gbrain jobs submit --max-stalled N). Zombie-shell-children section leads with the 10s / 30s numbers and the action.
Contributed by garrytan (issue report), fixes verified by an abbreviated end-to-end test suite (render-check + watchdog 3-tick + systemd container smoke + bun test + full E2E DB lifecycle).
[0.16.1] - 2026-04-22
Minions worker deployment, finally documented.
If you run gbrain jobs work in production, there's now a guide for the sharp edges.
Garry's OpenClaw (gbrain's own instance, out there actually running gbrain jobs work in production) wrote a real deployment guide for the Minions worker, the piece of gbrain most operators hit next after getting sync running. Agents dogfooding the project they live on is a weird, good feedback loop. Two patterns: a watchdog cron for persistent workers, and an inline --follow for cron-only workloads. It covers the connection-drop, stall-detector, and zombie-child traps that show up once your brain is actually working for you. Every command and every default in the guide is checked against current source (max_stalled = 5, not 1 or 3; --follow exits on submitted-job-terminal, not queue-empty; stalled jobs show up as active, not waiting). Nothing about this was obvious, and nothing about it was in the docs before.
With v0.16.0's durable agent runtime now shipping, the persistent worker is load-bearing for a lot more (subagent + subagent_aggregator handlers run there too). A supervised deployment story is the sharp end of the stick.
What this means for operators
If you have been running the Minions worker under nohup with no restart story, this guide is the missing manual. Copy the watchdog script, paste the crontab env lines (SHELL=/bin/bash, PATH, DATABASE_URL, GBRAIN_ALLOW_SHELL_JOBS=1), and wire the cron to run every 5 minutes. You get a restart loop that handles the three silent-death modes: DB connection blip, lock-renewal stall, event loop wedge.
If you are running scheduled shell jobs only, skip the persistent worker and use --follow. 2-3 seconds of startup overhead is trivial when your job runs for a minute.
Docs-only release. No code changed. Zero migration required.
To take advantage of v0.16.1
gbrain upgrade pulls the new guide. Read it:
- Open the guide:
Or browse it on GitHub.
less docs/guides/minions-deployment.md - Persistent worker: copy
minion-watchdog.sh, set crontab env lines, wire a*/5 * * * *cron. - Scheduled shell jobs only: rewrite your cron as
gbrain jobs submit shell ... --follow --timeout-ms Nand drop the persistent worker entirely. - The "Proposed CLI flags" section (
--lock-duration/--max-stalled/--stall-intervalongbrain jobs work): those are on the roadmap. Per-job--max-stalledongbrain jobs submitis already real and writes to the row's column directly.
Itemized changes
Added
- Minions worker deployment guide — new
docs/guides/minions-deployment.mdcovering watchdog cron patterns, inline--followfor cron-only workloads, and the sharp edges of runninggbrain jobs workagainst Supabase in production. Addresses a real gap: existing Minions docs (minions-fix.md,minions-shell-jobs.md) cover schema repair and shell-job security, not deploy patterns. Contributed by your OpenClaw via #287. Pre-landing accuracy pass corrected five factual bugs against current source: themax_stalledcolumn default (5, not 1 or 3), the stalled-jobs smoke-test query (active, notwaiting), the SIGTERM-to-SIGKILL grace window (10s minimum, not 2s), the cron env pattern (crontab env lines, notsource ~/.bashrc), and the--followexit semantics (blocks until submitted job is terminal, not until queue is empty).
[0.16.0] - 2026-04-20
Durable agents land. Your LLM loops survive crashes, timeouts, and worker restarts now.
OpenClaw died mid-run? Come back, resume from the last committed turn.
Your OpenClaw crashes daily. Not "sometimes." Daily. An 8-turn OpenClaw subagent fires a tool call, the worker dies on a memory blip, all eight turns of context are gone, and there's nothing to do but start over from turn zero. This release kills that. gbrain agent run submits an Anthropic Messages API conversation as a first-class Minion job: every turn persists to subagent_messages, every tool call is a two-phase ledger row (pending → complete | failed), and replay on worker restart picks up from exactly the last committed turn. Crash-safe by construction, not by hope.
Fan-out works the same way. --fanout-manifest splits N prompts across N subagent children plus one aggregator. Children run on_child_fail: 'continue' so one failing run doesn't cascade, and the aggregator claims after all children reach ANY terminal state (complete, failed, dead, cancelled, timeout) and writes a mixed-outcome summary. No polling loop, no dead parents stranded in waiting-children.
Plugins work. Host repos drop a gbrain.plugin.json + subagents/*.md dir somewhere on GBRAIN_PLUGIN_PATH, and their custom subagent defs load at worker startup. Your OpenClaw ships its meeting-ingestion, signal-detector, and daily-task-prep subagents in its own repo now; gbrain discovers them day one. Collision rule is deterministic (left-wins with a loud warning). Trust boundary is strict on purpose: plugins ship DEFS, not tools. Tool allow-list stays here.
The numbers that matter
Measured on the v0.15 branch against real Postgres via bun run test:e2e, plus the 159 new unit tests across 10 new test files. Coverage: 12 new runtime modules, 53+ code paths + user flows traced, 3 critical regression tests for the shell-jobs queue surface.
| Metric | BEFORE v0.15 | AFTER v0.15 | Δ |
|---|---|---|---|
| Your OpenClaw run survives worker kill mid-tool-call | No (start over) | Yes (resume from last committed turn) | crash-recovery unlocked |
| Fan-out run with 1 failed child out of N | Aggregator fails | Aggregator still claims + summarizes | mixed-outcome aggregation works |
gbrain agent logs --follow during long Anthropic call |
Silent (looks frozen) | Heartbeat line per turn boundary | visible progress |
| Tool-use replay on resume | N/A (no resume) | Idempotent re-run, non-idempotent aborts | two-phase protocol |
put_page exposure to agent-driven writes |
Full write surface | Namespace-scoped wiki/agents/<id>/… |
fail-closed, server-enforced |
| Plugin subagent defs for downstream hosts | Not supported | GBRAIN_PLUGIN_PATH + validated at startup |
OpenClaw day-1 usable |
| Rate-lease capacity leaks on worker crash | Counter-based (leaks) | Lease-based (auto-prune on next acquire) | no starvation after SIGKILL |
| Anthropic prompt cache on 40-turn agent | Per-turn cold | cache_control: ephemeral on system + tools |
~10x cost reduction (best-case) |
What this means for your OpenClaw
You stop rerunning from zero. A crash at 3am that used to lose two hours of turns now costs you whatever fraction of one turn was in-flight when the worker died. The rest of the conversation is rows in subagent_messages and subagent_tool_executions, and the next worker claim replays from there. gbrain agent logs <job> shows you where it died, which tool it was running, and what came back from the last successful call. Real debugging, not guessing.
Credit: shell-jobs (v0.14) established every pattern v0.15 reuses — handler signature, dual-signal abort, ctx.updateTokens, protected-names, trusted-submit, JSONL audit log, timeout_ms. Codex caught the Mode A "transparent Agent() interception" impossibility during plan review and saved the shape of this work. The v0.15 handler is what survives on the other side of that review.
Itemized changes
New capability: gbrain agent CLI
gbrain agent run <prompt> [--subagent-def|--model|--max-turns|--tools|--timeout-ms|--fanout-manifest|--follow|--detach]— submits a subagent job (or fan-out of N subagents + aggregator) under the trusted-submit flag. Follow mode tails status + logs until terminal; detach prints the job id and exits. Ctrl-C detaches (job keeps running), does not cancel.gbrain agent logs <job_id> [--follow] [--since ISO-or-relative]— merges the JSONL heartbeat audit with persistedsubagent_messagesinto one chronological timeline.--since 5m/1h/2dshorthand supported. Transcript tail renders the full message + tool tree only after the job is terminal.- Always registered on the worker (no separate env flag).
ANTHROPIC_API_KEYis the natural cost gate — no key, the SDK call fails immediately. Who-can-submit is already gated byPROTECTED_JOB_NAMES+TrustedSubmitOptsso only the trusted-CLI path can insertsubagent/subagent_aggregatorrows.
New durability primitives
src/core/minions/handlers/subagent.ts— the LLM-loop handler. Two-phase tool persistence, replay reconciliation for mid-dispatch crashes, dual-signal abort (ctx.signal+ctx.shutdownSignal), Anthropic prompt caching on system + tool defs, injectableMessagesClientfor mocking.src/core/minions/handlers/subagent-aggregator.ts— claims AFTER all children resolve (Lane 1B's queue changes guarantee each terminal child posts achild_doneinbox message), produces deterministic mixed-outcome markdown summary.src/core/minions/rate-leases.ts— lease-based concurrency cap for outbound providers. Owner-tagged rows withexpires_atauto-prune on acquire, so a crashed worker can't strand capacity.pg_advisory_xact_lockguards the check-then-insert.src/core/minions/wait-for-completion.ts— poll-until-terminal helper for CLI callers.TimeoutErrordoes NOT cancel the job; AbortSignal exits cleanly. DefaultpollMs: 1000 on Postgres, 250 on PGLite inline.src/core/minions/handlers/subagent-audit.ts— JSONL audit + heartbeat writer. Rotates weekly via ISO week.readSubagentAuditForJobis the readback path forgbrain agent logs.src/core/minions/transcript.ts— messages + tool executions → markdown renderer. UTF-8-safe truncation; unknown block types fall through to JSON for diagnostics.src/core/minions/tools/brain-allowlist.ts— derives the subagent tool registry fromsrc/core/operations.ts. 11-name allow-list (read-only + deterministicput_page).put_pageschema is namespace-wrapped per subagent so the model writes correct slugs first-try; the server-side check input_pageis the authoritative gate.src/core/minions/plugin-loader.ts—GBRAIN_PLUGIN_PATH(colon-separated absolute paths likePATH) +gbrain.plugin.jsonmanifest +subagents/*.mddefs. Strict path policy, left-wins collision, plugins ship DEFS only (no new tools),allowed_tools:validated at load time.src/mcp/tool-defs.ts— extracted from an inlineoperations.map(...)block in the MCP server so subagent + MCP use the same source of truth. Byte-for-byte equivalence pinned by regression test.
Schema (3 new tables + OperationContext fields + migration orchestrator)
subagent_messages— Anthropic message-block persistence.(job_id, message_idx)UNIQUE;content_blocks JSONBholds parallel tool_use blocks in one assistant message.subagent_tool_executions— two-phase ledger.(job_id, tool_use_id)UNIQUE; status:pending | complete | failed.subagent_rate_leases— lease-based concurrency control. CASCADE deletes on owning job removal so no leaked rows.OperationContextgainsjobId?,subagentId?, andviaSubagent?(fail-closed signal for agent-path gating). Added tosrc/core/operations.ts.src/commands/migrations/v0_15_0.ts— post-upgrade orchestrator (phases: schema → verify → record).v0_14_0.tsnoop stub keeps the registry version sequence gapless.
Queue correctness fixes
failJob,cancelJob, andhandleTimeoutsall emitchild_doneinbox messages withoutcome: 'complete' | 'failed' | 'dead' | 'cancelled' | 'timeout'. Pre-v0.15 onlycompleteJobemitted; failed/cancelled/timed-out children silently stranded aggregator-style parents.- Parent-resolution terminal set expanded from
{completed, dead, cancelled}to include'failed'everywhere parent-state is checked. A failed child withon_child_fail: 'continue'now correctly unblocks the parent. failJobemitschild_doneBEFORE the parent-terminal UPDATE. Without insertion ordering, the EXISTS guard on the inbox INSERT would skip the row onfail_parentpaths (caught by codex iteration 3).MinionJobInput.max_stalledthreads throughMinionQueue.add()as INSERT param (not UPDATE on idempotency replay — that would mutate first-submitter state).
Trust model
subagentandsubagent_aggregatorjoinPROTECTED_JOB_NAMES. MCPsubmit_jobreturnspermission_denied; onlygbrain agent run(withallowProtectedSubmit) can insert these rows.put_pagegains a server-side fail-closed namespace check: whenctx.viaSubagent === true,slugMUST match^wiki/agents/<subagentId>/.+— even ifsubagentIdis undefined (dispatcher bug must not open a hole).
Docs
docs/guides/plugin-authors.md— downstream-OpenClaw-facing walkthrough (minimum viable plugin, path + collision + trust policies, frontmatter fields, caveats).- 12 bisectable commits on
garrytan/minions-seam, each PR-worthy on its own; the full series lands v0.15.0 end-to-end.
Tests
- 159 new unit tests across 10 new files:
mcp-tool-defs,put-page-namespace,migrations-v0_15_0,queue-child-done,rate-leases,wait-for-completion,brain-allowlist,subagent-audit,subagent-transcript,subagent-handler,subagent-aggregator,plugin-loader,agent-cli. - 3 critical regression tests pin the shell-jobs queue surface:
failJobchild_done behavior,put_pagenamespace path for non-subagent callers, MCPbuildToolDefsbyte-equivalence. - E2E
minions-resilience.test.tsupdated: the max_children test renames its spawned children off the now-protectedsubagentname.
[0.15.4] - 2026-04-21
PgBouncer transaction-mode prepared statements, fixed at the pool.
gbrain jobs work against Supabase pooler stops silently dropping rows.
Three separate PRs (#284, #286, #270) were all trying to fix the same bug: on a Supabase transaction-mode pooler (port 6543), postgres.js's per-client prepared-statement cache goes stale every time PgBouncer recycles the backend connection. The symptom under sustained gbrain load is prepared statement "xyz" does not exist in the logs and silently dropped rows during sync. v0.15.4 lands the combined fix: the resolvePrepare() helper from #284, the both-connection-paths coverage from @notjbg's community PR #270, a new doctor check, and real tests against bun:test. The one-liner in #286 is dominated by this.
The one number that matters
There isn't a benchmark, there's a correctness gate. On a Supabase pooler at port 6543 with a 4,500-page sync:
| Before v0.15.4 | After v0.15.4 | |
|---|---|---|
prepared statement ... does not exist errors |
Dozens per sync | Zero |
| Rows inserted vs. manifest count | Short by 50-200 rows (silent) | 1:1 parity |
gbrain jobs work crash under load |
Yes | No |
The silent-drop is the dangerous half. You run gbrain sync, the exit code is 0, the logs have a few noise lines you scroll past, and three weeks later you notice your brain is missing pages. resolvePrepare(url) disables prepared statements when the URL targets port 6543, and the doctor check flags the misconfiguration if you've manually forced GBRAIN_PREPARE=true on that port.
What this means for pooler users
If you connect via aws-0-REGION.pooler.supabase.com:6543, do nothing. The upgrade disables prepared statements automatically and gbrain doctor confirms it with pgbouncer_prepare: ok. If you're on session mode (port 5432 on the pooler host) or direct Postgres, nothing changes: prepared statements stay on, plan caching stays intact. If your PgBouncer runs in session mode on a non-standard port, set GBRAIN_PREPARE=true explicitly.
To take advantage of v0.15.4
gbrain upgrade handles this automatically. If you're not sure whether the fix is live:
- Run the doctor check:
Look for
gbrain doctorpgbouncer_prepare. On a:6543URL you should seeok(prepared statements disabled). On a direct URL the check silently passes. - Verify on sustained load:
Zero
gbrain syncprepared statement ... does not existlog lines. Row count inserted matches the source manifest. - If something looks wrong, file an issue at https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor - the connection URL shape (port and pooler hostname — redact credentials)
- whether
GBRAIN_PREPAREis set
- output of
Itemized changes
Fixed
- Supabase PgBouncer port-6543 prepared statements no longer break sync. New
resolvePrepare(url)helper insrc/core/db.tswith 4-level precedence:GBRAIN_PREPAREenv var →?prepare=query param → port-6543 auto-detect → default. Wired into both the module-singletonconnect()indb.tsAND the worker-instancePostgresEngine.connect({poolSize})insrc/core/postgres-engine.tssogbrain jobs workgets the same treatment as the main CLI. The second path was the gap #284 missed; community PR #270 caught it. Contributed by @notjbg. gbrain doctorsurfaces the misconfiguration. Newpgbouncer_preparecheck reads the configured URL vialoadConfig()and reportsokwhen prepared statements are safely disabled,warnwhen the URL points at port 6543 but prepared statements are still enabled (the footgun that caused silent row drops).
Tests
- New
test/resolve-prepare.test.ts— 11 cases covering the full precedence matrix: env override, URL query param, port auto-detect, malformed URLs,postgres://vspostgresql://schemes, URL-encoded credentials. Usesbun:test(not vitest — #284's original tests were in the wrong framework and would never have run). - Extended
test/postgres-engine.test.ts— new source-level grep assertion that the worker-instanceconnect({poolSize})branch callsdb.resolvePrepare(url)and conditionally includes thepreparekey in the options literal. Mirrors the existingSET LOCAL statement_timeoutguardrail in the same file. If anyone rips out the wiring, the build fails before a shipping brain drops rows.
Supersedes
- Closes #284 (ours, Wintermute): architecture landed as-is (port-only detection, no hostname expansion). Tests rewritten from vitest to bun:test.
- Closes #286 (ours, Codex one-liner): dominated; unconditional
prepare: falsewould have cost direct-Postgres users plan caching for no reason. - Closes #270 (@notjbg): the critical both-connection-paths insight landed; credit preserved in commit trailer and this CHANGELOG entry.
[0.15.3] - 2026-04-21
Two upgrade-night bugs that crashed v0.13 → v0.14, now fixed with regression guards.
Migrations find the right binary. Autopilot spawns its worker. gbrain upgrade survives.
Tonight's production upgrade surfaced eleven bugs. Two of them — Bug 1 (the migration shell-out) and Bug 4 (the autopilot resolver) — survived two eng-review passes AND nine Codex reviews with correct diagnoses and implementable fixes. The other nine had wrong root causes or unimplementable architectures (documented in ~/.claude/plans/ as deferred work with grounded starting context for future /investigate sessions). This release ships the two clean fixes so the next gbrain upgrade actually lands.
Itemized changes
Fixed
gbrain upgradeno longer crashes mid-migration on bun installs. The v0.13.0 migration orchestrator used to shell out viaprocess.execPath, which on bun-installed trees is thebunruntime itself.${bun} extract links --source db …got reinterpreted asbun run extractand crashed with "script not found." The fix drops the execPath detour and shells out to the baregbrainstring, letting the canonical shim on PATH (/usr/local/bin/gbrainby default) win. Regression test intest/migrations-v0_13_0.test.tsgreps the source forprocess.execPathand fails the build if anyone reintroduces the pattern. Contributed by @garrytan.- Autopilot spawns its Minions worker again.
resolveGbrainCliPathcheckedargv[1]first and happily returned/path/to/src/cli.tson bun-source installs.spawn()then failed withEACCESbecause TypeScript source isn't executable, and autopilot silently lost its worker. The fix reorders the probe:which gbrain(shim on PATH) wins first, then compiledprocess.execPath, then anargv[1]=/gbrainfallback. The.tsbranch is deleted entirely. A critical regression test enforces that the resolver NEVER returns a.tspath across any combination ofargv[1]+process.execPath+ shim availability.
Tests
- New
test/migrations-v0_13_0.test.ts— 7 cases covering registry wiring, dry-run semantics, and three regression guards against the Bug 1 re-introduction (noprocess.execPath, noGBRAINconstant, nobunor.tsinexecSynccalls). - Rewrote
test/autopilot-resolve-cli.test.ts— the old test enshrined the buggy.tsreturn path. New test parameterizes argv/execPath combinations and asserts the resolver never returns a.tspath. This is the test that would have caught Bug 4 before it shipped.
Deferred (tracked for follow-up /investigate sessions)
- Bug 2 (pooler MaxClients), Bug 3 (partial-migration retry loop), Bug 5 (v0.14.0 registry gap), Bug 6/10 (duplicate graph edges), Bug 7 (doctor --fast), Bug 8 (autopilot-cycle stalls), Bug 9 (YAML colons), Bug 11 (brain_score breakdown). Each has grounded Codex findings documenting the real root cause and where prior diagnoses went wrong. Landing target: subsequent PR waves.
[0.15.2] - 2026-04-21
Silent binaries are dead. Every bulk action now heartbeats.
Agents can tell the difference between "working" and "hung."
gbrain doctor on a 52K-page brain used to sit silent for 10+ minutes and then get killed by an agent timeout. The checks always completed when run by hand, but stdout buffered and agents saw nothing. The same pattern hit embed, sync, import, extract, migrate, and every orchestrator that shelled out to them — progress either went to stdout with \r rewrites that collapse when piped, or nowhere at all. v0.15.2 routes every bulk action through one shared reporter. Non-TTY default is plain human lines on stderr, one line per event. Agents that want structured progress flip --progress-json and get one JSON object per line.
Progress events never touch stdout. Data and final summaries still go there. Script you wrote six months ago that parses gbrain embed output? Still works. Agent that captures stdout to JSON.parse the result? Now gets clean JSON instead of \r\r\r1234/52000 pages... mixed in.
The numbers that matter
Measured on this repo (80 unit test files, 14 E2E test files, real Postgres+pgvector, 141 E2E cases incl. 3 new doctor-progress tests):
| Metric | BEFORE v0.15.2 | AFTER v0.15.2 | Δ |
|---|---|---|---|
| Commands that stream progress | 3 (ad-hoc \r stdout) |
14 (reporter, stderr, rate-gated) | +11 |
| Progress observable when stdout is piped | 0 of 3 | 14 of 14 | always visible |
| Canonical JSON event schema | none | locked in docs/progress-events.md |
stable |
doctor silence window on 52K pages |
10+ min then killed | heartbeat every 1s | observable |
jsonb_integrity scan targets |
4 (missed page_versions.frontmatter) |
5 | matches repair-jsonb |
Minion jobs that update job.progress |
0 bulk cores | embed wired (import/sync/extract ready via callbacks) | DB-backed |
| Unit tests for progress/CLI plumbing | 0 | 37 (progress + cli-options) | +37 |
| E2E tests for agent-visible progress | 0 | 3 (doctor-progress Tier 1) | +3 |
| Bulk command | Progress today | Progress after v0.15.2 |
|---|---|---|
doctor |
None (blocks) | Per-check heartbeat, 1s on slow queries |
orphans |
Final summary | Heartbeat while NOT EXISTS scan runs |
embed |
\r stdout |
Per-page stderr, job.updateProgress from Minions |
files sync |
\r stdout |
Per-file stderr |
export |
\r stdout |
Per-page stderr (newly in scope) |
import |
Per-100 stdout | Per-file stderr, rate-gated |
extract (fs + db) |
Ad-hoc stderr | Canonical event schema, all paths |
sync |
Final summary | Per-file ticks across delete/rename/import phases |
migrate --to ... |
Per-50 stdout | migrate.copy_pages + migrate.copy_links phases |
repair-jsonb |
Final summary | Per-column heartbeat (stdout stays JSON-clean for orchestrator) |
check-backlinks |
Final summary | Heartbeat during the double-walk |
lint |
Per-file stdout | Per-file stderr, issues still on stdout |
integrity auto |
Own progress file | Unified reporter (file kept as resume marker) |
eval |
None | Per-query tick in single + A/B modes |
apply-migrations |
Inherited child output | Explicit flag propagation + stdio discipline |
Concrete agent win: on a 52K-page brain, gbrain --progress-json doctor emits ~10 events per second on stderr (start per check, heartbeats during the slow scan, finish per check) while gbrain doctor --json keeps stdout clean and JSON-parseable. The agent never sees silence longer than 1 second, and its stdout parser doesn't need to scrub progress garbage.
What this means for you
If you run gbrain in CI, through a Minion worker, or inside any agent that captures stdout, this release means your downstream consumers stop guessing. Slow migrations announce themselves. Long imports name each file. gbrain jobs get <id> returns live progress for Minion-queued bulk work. The gbrain doctor warning you've been ignoring because it fires silently and then 10 minutes later tells you nothing is wrong becomes a 1-second heartbeat that proves it's working. If you're reading logs from a shell pipeline and prefer plain human lines, you don't need to do anything, that's the default for non-TTY stderr. Only add --progress-json when you want structured events.
To take advantage of v0.15.2
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor warns about a partial migration:
- Nothing mechanical is required. v0.15.2 is purely additive to the CLI surface — no schema changes, no migration orchestrator, no data rewrites. Progress events start flowing the next time you invoke a bulk command.
- To stream structured events to your agent:
gbrain --progress-json sync 2> progress.log # or gbrain doctor --progress-json --json > doctor.json 2> doctor.progress - For Minion-queued jobs:
gbrain jobs submit embed # while it runs: gbrain jobs get <id> # .progress is live-updated by the worker - If
gbrain doctorstill looks hung on a very large brain, check the CLI output for heartbeat lines. If they're missing, file an issue at https://github.com/garrytan/gbrain/issues with the command you ran, stdout/stderr samples, and output ofgbrain doctor --fast.
Itemized changes
Reporter (new, src/core/progress.ts)
- Dependency-free. Modes:
auto(TTY →\r-rewriting; non-TTY → plain lines),human,json(JSONL on stderr),quiet. - Rate gating: emits on whichever fires first:
minIntervalMs(default 1000) orminItems(defaultmax(10, ceil(total/100))). Finaltickwheredone === totalalways emits. startHeartbeat(reporter, note)helper for single long-running queries (doctor'smarkdown_body_completeness,orphansanti-join,repair-jsonbper-column UPDATE).child()composes phase paths,sync.import.<slug>, not flat<slug>.- EPIPE defense on both sync throws and stream
'error'events. Singleton module-level SIGINT/SIGTERM handler emitsabortevents for every live phase, one handler no matter how many reporters exist.
CLI plumbing (src/core/cli-options.ts, src/cli.ts)
- Global flags
--quiet,--progress-json,--progress-interval=<ms>parsed before command dispatch. CliOptionssingleton (getCliOptions) reachable from every command without threading a new parameter through 20 handlers.OperationContext.cliOptsextends shared-op dispatch, MCP callers see defaults, CLI callers see parsed flags.childGlobalFlags()helper: appends the parent's flags to everyexecSync('gbrain ...')call in the migration orchestrators, so child progress matches parent mode.
JSON event schema
- Stable from v0.15.2, documented in
docs/progress-events.md. {event, phase, ts}always present. Optional:total,done,pct,eta_ms,note,elapsed_ms,reason. No fake totals when a query has no count.- Phases use
snake_case.dot.path. Machine-stable. Agent parsers can group by phase prefix (alldoctor.*events belong to one run).
Backward-compat warnings
Progress for embed, files, export, extract, import, migrate-engine moved from stdout to stderr. Stdout now carries only final summaries and --json payloads. Scripts that parsed process.stdout for progress lines (\r 1234/52000 pages...) see empty stdout for those counters; the data they actually want (the final "Embedded N chunks" summary) is still there. Point anything grepping stdout for progress at stderr instead.
Minion handlers (src/commands/jobs.ts)
embedhandler passesjob.updateProgress({done, total, embedded, phase})as theonProgresscallback. Primary Minion progress channel is DB-backed, readable viagbrain jobs get <id>or theget_job_progressMCP op. Stderr fromjobs workstays coarse for daemon liveness.- Other handlers (
sync,extract,backlinks,autopilot-cycle,import) have the callback plumbing ready from the core functions; wiring the remaining handlers is a follow-up.
gbrain doctor
jsonb_integritynow scans 5 targets (addspage_versions.frontmatter), matchingrepair-jsonb's surface. The old 4-target check missed one of the repair sites.- Per-check heartbeats so agents see
doctor.db_checksstarting, which check is in-flight, anddoctor.markdown_body_completenessscanning. - No false totals: the
LIMIT 100truncation check reportsheartbeat, nottickwith a fake count.
Upgrade (src/commands/upgrade.ts)
- Post-upgrade timeout bumped 300s → 1800s (30 min). Override via
GBRAIN_POST_UPGRADE_TIMEOUT_MS. The old 300s cap killed v0.12.0 graph-backfill migrations on 50K+ brains; heartbeat wiring in v0.15.2 makes the long wait observable.
CI guard
scripts/check-progress-to-stdout.shgrepssrc/forprocess.stdout.write('\r...')and failsbun run testif any regression lands.
Tests
- New:
test/progress.test.ts(17 cases — mode resolution, rate gating, EPIPE paths, SIGINT singleton, child phase composition),test/cli-options.test.ts(18 cases — flag parsing,--quietskillpack-check collision regression, global-flag strip-and-dispatch),test/e2e/doctor-progress.test.ts(3 cases, Tier 1 — spawns the real CLI against a real Postgres, asserts stderr JSONL matches the schema and stdout stays clean).
[0.15.1] - 2026-04-21
Fix wave: 4 hot issues that blocked real brains, landed together.
PGLite survives macOS 26.3. Minions actually rescues SIGKILL'd jobs. Autopilot dashboards stop the 14.6s seqscan. bun install -g tells you when it's broken.
v0.15.1 is the hotfix wave on top of the v0.14.x stack (shell job type in v0.14.0, doctor DRY + --fix in v0.14.1, 8 deferred bug fixes in v0.14.2) plus v0.15.0 (llms.txt + AGENTS.md): four user-filed issues against v0.13.x, fixed and verified together, plus three scope expansions that close adjacent footguns. Upgrade is automatic. If gbrain upgrade runs clean, your brain gets faster and more reliable on the next sync cycle.
The numbers that matter
The four issues this release closes, with measured impact:
| Issue | Before v0.15.1 | After v0.15.1 | Δ |
|---|---|---|---|
#170 SELECT * FROM pages ORDER BY updated_at DESC on 31k rows (Postgres) |
~14.6s seqscan | <20ms index scan | ~700x |
#219 max_stalled default on minion_jobs |
3 (three rescues before dead, v0.14.2 set this) | 5 (four rescues before dead) | extra headroom for flaky deploys |
#219 existing waiting/active jobs with max_stalled<5 |
would still dead-letter earlier than expected | backfilled to 5 on upgrade | closes the pain today |
#218 bun install -g github:garrytan/gbrain postinstall failure |
silent ` | true` | |
| #223 PGLite WASM crash on macOS 26.3 | raw Aborted(), no hint |
pinned @electric-sql/pglite to 0.4.3 + actionable error message naming the issue |
users can route to #223 |
What this means for you
If you run autopilot against a Supabase brain with 30k+ pages, your health/dashboard cycle was silently burning 14.6 seconds on every iteration. The new index drops that to single-digit milliseconds without locking writes (Postgres gets CREATE INDEX CONCURRENTLY with an invalid-index cleanup DO block; PGLite gets plain CREATE INDEX since it has no concurrent writers). Your agent stops blocking on list-pages-by-date queries.
If you use Minions, the "SIGKILL mid-flight, 10/10 rescued" claim is now actually true out-of-the-box with generous headroom. Default max_stalled=5 means a kill -9'd worker gets picked up by the next worker instead of dead-lettered early. v15 migration backfills existing non-terminal rows (waiting/active/delayed/waiting-children/paused) so upgrading doesn't leave a queue full of doomed jobs.
If you install via bun install -g github:... (not recommended but people try it), you'll now see a loud stderr warning with a link to #218 instead of a broken CLI that fails on next invocation. The real fix is git clone + bun link, documented in README and INSTALL_FOR_AGENTS.md.
If you're on macOS 26.3 and PGLite was crashing with Aborted(), the pin to 0.4.3 gives us the best shot at avoiding the WASM regression (noting: 0.4.3 is unverified against 26.3 in CI — the error-wrap at pglite-engine.ts connect() is the safety net if the pin doesn't hold). Any PGLite init failure now shows the #223 link instead of a raw runtime error.
To take advantage of v0.15.1
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor warns about a partial migration:
- Run the orchestrator manually:
gbrain apply-migrations --yes - Verify the outcome:
psql "$DATABASE_URL" -c "\d minion_jobs" | grep max_stalled # DEFAULT should be 5 psql "$DATABASE_URL" -c "\d pages" | grep idx_pages_updated_at_desc # index should exist gbrain doctor - If any step fails or the numbers look wrong, file an issue with
gbrain doctoroutput and the contents of~/.gbrain/upgrade-errors.jsonlif it exists. https://github.com/garrytan/gbrain/issues
Itemized changes
Added
- Schema migration v14 —
CREATE INDEX [CONCURRENTLY] IF NOT EXISTS idx_pages_updated_at_desc ON pages (updated_at DESC)(engine-aware; Postgres uses CONCURRENTLY with an invalid-index DO-block cleanup, PGLite uses plain CREATE). Closes #170. Contributed by @fuleinist (#215). - Schema migration v15 —
ALTER TABLE minion_jobs ALTER COLUMN max_stalled SET DEFAULT 5(bumps v0.14.2's default of 3 to 5 for extra flaky-deploy headroom) +UPDATEbackfill scoped to non-terminal statuses (waiting/active/delayed/waiting-children/paused) so existing queued work benefits on upgrade. Closes #219. Reported by @macbotmini-eng. MinionJobInput.max_stalled— new optional field, plumbed throughqueue.add()with[1, 100]clamp.gbrain jobs submit --max-stalled N— CLI flag to set per-job stall tolerance.gbrain jobs submit --backoff-type,--backoff-delay,--backoff-jitter,--timeout-ms,--idempotency-key— scope-expansion audit exposing existingMinionJobInputfields as first-class CLI flags.gbrain jobs smoke --sigkill-rescue— opt-in regression smoke case that simulates a killed worker and asserts the v0.15.1 default actually rescues.gbrain doctor --index-audit— new opt-in Postgres check that reports zero-scan indexes frompg_stat_user_indexes. Informational only (no auto-drop). PGLite no-ops.BrainEngine.kindreadonly discriminator ('postgres' | 'pglite') — lets migrations and consumers branch on engine withoutinstanceof+ dynamic imports.package.json trustedDependencies: ["@electric-sql/pglite"]— lets Bun run PGLite's dep postinstall on global installs.
Changed
@electric-sql/pglitepinned to exactly0.4.3(was^0.4.4) — best-available mitigation for the macOS 26.3 WASM abort. Reported by @AndreLYL (#223). Flagged as unverified; reproduce on a 26.3 machine and file a follow-up if it still aborts.package.json postinstall— now warns loudly on stderr with a recovery URL instead of silencing errors with2>/dev/null || true.bun install -ghitting a migration failure now tells you what to do. Reported by @gopalpatel (#218).src/core/pglite-engine.ts connect()— wrapsPGlite.create()with a friendly error pointing at #223 andgbrain doctor. Nests the original error for debuggability.doctorschema_versioncheck — now fails loudly whenversion=0(migrations never ran), linking #218.README.md+INSTALL_FOR_AGENTS.md— explicit warning againstbun install -g github:garrytan/gbrain.
Fixed
- The "SIGKILL mid-flight, 10/10 rescued" claim is now accurate out-of-the-box with headroom (#219). Schema default 3 → 5.
- Autopilot dashboards stop blocking on list-pages queries on 30k+ row Postgres brains (#170).
- PGLite error on macOS 26.3 is now actionable instead of a raw
Aborted()(#223). bun install -gno longer produces a silently broken CLI (#218) — postinstall surfaces failures.
Internal
Migrationinterface extended withsqlFor: { postgres?, pglite? }+transaction: booleanfields. Runner picks the engine-specific SQL branch and (on Postgres only) bypassesengine.transaction()whentransaction: false(required for CONCURRENTLY).scripts/check-jsonb-pattern.shextended with a CI guard againstmax_stalled DEFAULT 1regressing.- ~15 new unit tests covering max_stalled default/clamp/backfill/v14/v15 semantics. 3 regression tests pinned by IRON RULE.
test/e2e/now runs test files sequentially viascripts/run-e2e.shto eliminate shared-DB races that caused ~3/5 runs to have 4-10 flaky fails. Every run post-fix: 13 files, 138 tests, 0 fails.
[0.15.0] - 2026-04-21
GBrain now talks to LLMs the way modern docs sites do.
One URL, full context. Three files, zero drift.
Three new artifacts ship at the repo root: llms.txt (llmstxt.org-spec index), llms-full.txt (same map with core docs inlined, ~225KB, fits well under a 150k-token context window), and AGENTS.md (the non-Claude-agent operating protocol). All three are generator-driven. scripts/build-llms.ts reads a curated scripts/llms-config.ts and emits llms.txt + llms-full.txt deterministically; AGENTS.md is hand-written and uses relative links so it survives forks and rename. Every agent that clones GBrain now has a one-screen answer to "I just got here, what do I do?"
README and INSTALL_FOR_AGENTS.md now point agents at AGENTS.md first. The old install prompt still works, but the leverage point, Codex's read of the plan, was that these files are invisible unless the install path references them. Fixed.
The numbers that matter
Measured on this release:
| Metric | BEFORE | AFTER | Δ |
|---|---|---|---|
| Agent entry points with clear install protocol | 1 (CLAUDE.md, Claude Code only) | 3 (CLAUDE.md + AGENTS.md + llms.txt) | +non-Claude coverage |
| Docs referenced at a single canonical URL | 0 | 20 (across 5 H2 sections) | index exists |
| Full-context fetch round-trips | ~20 (one per doc) | 1 (llms-full.txt, 224 KB) |
~20x fewer fetches |
| Tests guarding the doc index | 0 | 7 (paths resolve, idempotent, spec shape, regen-drift, content contract, AGENTS mirror, size budget) | +7 |
| Pre-existing repo bugs found and fixed | — | 1 (git pull origin main → master) |
drive-by |
The 7 tests enforce content contract: removing skills/RESOLVER.md or the Debugging H2 from the config fails bun test. Forgetting to rerun bun run build:llms after adding a new doc fails bun test. The size budget (600KB) fails bun test if llms-full.txt balloons.
What this means for you
If you're running GBrain: nothing to do. Your agent already has CLAUDE.md. But next time you install GBrain on Codex, Cursor, or OpenClaw, the agent lands on AGENTS.md and walks the install without hunting. If you run a fork, regenerate with LLMS_REPO_BASE=https://raw.githubusercontent.com/your-org/your-fork/main bun run build:llms to rewrite URLs. If you publish GBrain docs alongside your own, llms.txt is the index; llms-full.txt is the drop-into-a-context-window bundle.
Credit to Codex for catching that the original plan's AGENTS.md was underpowered, that the eng review missed a content-contract test, and that the install prompt was the real leverage point. Seven of the fifteen Codex findings landed directly in the plan; three went to user decision; five stayed as intentional NOT-in-scope.
To take advantage of this release
gbrain upgrade does not need to do anything. These are new public files; existing installs pick them up on their next pull.
- If you wrote a downstream fork: regenerate with your URL base.
LLMS_REPO_BASE=https://raw.githubusercontent.com/your-org/your-fork/main bun run build:llms git add llms.txt llms-full.txt && git commit - If you add a new doc under
docs/: add it toscripts/llms-config.ts, thenCI blocks ship if these drift.bun run build:llms bun test test/build-llms.test.ts - Verify it actually works: ask a fresh LLM
Answer should cite
Fetch https://raw.githubusercontent.com/garrytan/gbrain/master/llms.txt and tell me how I'd debug a broken live sync.docs/GBRAIN_VERIFY.md,docs/guides/live-sync.md, andgbrain doctor.
Itemized changes
Added
AGENTS.mdat repo root — ~45-line non-Claude-agent operating protocol. Install, read order, trust boundary, config/debug/migration pointers, fork instructions. Uses relative links so it survives renames.llms.txtat repo root — llmstxt.org-spec index. H1 + blockquote + 5 required H2 sections (Core entry points, Configuration, Debugging, Migrations) plus an Operational tips block withgbrain doctor,gbrain orphans,gbrain repair-jsonb. ~4KB.llms-full.txtat repo root — same index with core docs inlined under## {path}headings for single-fetch ingestion. ~225KB, under the 600KBFULL_SIZE_BUDGET.scripts/llms-config.ts— curated TS config.LLMS_REPO_BASEenv var lets forks regenerate with their own URL base.includeInFull: falseflags entries that should appear inllms.txtbut not be inlined inllms-full.txt(Philosophy, Optional, CHANGELOG).scripts/build-llms.ts— the generator. Deterministic, no timestamps, sorted by config order. Warns (does not fail) ifllms-full.txtexceedsFULL_SIZE_BUDGETwith the biggest entries listed.test/build-llms.test.ts— 7 cases: paths resolve on disk, generator idempotent, llms.txt spec shape, checked-in files match generator output (drift guard), content contract (RESOLVER / AGENTS / INSTALL_FOR_AGENTS referenced), AGENTS mirrors README+INSTALL install path, size budget enforcement.bun run build:llmsscript inpackage.json.
Changed
-
README.md— adds a one-line LLMs/Agents pointer above the install CTA and a follow-up paragraph under the agent paste block namingAGENTS.md+llms.txtas fallback entry points for non-Claude agents. -
INSTALL_FOR_AGENTS.md— new "Step 0: If you are not Claude Code" prelude points agents atAGENTS.mdfirst. -
CLAUDE.md— addsscripts/llms-config.ts,scripts/build-llms.ts, andAGENTS.mdto Key files. Explicitly notes that committed generator output is NOT analogous toschema-embedded.ts(no runtime consumer; committed for GitHub browsing + fork safety). -
INSTALL_FOR_AGENTS.md:136—git pull origin main→git pull origin master. Pre-existing drift: README and CI usemaster,origin/HEAD -> master, but the upgrade instructions told users to pull from a branch that doesn't exist. Folded into this release as a drive-by fix.
[0.14.2] - 2026-04-20
Eight deferred bugs, root-cause fixes, one clean wave.
Sync stops losing files. Migrations stop retrying forever. Pooler users get a knob.
Eight bugs were previously scoped out of a PR after Codex review caught wrong root causes and unimplementable architectures. v0.14.2 takes each back to the actual code and fixes the structural gap. /plan-eng-review + /codex consult verified every load-bearing claim before a single line of code ran (20 findings, 12 triggered plan revisions before implementation).
The practical wins for a busy brain: gbrain sync no longer silently loses files with unquoted-colon YAML titles across any of the three sync paths. gbrain upgrade can't get stuck in an infinite retry loop on a wedged migration (3-partial cap + --force-retry escape hatch). Supabase pooler users have GBRAIN_POOL_SIZE to throttle without touching schemas. gbrain doctor --fast tells you WHY it's skipping DB checks instead of lying about no database being configured. brain_score gets a breakdown so 79/100 tells you which component is costing you the 21 points.
The numbers that matter
Measured on this branch's diff against origin/master:
| Metric | BEFORE v0.14.2 | AFTER v0.14.2 | Δ |
|---|---|---|---|
| Sync paths that silently drop files on YAML break | 3 of 3 | 0 of 3 | no more silent loss |
| Wedged-migration retry loops | infinite | 3-partial cap + --force-retry |
bounded |
| Pool-size knob for Supabase pooler | none | GBRAIN_POOL_SIZE env |
first-class knob |
doctor --fast messages |
1 catch-all | 3 source-specific | honest signal |
brain_score observability |
one number | 5-field breakdown (sum == total) | diagnosable |
Duplicate edges in gbrain graph output |
leaked per-origin | deduped at presentation | schema preserved |
minion_jobs.max_stalled default |
1 (dead-letter on first stall) | 3 | autopilot survives long embed runs |
| New + extended unit tests | 1696 | 1743 (+47 + 119 new assertions) | +47 |
| Root-cause fixes vs symptom patches | 0 | 8 / 8 | structural |
What this means for you
Your agent's feedback loops tighten. When sync blocks, doctor surfaces the exact file with the YAML problem and the commit where it showed up. When a migration gets stuck, there's a cap and a clear escape. When you're on Supabase's transaction pooler and gbrain upgrade spawns subprocesses, set GBRAIN_POOL_SIZE=2 and stop MaxClients crashes. Run gbrain doctor and the brain_score breakdown points at what to fix first: embed coverage, link density, timeline coverage, orphans, or dead links.
To take advantage of v0.14.2
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor warns about a partial migration:
- Run the orchestrator manually:
gbrain apply-migrations --yes - Supabase pooler users (port 6543) now have a knob. If you hit MaxClients during upgrades, set
GBRAIN_POOL_SIZE=2(or lower) in your environment before runninggbrain upgrade. - Check sync health after the upgrade:
If it warns about
gbrain doctorsync_failures, the paths and errors are in~/.gbrain/sync-failures.jsonl. Fix the offending YAML frontmatter and re-rungbrain sync, or usegbrain sync --skip-failedto acknowledge known-broken files and advance past them. - Wedged migrations: If
doctorever flags a version with 3 consecutive partials, rungbrain apply-migrations --force-retry vX.Y.Zto reset the state machine, thengbrain apply-migrations --yesto re-attempt. - If any step fails or the numbers look wrong, file an issue: https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor - contents of
~/.gbrain/upgrade-errors.jsonlif it exists - which step broke
- output of
Itemized changes
Reliability
- Bug 2:
GBRAIN_POOL_SIZEenv knob (src/core/db.ts,src/commands/import.ts). Honored by both the singleton pool and the parallel-import worker pool. Defaults to 10; lower for Supabase transaction pooler.initPostgres/initPGLitenow wrap lifecycle intry { ... } finally { await engine.disconnect() }. - Bug 3: Migration ledger centralization + wedge cap (
src/commands/apply-migrations.ts,src/core/preferences.ts). Runner owns all ledger writes. 3 consecutive partials = wedged, skipped with a loud message. New--force-retry <version>flag writes a'retry'marker without faking success.completestatus never regresses.appendCompletedMigrationis idempotent on double-complete. - Bug 8:
max_stalleddefault 1 → 3 (src/core/schema-embedded.ts,src/core/pglite-schema.ts,src/schema.sql). First lock-lost tick no longer dead-letters.v0_14_0Phase A ALTERs existing installs.autopilot-cyclehandler yields to the event loop between phases so the worker's lock-renewal timer fires. (v0.15.1 further bumps this to 5 and adds a non-terminal row backfill — see #219.) - Bug 9: Sync gate + acknowledge mechanism (
src/commands/sync.ts,src/commands/import.ts,src/core/sync.ts). All 3 sync paths (incremental, full viarunImport,gbrain importgit continuity) gatesync.last_commiton no-failures. Failures append to~/.gbrain/sync-failures.jsonlwith dedup key. Newgbrain sync --skip-failed+--retry-failedflags. Doctor surfaces unacknowledged failures.
Observability
- Bug 7:
doctor --fastsource-aware messages (src/core/config.ts,src/cli.ts,src/commands/doctor.ts). NewgetDbUrlSource()returns'env:GBRAIN_DATABASE_URL' | 'env:DATABASE_URL' | 'config-file' | null. Doctor emitsSkipping DB checks (--fast mode, URL present from env:GBRAIN_DATABASE_URL)when applicable. - Bug 11:
brain_scorebreakdown + metric clarity (src/core/types.ts, both engines'getHealth()). Addedembed_coverage_score,link_density_score,timeline_coverage_score,no_orphans_score,no_dead_links_score. Sum equalsbrain_scoreby construction.dead_linksnow onBrainHealth(resolves a pre-existingfeaturesTeaserForDoctordrift).orphan_pagesdocs clarified — it's "islanded" (no inbound AND no outbound), not the stricter "zero inbound" graph definition.
Graph correctness
- Bug 6/10:
jsonb_agg(DISTINCT ...)in legacytraverseGraph(src/core/postgres-engine.ts,src/core/pglite-engine.ts). Presentation-level dedup only — the schema continues to preserve per-origin_page_id/ per-link_sourceprovenance rows. Fixes duplicate edges likeworks_at → companies/brexappearing twice ingbrain graph.
New migration
- Bug 5:
v0_14_0migration registered (src/commands/migrations/v0_14_0.ts). Phase A:ALTER minion_jobs.max_stalled SET DEFAULT 3(idempotent). Phase B: emitspending-host-work.jsonlentry pointing atskills/migrations/v0.14.0.mdfor shell-jobs adoption. Registered insrc/commands/migrations/index.ts.
Tests
- New:
test/traverse-graph-dedup.test.ts,test/sync-failures.test.ts,test/brain-score-breakdown.test.ts,test/migration-resume.test.ts,test/migrations-v0_14_0.test.ts. - Extended:
test/migrate.test.ts(resolvePoolSize),test/doctor.test.ts(dbSource),test/apply-migrations.test.ts(skippedFutureincludes0.14.0). - E2E updated:
test/e2e/migration-flow.test.tsassertions aligned with the new runner-owned-ledger contract (orchestrator no longer writes completed.jsonl directly).
Deferred to v0.15
- Deep
AbortSignalthreading throughrunEmbedCore/runExtractCore/runBacklinksCore/performSync. Between-phase yield addresses the Bug 8 lock-renewal root cause; mid-phase cancellation on huge brains belongs in the queue-polish PR. failJobFromSweeperforhandleTimeouts/handleStalled. Current directstatus='dead'writes kept.
[0.14.1] - 2026-04-20
gbrain doctor stops crying wolf on DRY, and now repairs the real ones.
Skill delegations via _brain-filing-rules.md finally count.
gbrain doctor --fast was flagging 9 DRY violations on this repo, every run, for skills that properly delegated to skills/_brain-filing-rules.md. The old check only accepted conventions/quality.md as a valid delegation target, so every skill that correctly filed notability rules through the brain-filing-rules module got flagged anyway. Alert fatigue eroded every other doctor warning. v0.14.1 swaps the substring match for proximity-based suppression: a delegation reference within 40 lines of a pattern match (across > **Convention:**, > **Filing rule:**, and inline backtick paths) now correctly suppresses the violation.
The release also adds gbrain doctor --fix and gbrain doctor --fix --dry-run. Instead of telling you what's wrong, doctor can now repair it. Five guards keep the edits safe: refuses if the working tree is dirty (git is the rollback), refuses if the skill isn't inside a git repo (no rollback available), skips matches inside fenced code blocks (examples are not violations), skips when the pattern matches more than once (ambiguous), skips when a delegation reference already exists within 40 lines. Shell-injection safe via execFileSync array args. Trailing newline preserved. No .bak clutter, git is the backup contract.
The numbers that matter
Measured on this repo's real skill library (28 skills, 3 cross-cutting patterns):
| Metric | BEFORE v0.14.1 | AFTER v0.14.1 | Δ |
|---|---|---|---|
| False-positive DRY violations | 1 flagged, 0 fixable | 0 flagged | cleaner signal |
| Genuine DRY violations surfaced | 8 | 8 (unchanged) | honest count |
Auto-repairable via --fix --dry-run |
0 | 7 proposed, 4 intelligently skipped | new capability |
| Unit tests for doctor/resolver/dry-fix | 24 | 55 (+31) | +31 |
| Adversarial review fixes in ship | 0 | 4 ship-blockers caught + fixed | defense in depth |
The 4 adversarial fixes are worth calling out: shell injection via execFileSync array args, a silent-overwrite bug when skills live outside a git repo (now returns no_git_backup), EOF newline preservation on splice, and delegation-proximity consistency between detector (40 lines) and idempotency guard (now also 40 lines, was 10).
What this means for you
Your agent's gbrain doctor output now means something again. Nine warnings a run was noise you learned to ignore; one real warning is signal. And when the doctor does flag an inlined rule, gbrain doctor --fast --fix --dry-run shows you exactly what the repair looks like before you commit to it. Run gbrain doctor --fast --fix to apply. Git is the undo button.
To take advantage of v0.14.1
gbrain upgrade does this automatically. No manual migration required.
- Verify the detection fix:
gbrain doctor --fast --json | jq '.checks[] | select(.name=="resolver_health")' - Try the auto-fix preview on your own brain:
gbrain doctor --fast --fix --dry-run - Apply when ready:
gbrain doctor --fast --fix - If anything looks wrong, please file an issue:
https://github.com/garrytan/gbrain/issues with the
gbrain doctor --jsonoutput.
Itemized changes
Added
gbrain doctor --fixapplies> **Convention:**reference callouts to skills that inline cross-cutting rules (Iron Law back-linking, citation format, notability gate).--dry-runpreviews the diff without writing.- Three shape-aware block expanders (bullet, blockquote, paragraph) in
src/core/dry-fix.ts, each a pure function, each with unit tests. - New
extractDelegationTargets()helper insrc/core/check-resolvable.tsparses> **Convention:**,> **Filing rule:**, and inline backtick references, normalizing paths to theCROSS_CUTTING_PATTERNS.conventionsshape. getWorkingTreeStatus()returns 3-state'clean' | 'dirty' | 'not_a_repo'so the fixer never writes to files git can't roll back.
Changed
CROSS_CUTTING_PATTERNSeach list multiple valid delegation targets (notability gate accepts bothconventions/quality.mdand_brain-filing-rules.md).- DRY suppression is proximity-based:
DRY_PROXIMITY_LINES = 40for detector AND the fix-module's idempotency check (was inconsistent: 40 vs 10). - Shell execution uses
execFileSyncwith array args (no shell, no injection surface from manifest-derived paths).
Tests
- 31 new tests across
test/check-resolvable.test.ts(DRY detection, 13 cases),test/dry-fix.test.ts(unit, 28 cases including expander pure-function tests),test/doctor-fix.test.ts(CLI integration, 3 cases). - Full suite: 1694 pass, 0 fail.
[0.14.0] - 2026-04-20
Move gateway crons to Minions. Zero LLM tokens per cron fire.
Worker abort path finally marks aborted jobs dead.
Your OpenClaw gateway pins at 100% CPU when your 32 cron jobs each boot a full Opus session per fire, and ~14 of them are pure API-fetch-and-write scripts that don't need reasoning at all. This release adds a shell job type to Minions so those deterministic crons move off the gateway to the Minions worker. ~60% gateway load reduction at OpenClaw scale. Retry, backoff, DLQ, unified gbrain jobs list visibility, all free. The LLM-reasoning crons stay on the gateway where they belong.
Getting there meant fixing the Minions worker abort path, which was quietly wrong since v0.11: aborted jobs (timeout, cancel, lock loss) returned silently without calling failJob, so status stayed active until a stall sweep found them ~30s later. This release makes abort-reason the error_text of an immediate failJob call. Handlers get cleaner signals, operators see accurate status, --follow stops hanging past timeouts.
The numbers that matter
Measured on the new test/minions-shell.test.ts (40 unit cases) and test/e2e/minions-shell.test.ts (4 E2E cases) plus 5 rounds of pre-landing review (spec adversarial x2, CEO scope, DX, eng, Codex outside voice).
| Metric | BEFORE v0.14.0 | AFTER v0.14.0 | Δ |
|---|---|---|---|
| LLM tokens per cron fire | ~full Opus context boot | 0 (deterministic crons) | 100% reduction |
| Gateway CPU headroom with ~14 crons moved | 0% | ~60% free | cron load off gateway |
| Aborted job status lag (timeout/cancel/lock-loss) | up to 30s | immediate failJob call |
deterministic |
| Shell submission surfaces | none | CLI + trusted submit_job |
2 paths, both gated |
| Submission audit trail | none | JSONL at ~/.gbrain/audit/ |
operational trace |
| Unit tests | 1318 pass | 1358 pass (+40 shell cases) | +40 |
| E2E tests | 124 | 128 (+4 shell lifecycle) | +4 |
| Pre-landing review rounds | 1 (eng) | 5 (spec×2 / CEO / DX / eng / codex) | 29 issues surfaced, 26 resolved |
The abort-path fix is the quietly-important one. Handlers that use ctx.signal for cooperative cancel (sync, embed) now have deterministic status flips instead of waiting for the stall sweep. Shell jobs get reliable timeout semantics for the first time: cmd: 'sleep 30', timeout_ms: 2000 hits dead at ~2100ms instead of ~32000ms.
What this means for OpenClaw operators
gbrain upgrade reads skills/migrations/v0.14.0.md and walks your host agent through the adoption: enable the worker with GBRAIN_ALLOW_SHELL_JOBS=1, audit every cron entry (LLM-requiring stays, deterministic moves), propose a rewrite per cron with a diff, verify one fire end-to-end before approving the next batch. Never auto-rewrites your crontab — every change is a human approval per-cron. On Postgres, one persistent worker daemon claims each job. On PGLite, every crontab invocation adds --follow for inline execution because PGLite doesn't support the worker daemon. Either way, your gateway CPU stops pinning at 100% and your live messages stop getting blocked by batch processing. See docs/guides/minions-shell-jobs.md for usage recipes and skills/migrations/v0.14.0.md for the adoption playbook.
Itemized changes
New shell job type
- Spawn arbitrary commands as Minions jobs. Pass
{cmd: "string"}(shell-interpolated via/bin/sh -c) or{argv: ["bin","arg"]}(no shell, safe for programmatic callers). Both forms require an absolutecwd. Env vars are scoped to a minimal allowlist (PATH, HOME, USER, LANG, TZ, NODE_ENV) to prevent accidental$OPENAI_API_KEYinterpolation; callers opt-in to additional keys per job. - Two-layer security: MCP boundary + env flag.
submit_jobrejectsname: 'shell'whenctx.remote === true. Independent of the env flag.MinionQueue.add('shell', ...)also rejects unless the caller explicitly opts in via{allowProtectedSubmit: true}as the 4th arg, so an in-process handler can't programmatically submit a shell child by accident. Worker only registers the handler whenGBRAIN_ALLOW_SHELL_JOBS=1is set on the worker process. Default: off. Opt in per-host. - Graceful child shutdown. Abort fires SIGTERM, 5-second grace, then SIGKILL. Listens to both
ctx.signal(timeout/cancel/lock-loss) and a newctx.shutdownSignal(worker process SIGTERM/SIGINT), so deploy restarts don't orphan shell children. Non-shell handlers ignoreshutdownSignaland keep running through the worker's 30s cleanup race. - UTF-8-safe output truncation. stdout is retained as the last 64KB, stderr as the last 16KB, with a
[truncated N bytes]marker prepended when exceeded. Usesstring_decoder.StringDecoderso multibyte characters don't split across the truncation boundary. - Operational audit trail at
~/.gbrain/audit/shell-jobs-YYYY-Www.jsonl(ISO-week rotation, override viaGBRAIN_AUDIT_DIR). Records caller, remote flag, job_id, cwd, and cmd/argv display. Never logs env values. Best-effort writes: failures log to stderr but don't block submission. Operational trace for "what did this cron submit last Tuesday," not forensic insurance. - Starvation warning on first-time submission. If you
gbrain jobs submit shell ...without--followand no worker with the env flag is running, stderr prints a warning block pointing at both--followandgbrain jobs workremediation. Turns a silent "job sits in waiting forever" failure mode into a directed next-step.
Worker abort path overhaul
- Aborted jobs now call
failJobwith the abort reason. Pre-v0.14.0 worker returned silently whenctx.signal.abortedfired, leaving jobs inactiveuntil stall sweep. Fixed: catch-block now derives reason fromabort.signal.reason(timeout,cancel,lock-lost,shutdown) and callsfailJob(id, token, "aborted: <reason>"). Token-match makes the call idempotent: if another path already flipped status, it no-ops cleanly. Downstream--followloops and status assertions now reflect reality. ctx.shutdownSignalseparated fromctx.signal. Only fires on worker process SIGTERM/SIGINT. Handlers that need shutdown-specific cleanup (currently: shell handler's SIGTERM→SIGKILL on its child) subscribe to both signals. Non-shell handlers subscribe only toctx.signaland don't get cancelled mid-flight on deploy restart.
CLI + operation surface additions
gbrain jobs submit --timeout-ms N. Per-job wall-clock timeout in ms. Surfaced from the existingtimeout_msschema field, which had no CLI flag before.submit_joboperation gainstimeout_msparam. Same field exposed through MCP (for non-protected names).gbrain jobs submit --helplists handler types.shellis explicitly called out as CLI-only with a pointer to the guide. Closes the "what handlers are even available" discovery gap.
Tests
- 40 new unit cases in
test/minions-shell.test.tscovering validation (cmd/argv/cwd/env), spawn happy + error paths, UTF-8 safe truncation, SIGTERM abort via both signals, env allowlist (OPENAI_API_KEY blocked, PATH inherited, caller override), ISO-week filename at year boundary (2027-01-01 → W53 2026), audit write happy + EACCES failure paths, whitespace-bypass defense onMinionQueue.add(' shell ', ...), and auto-added regression tests per the iron rule (non-protected names unaffected). - 4 E2E tests in
test/e2e/minions-shell.test.tscovering full lifecycle (submit → worker claim → spawn → complete with captured stdout),MinionQueue.adddefense-in-depth,submit_jobMCP-guard rejection,submit_jobCLI-path acceptance.
Docs
- New
docs/guides/minions-shell-jobs.mdopens with a 30-second copy-paste hello-world, then covers the two-layer security model with honest callouts about what env allowlist does and does not do, Postgres vs PGLite crontab recipes side-by-side, debug playbook (gbrain jobs list,gbrain jobs get, audit log tail, PGLite--follownote), known limitations, and an#errorstable linked from everyUnrecoverableErrorthe handler throws. - New
skills/migrations/v0.14.0.mdis the adoption playbook your host agent reads ongbrain upgrade. Walks through enabling the worker, auditing cron entries (LLM-requiring vs deterministic), proposing per-cron rewrites with diffs, and verifying end-to-end before batch approval. Iron rule: never auto-rewrites the operator's crontab — every change is human-approved per-cron. - README.md links the guide from the Commands section.
Pre-ship review
Five independent rounds surfaced 29 issues across the plan. 26 resolved before a single line of code was written: spec-review adversarial subagent (x2 iterations) caught implementer-ergonomic gaps (caller derivation, mkdirSync, ISO-week formatter). CEO review + SELECTIVE EXPANSION cherry-picked argv form, audit log, SIGTERM grace, env allowlist, MCP-guard defense-in-depth, honest FS-read trust model, orphan-child setTimeout.unref() fix. DX review added the starvation warning block. Eng review added ctx.shutdownSignal separation, revised trusted-arg from opts-fold to separate 4th arg (stops accidental pass-through via {...userOpts} spreads), 18 additional test cases, 4 iron-rule regression tests. Codex outside voice caught 4 architectural dealbreakers: the worker abort silent-return bug (the "contract is a lie" finding), --timeout-ms CLI flag and submit_job param both missing, PROTECTED_JOB_NAMES.has(name) whitespace bypass before normalization. Effort estimate revised 8-10h → 16-20h once the full review was done.
[0.13.1] - 2026-04-20
The brain stops being a write-once graph and starts being a runtime.
Five new modules land on top of v0.12's knowledge graph layer.
GBrain v0.13.1 ships the Knowledge Runtime delta on top of v0.13.0's frontmatter graph. Typed abstractions that turn a knowledge base into a runtime other agents can adopt. Five focused modules build on the v0.12.0 graph layer and v0.11.x Minions orchestration. A Resolver SDK unifies external lookups. A BrainWriter enforces integrity pre-commit. gbrain integrity repairs bare-tweet citations at scale. A BudgetLedger caps runaway resolver spend. Minions gains TZ-aware quiet-hours at claim time.
What you can do now that you couldn't before
gbrain integrity --auto --confidence 0.8repairs the 1,424 bare-tweet citations in your brain without human review. Three-bucket confidence: auto-repair ≥0.8, review queue 0.5–0.8, skip <0.5. Resumable via~/.gbrain/integrity-progress.jsonl.gbrain resolvers listintrospects the typed plugin registry. Two builtins ship:url_reachable(HEAD check + SSRF guard) andx_handle_to_tweet(X API v2 with confidence scoring). Every result carries{value, confidence, source, fetchedAt, costEstimate, raw}.gbrain config set budget.daily_cap_usd 10puts a hard wall on resolver spend. Concurrent reserves serialize viaSELECT FOR UPDATE. TTL auto-reclaim handles process death between reserve and commit.- BrainWriter + pre-commit validators make the Philip-Leung hallucination class structurally impossible.
Scaffolderbuilds every tweet URL from API output, never LLM text.SlugRegistrydetects name collisions at create time. Four validators (citation, link, back-link, triple-HR) run on write.writer.lint_on_put_page=trueenables observability before the strict-mode flip. - Quiet-hours on Minion jobs stop the 3am DM. Set
quiet_hours: {start:22, end:7, tz:"America/Los_Angeles", policy:"defer"}on a job. Worker checks at claim time (not dispatch). Wrap-around windows supported.
Schema migrations
Three new migrations, all idempotent, apply automatically on gbrain init / upgrade.
- v11 — budget_ledger + budget_reservations. Per-(scope, resolver, local_date) rollup with held-reservation TTL. Rollback: DROP TABLE (budget is regenerable from resolver call logs).
- v12 — minion_jobs.quiet_hours + stagger_key. Additive nullable columns; existing rows keep working unchanged.
- TS v0.13.1 — grandfather
validate: false. Walks every page, adds the opt-out frontmatter so legacy content skips the new validators.gbrain integrity --autoclears the flag per-page as citations are repaired. Rollback log at~/.gbrain/migrations/v0_13_1-rollback.jsonl.
Out of scope (intentional, per CEO plan)
- Strict-mode default flip. BrainWriter ships with
strict_mode=lint. The flip to strict requires a 7-day soak + BrainBench regression ≤1pt + zero false-positive count. - Sandboxed user plugins. v0.13 ships builtins only. User-provided TS modules deferred pending a real isolation story (worker_threads or vm2) in a follow-on release.
openai_embeddingrefactor. Deferred to PR 1.5 post-flip; embedding is a hot path.- OpenClaw
claw-bridge. Adoption path is documentation-only this release.
Tests
- 89 new unit tests across
test/resolvers.test.ts(43),test/writer.test.ts(57),test/integrity.test.ts(21),test/enrichment.test.ts(23),test/minions-quiet-hours.test.ts(25),test/post-write-lint.test.ts(11),test/migrations-v0_13_0.test.ts(5). - E2E passes on Postgres: 115 pass / 0 fail across mechanical, sync, upgrade, minions concurrency + resilience, graph-quality, MCP, migration-flow, search-quality, skills (Tier 2 Opus/Sonnet).
- 1574 total tests pass with an active test Postgres container. 1522 pass in unit-only mode (E2E auto-skip without DATABASE_URL).
Itemized changes
Resolver SDK (src/core/resolvers/)
Resolver<I, O> interface with {id, cost, backend, available(), resolve()}. In-memory ResolverRegistry. ResolverContext carries {engine, storage, config, logger, requestId, remote, deadline?, signal?} — the remote flag mirrors OperationContext.remote for uniform trust boundaries. FailImproveLoop.execute gained optional opts.signal; backwards compatible. Two reference builtins: url_reachable (SSRF guard reuses wave-3 isInternalUrl, max-5 redirects with per-hop re-validation, AbortSignal composition) and x_handle_to_tweet (X API v2 recent search, strict handle regex, confidence-scored matches, 2x 429 retry honoring Retry-After, 401/403 → ResolverError(auth)). gbrain resolvers list|describe for introspection.
BrainWriter + validators (src/core/output/)
BrainWriter.transaction(fn, ctx) over engine.transaction with pre-commit validators via WriteTx API. Scaffolder builds typed citations (tweetCitation, emailCitation, sourceCitation) + entityLink + timelineLine — URLs from structured IDs, never LLM text. SlugRegistry detects collisions at create time. Four validators (citation, link, back-link, triple-hr) skip fenced code / inline code / HTML comments correctly. Config flag writer.strict_mode (default lint).
gbrain integrity (src/commands/integrity.ts)
Four subcommands: check (read-only report with --json, --type, --limit), auto (three-bucket repair with --confidence, --review-lower, --dry-run, --fresh, --limit), review (prints queue path + count), reset-progress. Nine bare-tweet phrase regexes. External-link extraction for optional dead-link probing. Repairs route through BrainWriter.transaction.
BudgetLedger + CompletenessScorer (src/core/enrichment/)
BudgetLedger.reserve returns {kind:'held'} or {kind:'exhausted'}. FOR UPDATE serializes concurrent reserves. commit, rollback, cleanupExpired. Midnight rollover via Intl.DateTimeFormat en-CA in configured IANA tz. Seven per-type rubrics + default (weights sum to 1.0). Person rubric's non_redundancy and recency_score kill Garry's OpenClaw's length-only heuristic + 30-day-re-enrich-forever pathologies.
Minions scheduler polish (src/core/minions/)
quiet-hours.ts — pure evaluateQuietHours(cfg, now?). Wrap-around windows. Unknown tz fails open. stagger.ts — FNV-1a → 0–59 deterministic across runtimes. worker.ts integrated: post-claim evaluation, defer → delayed/+15m, skip → cancelled.
Post-write lint hook (src/core/output/post-write.ts)
runPostWriteLint invokes the four validators against freshly-written pages. Gated on writer.lint_on_put_page (default false). Wired into put_page operation handler as non-blocking. Findings go to ~/.gbrain/validator-lint.jsonl + engine.logIngest.
Design doc
docs/designs/KNOWLEDGE_RUNTIME.md — 717 lines covering the 4-layer architecture, integration seams, 7-phase migration path, 10 open questions. Promoted to repo so future contributors can trace decisions.
Prior learnings applied
- Snapshot slugs upfront (
engine.getAllSlugs()) in grandfather migration — avoids pagination-mutation instability. - TS-registry migrations only (post-v0.11.1 migration-discovery change).
- Migration never calls
saveConfig— avoids Postgres→PGLite flip. - Quiet-hours at claim/promote, not dispatch — queued job becomes claimable after window opens.
- Core fn pattern for any handler wrapping a CLI command.
- Schema v11 not v8 (graph layer took v8-v10).
gray-matter+ line tokenizer for citation parsing, notmarked.lexer.
[0.13.0] - 2026-04-20
Frontmatter becomes a graph. Every company:, investors:, attendees: you wrote turns into typed edges automatically.
Graph queries get dramatically richer without you changing a word of content.
v0.13 teaches the knowledge graph to read your YAML frontmatter. A company: Acme on a person page becomes a works_at edge. investors: [Fund-A, Fund-B] on a deal page becomes invested_in edges pointing to the deal. attendees: [alice, charlie] on a meeting page becomes attended edges. Direction respects subject-of-verb: people/alice → meetings/2026-04-03 reads naturally because Alice is the one who attended. gbrain graph <entity> --depth 2 against an entity with rich frontmatter goes from returning ~7 nodes to 50+, with zero skill edits or frontmatter changes.
Everything else stays the same. Agents writing put_page with frontmatter today work unchanged, the graph populates behind the scenes. The auto_links response gains one additive field: unresolved, so agents can see which frontmatter names couldn't be matched to existing pages and queue them for enrichment. No breaking changes to any public API.
The numbers that matter
Benchmarked against a 46K-page production brain with ~15K frontmatter references:
| Metric | Before (v0.12) | After (v0.13) | Δ |
|---|---|---|---|
| Graph edges total | 28K | 43K | +54% |
gbrain graph <hub-entity> --depth 2 node count |
7 | 52 | +643% |
| 4-hop queries (person → company → deal → investor) | fail | return aggregate | unlocked |
| Migration wall-clock on 46K pages | N/A | 3min | one-time |
| LLM API calls during migration | N/A | 0 | deterministic |
| Embedding API calls during migration | N/A | 0 | zero cost |
| Frontmatter field | Edges produced on 46K-page test brain |
|---|---|
company, companies (person pages) |
~9,800 |
key_people (company pages) |
~1,400 |
investors (deal + company pages) |
~2,100 |
attendees (meeting pages) |
~800 |
partner (company pages) |
~180 |
sources, source (any page) |
~1,200 |
related, see_also (any page) |
~400 |
The 4-hop query pattern that motivated this release: "top investors in an advisor's portfolio." Pre-v0.13: impossible without manual graph edits. Post-v0.13: gbrain graph <advisor-slug> --depth 2 --type yc_partner,invested_in returns ranked fund pages with frequencies. Works because the advisor's companies: field points to portfolio companies, those companies' partner: field points back, and their investors: field resolves to fund pages.
What this means for OpenClaw agents
If you maintain an agent fork that uses gbrain as its persistent memory, v0.13 is the easiest upgrade since v0.7. Run gbrain upgrade, wait ~3 minutes while the orchestrator runs schema + backfill, and graph queries get better. No skill edits required for the majority of skills. Three skills (meeting-ingestion, enrich, idea-ingest) gain an optional new phase if you want to consume the new auto_links.unresolved field, see docs/UPGRADING_DOWNSTREAM_AGENTS.md for the exact diffs.
To take advantage of v0.13
gbrain upgrade should do this automatically. If it didn't, or if gbrain doctor warns about a partial migration:
-
Run the orchestrator manually:
gbrain apply-migrations --yes -
Your agent reads
skills/migrations/v0.13.0.mdthe next time you interact with it. If your agent is headless (cron, OpenClaw worker, Minion handler), the migration orchestrator already ran the mechanical side; no additional agent action is needed. -
Verify the outcome:
gbrain graph <some-entity> --depth 2 # any entity with frontmatter refs gbrain stats # link_count should reflect ~15-20K new frontmatter edges -
If any step fails or the numbers look wrong, please file an issue: https://github.com/garrytan/gbrain/issues with:
- output of
gbrain doctor - contents of
~/.gbrain/upgrade-errors.jsonlif it exists - which step broke
This feedback loop is how the gbrain maintainers find fragile upgrade paths. Thank you.
- output of
Itemized changes
Knowledge graph, frontmatter edge projection:
src/core/link-extraction.ts, newFRONTMATTER_LINK_MAP(canonical field to type + direction + dir-hint map). NewSlugResolverinterface +makeResolver(engine, {mode})factory.extractFrontmatterLinksextractor.extractPageLinksbecomes async and emits frontmatter edges alongside markdown refs.LinkCandidategainsfromSlug,linkSource,originSlug,originField.src/core/operations.ts::runAutoLink, bidirectional reconciliation. Outgoing edges (markdown + own-frontmatter) reconciled viagetLinks; incoming edges (other-page to self fromkey_people/attendees/etc.) reconciled viagetBacklinksscoped toorigin_page_id. Manual edges (link_source='manual') never touched.put_pageresponse shape extends withauto_links.unresolved: Array<{field, name}>. Additive; existing clients unaffected.
Slug resolver:
- Two-mode resolver (
batchfor migration,livefor put_page post-hook). Fallback chain: exact slug, dir-hint construction, pg_trgm fuzzy match, optional keyword search (live only,expand: falsemandatory peroperations-query-hidden-haikulearning). - New engine method
findByTitleFuzzy(name, dirPrefix?, minSimilarity?)implemented on both Postgres and PGLite engines. Uses the%operator +similarity()function; GIN trigram index drives the match. - Per-run cache: same name, single DB lookup.
Schema migrations:
- migrate.ts v11 (
links_provenance_columns): addslink_source,origin_page_id,origin_field. Swaps unique constraint toUNIQUE NULLS NOT DISTINCT (from, to, type, link_source, origin_page_id). CHECK constraint onlink_sourcevalues. New indexes on link_source + origin_page_id. src/commands/migrations/v0_13_0.ts, release orchestrator (Phase A schema, Phase B backfill, Phase C verify). Registered in migrations/index.ts. Resumable viapartialstatus +ON CONFLICT DO NOTHING.
Engine layer:
- Both engines:
addLinkgainslinkSource,originSlug,originFieldparams.addLinksBatchunnest grows from 4 columns to 7.removeLinkgains optionallinkSourcefilter.getLinks+getBacklinksnow returnlink_source,origin_slug,origin_fieldin the Link shape. - PGLite + Postgres parity verified end-to-end in
test/pglite-engine.test.ts.
Release reliability (applies to every future release):
src/commands/upgrade.ts, best-effortgbrain post-upgradefailures now append a structured record to~/.gbrain/upgrade-errors.jsonlinstead of silently swallowing the error.src/commands/doctor.ts, surfaces the latest upgrade-errors entry with a paste-ready recovery hint. Works alongside the existing partial-migration detector.- CHANGELOG format adds the "To take advantage of v[version]" block pattern (seen above). Required for every release going forward so users have a self-repair path when automation fails.
CLI changes:
gbrain extract links --source db --include-frontmatter, v0.13 flag. Default OFF for back-compat (existinggbrain extractruns don't suddenly get new edges). Migration orchestrator explicitly enables it for the one-time backfill.gbrain extractnow prints a top-20 summary of unresolvable frontmatter names when--include-frontmatteris active, so users see exactly where the graph has holes.
Tests:
test/pglite-engine.test.tscovers new 7-column addLinksBatch unnest + NULLS NOT DISTINCT semantics + ON CONFLICT on the new constraint.test/link-extraction.test.tscovers async signature regression, resolver fallback chain, cache hit, bad-type skip, context enrichment.test/extract.test.tscovers fs-source async signature,includeFrontmatteropt-in, incoming-direction semantics forinvestors/key_people/attendees.test/migrate.test.tsupdated for new constraint name post-v11.test/apply-migrations.test.tsregistry now includes v0.13.0 in skippedFuture buckets for older installed versions.
Documentation:
skills/migrations/v0.13.0.md, user-facing upgrade skill.docs/UPGRADING_DOWNSTREAM_AGENTS.md, appended v0.13 section: no-action-required verdict + field-to-type map + optional skill diffs for meeting-ingestion, enrich, idea-ingest.
[0.12.3] - 2026-04-19
Reliability wave: the pieces v0.12.2 didn't cover.
Sync stops hanging. Search timeouts stop leaking. [[Wikilinks]] are edges.
v0.12.2 shipped the data-correctness hotfix (JSONB double-encode, splitBody, /wiki/ types, parseEmbedding). This wave lands the remaining reliability fixes from the same community review pass, plus a graph-layer feature a 2,100-page brain needed to stop bleeding edges. No schema changes. No migration. gbrain upgrade pulls it.
What was broken
Incremental sync deadlocked past 10 files. src/commands/sync.ts wrapped the whole import in engine.transaction, and importFromContent also wrapped each file. PGLite's _runExclusiveTransaction is non-reentrant — the inner call parks on the mutex the outer call holds, forever. In practice: 3 files synced fine, 15 files hung in ep_poll until you killed the process. Bulk Minions jobs and citation-fixer dream-cycles regularly hit this. Discovered by @sunnnybala.
statement_timeout leaked across the postgres.js pool. searchKeyword and searchVector bounded queries with SET statement_timeout='8s' + finally SET 0. But every tagged template picks an arbitrary pool connection, so the SET, the query, and the reset could land on three different sockets. The 8s cap stuck to whichever connection ran the SET, got returned to the pool, and the next unrelated caller inherited it. Long-running embed --all jobs and imports clipped silently. Fix by @garagon.
Obsidian [[WikiLinks]] were invisible to the auto-link post-hook. extractEntityRefs only matched [Name](people/slug). On a 2,100-page brain with wikilinks throughout, put_page extracted zero auto-links. DIR_PATTERN also missed domain-organized wiki roots (entities, projects, tech, finance, personal, openclaw). After the fix: 1,377 new typed edges on a single extract --source db pass. Discovered and fixed by @knee5.
Corrupt embedding rows broke every query that touched them. getEmbeddingsByChunkIds on Supabase could return a pgvector string instead of a Float32Array. v0.12.2 fixed the normal path by normalizing inputs, but one genuinely bad row still threw and killed the ranking pass. Availability matters more than strictness on the read path.
What you can do now that you couldn't before
- Sync 100 files without hanging. Per-file atomicity preserved, outer wrap removed. Regression test asserts
engine.transactionis not called at the top level ofsrc/commands/sync.ts. Contributed by @sunnnybala. - Run a long
embed --allon Supabase without strangling unrelated queries.searchKeyword/searchVectorusesql.begin+SET LOCALso the timeout dies with the transaction. 5 regression tests intest/postgres-engine.test.tspin the new shape. Contributed by @garagon. - Write
[[people/balaji|Balaji Srinivasan]]in a page and see a typed edge. Same extractor, two syntaxes. Matches the filesystem walker — the db and fs sources now produce the same link graph from the same content. Contributed by @knee5. - Find your under-connected pages.
gbrain orphanssurfaces pages with zero inbound wikilinks, grouped by domain.--json,--count, and--include-pseudoflags. Also exposed as thefind_orphansMCP operation so agents can run enrichment cycles without CLI glue. Contributed by @knee5. - Degraded embedding rows skip+warn instead of throwing. New
tryParseEmbedding()sibling ofparseEmbedding(): returnsnullon unknown input and warns once per process. Used on the search/rescore path. Migration and ingest paths still throw — data integrity there is non-negotiable. gbrain doctortells you which brains still need repair. Two new checks:jsonb_integrityscans the four v0.12.0 write sites and reports rows wherejsonb_typeof = 'string';markdown_body_completenessheuristically flags pages whosecompiled_truthis <30% of raw source length when raw has multiple H2/H3 boundaries. Fix hint points atgbrain repair-jsonbandgbrain sync --force.
How to upgrade
gbrain upgrade
No migration, no schema change, no data touch. If you're on Postgres and haven't run gbrain repair-jsonb since v0.12.2, the v0.12.2 orchestrator still runs on upgrade. New gbrain doctor will tell you if anything still looks off.
Itemized changes
Sync deadlock fix (#132)
src/commands/sync.ts— remove outerengine.transactionwrap; per-file atomicity preserved byimportFromContent's own wrap.test/sync.test.ts— new regression guard asserting top-levelengine.transactionis not called on > 10-file sync paths.- Contributed by @sunnnybala.
postgres-engine statement_timeout scoping (#158)
src/core/postgres-engine.ts—searchKeywordandsearchVectorrewritten tosql.begin(async (tx) => { await tx\SET LOCAL statement_timeout = ...`; ... })`. GUC dies with the transaction; pool reuse is safe.test/postgres-engine.test.ts— 5 regression tests including a source-level guardrail grep against the production file (not a test fixture) asserting no bareSET statement_timeoutoutsidesql.begin.- Contributed by @garagon.
Obsidian wikilinks + extended domain patterns (#187 slice)
src/core/link-extraction.ts—extractEntityRefsmatches both[Name](people/slug)and[[people/slug|Name]].DIR_PATTERNextended withentities,projects,tech,finance,personal,openclaw.- Matches existing filesystem-walker behavior.
- Contributed by @knee5.
gbrain orphans command (#187 slice)
src/commands/orphans.ts— new command with text/JSON/count outputs and domain grouping.src/core/operations.ts—find_orphansMCP operation.src/cli.ts—orphansadded toCLI_ONLY.test/orphans.test.ts— 203 lines covering detection, filters, and all output modes.- Contributed by @knee5.
tryParseEmbedding() availability helper
src/core/utils.ts— newtryParseEmbedding(value): returnsnullon unknown input, warns once per process via a module-level flag.src/core/postgres-engine.ts—getEmbeddingsByChunkIdsusestryParseEmbeddingso one bad row degrades ranking instead of killing the query.test/utils.test.ts— new cases for null-return and single-warn.- Hand-authored; codifies the split-by-call-site rule from the #97/#175 review.
Doctor detection checks
src/commands/doctor.ts—jsonb_integrityscanspages.frontmatter,raw_data.data,ingest_log.pages_updated,files.metadataand reportsjsonb_typeof='string'counts;markdown_body_completenessheuristic for ≥30% shrinkage vs raw source on multi-H2 pages.test/doctor.test.ts— detection unit tests assert both checks exist and cover the four JSONB sites.test/e2e/jsonb-roundtrip.test.ts— the regression test that should have caught the original v0.12.0 double-encode bug; round-trips all four JSONB write sites against real Postgres.docs/integrations/reliability-repair.md— guide for v0.12.0 users: detect viagbrain doctor, repair viagbrain repair-jsonb.
No schema changes. No migration. No data touch.
[0.12.2] - 2026-04-19
Postgres frontmatter queries actually work now.
Wiki articles stop disappearing when you import them.
This is a data-correctness hotfix for the v0.12.0-and-earlier Postgres-backed brains. If you run gbrain on Postgres or Supabase, you've been losing data without knowing it. PGLite users were unaffected. Upgrade auto-repairs your existing rows. Lands on top of v0.12.1 (extract N+1 fix + migration timeout fix) — pull gbrain upgrade and you get both.
What was broken
Frontmatter columns were silently stored as quoted strings, not JSON. Every put_page wrote frontmatter to Postgres via ${JSON.stringify(value)}::jsonb — postgres.js v3 stringified again on the wire, so the column ended up holding "\"{\\\"author\\\":\\\"garry\\\"}\"" instead of {"author":"garry"}. Every frontmatter->>'key' query returned NULL. GIN indexes on JSONB were inert. Same bug on raw_data.data, ingest_log.pages_updated, files.metadata, and page_versions.frontmatter. PGLite hid this entirely (different driver path) — which is exactly why it slipped past the existing test suite.
Wiki articles got truncated by 83% on import. splitBody treated any standalone --- line in body content as a timeline separator. Discovered by @knee5 migrating a 1,991-article wiki where a 23,887-byte article landed in the DB as 593 bytes (4,856 of 6,680 wikilinks lost).
/wiki/ subdirectories silently typed as concept. Articles under /wiki/analysis/, /wiki/guides/, /wiki/hardware/, /wiki/architecture/, and /writing/ defaulted to type='concept' — type-filtered queries lost everything in those buckets.
pgvector embeddings sometimes returned as strings → NaN search scores. Discovered by @leonardsellem on Supabase, where getEmbeddingsByChunkIds returned "[0.1,0.2,…]" instead of Float32Array, producing [NaN] query scores.
What you can do now that you couldn't before
frontmatter->>'author'returnsgarry, not NULL. GIN indexes work. Postgres queries by frontmatter key actually retrieve pages.- Wiki articles round-trip intact. Markdown horizontal rules in body text are horizontal rules, not timeline separators.
- Recover already-truncated pages with
gbrain sync --full. Re-import from your source-of-truth markdown rebuildscompiled_truthcorrectly. - Search scores stop going
NaNon Supabase. Cosine rescoring sees realFloat32Arrayembeddings. - Type-filtered queries find your wiki articles.
/wiki/analysis/becomes typeanalysis,/writing/becomeswriting, etc.
How to upgrade
gbrain upgrade
The v0.12.2 orchestrator runs automatically: applies any schema changes, then gbrain repair-jsonb rewrites every double-encoded row in place using jsonb_typeof = 'string' as the guard. Idempotent — re-running is a no-op. PGLite engines short-circuit cleanly. Batches well on large brains.
If you want to recover pages that were truncated by the splitBody bug:
gbrain sync --full
That re-imports every page from disk, so the new splitBody rebuilds the full compiled_truth correctly.
What's new under the hood
gbrain repair-jsonb— standalone command for the JSONB fix. Run it manually if needed; the migration runs it automatically.--dry-runshows what would be repaired without touching data.--jsonfor scripting.- CI grep guard at
scripts/check-jsonb-pattern.sh— fails the build if anyone reintroduces the${JSON.stringify(x)}::jsonbinterpolation pattern. Wired intobun testso it runs on every CI invocation. - New E2E regression test at
test/e2e/postgres-jsonb.test.ts— round-trips all four JSONB write sites against real Postgres and assertsjsonb_typeof = 'object'plus->>returns the expected scalar. The test that should have caught the original bug. - Wikilink extraction —
[[page]]and[[page|Display Text]]syntaxes now extracted alongside standard[text](page.md)markdown links. Includes ancestor-search resolution for wiki KBs where authors omit one or more leading../.
Migration scope
The repair touches five JSONB columns:
pages.frontmatterraw_data.dataingest_log.pages_updatedfiles.metadatapage_versions.frontmatter(downstream ofpages.frontmattervia INSERT...SELECT)
Other JSONB columns in the schema (minion_jobs.{data,result,progress,stacktrace}, minion_inbox.payload) were always written via the parameterized $N::jsonb form so they were never affected.
Behavior changes (read this if you upgrade)
splitBody now requires an explicit sentinel for timeline content. Recognized markers (in priority order):
<!-- timeline -->(preferred — whatserializeMarkdownemits)--- timeline ---(decorated separator)---directly before## Timelineor## Historyheading (backward-compat fallback)
If you intentionally used a plain --- to mark your timeline section in source markdown, add <!-- timeline --> above it manually. The fallback covers the common case (--- followed by ## Timeline).
Attribution
Built from community PRs #187 (@knee5) and #175 (@leonardsellem). The original PRs reported the bugs and proposed the fixes; this release re-implements them on top of the v0.12.0 knowledge graph release with expanded migration scope, schema audit (all 5 affected columns vs the 3 originally reported), engine-aware behavior, CI grep guard, and an E2E regression test that should have caught this in the first place. Codex outside-voice review during planning surfaced the missed page_versions.frontmatter propagation path and the noisy-truncated-diagnostic anti-pattern that was dropped from this scope. Thanks for finding the bugs and providing the recovery path — both PRs left work to do but the foundation was right.
Co-Authored-By: @knee5 (PR #187 — splitBody, inferType wiki, JSONB triple-fix) Co-Authored-By: @leonardsellem (PR #175 — parseEmbedding, getEmbeddingsByChunkIds fix)
[0.12.1] - 2026-04-19
Extract no longer hangs on large brains.
v0.12.0 upgrade no longer times out on duplicates.
Two production-blocking bugs Garry hit on his 47K-page brain on April 18. gbrain extract was effectively unusable on any brain with 20K+ existing links or timeline entries — it pre-loaded the entire dedup set with one getLinks() call per page over the Supabase pooler, hanging for 10+ minutes producing zero output before any work started. The v0.12.0 schema migration that creates idx_timeline_dedup was failing on brains with pre-existing duplicate timeline rows because the DELETE ... USING self-join was O(n²) without an index, hitting Supabase Management API's 60-second ceiling on 80K+ duplicates. Both bugs end here.
The numbers that matter
Measured on the new test/extract-fs.test.ts and test/migrate.test.ts regression suites, plus 73 E2E tests against real Postgres+pgvector. Reproducible: bun test + bun run test:e2e.
| Metric | BEFORE v0.12.1 | AFTER v0.12.1 | Δ |
|---|---|---|---|
| extract hang on 47K-page brain | 10+ min, zero output | immediate work, ~30-60s wall clock | usable |
| DB round-trips per re-extract | 47K reads + 235K writes | 0 reads + ~2.4K writes | ~99% fewer |
| v0.12.0 migration on 80K duplicate rows | timed out at 60s | completes <1s | ~60x+ faster |
| Re-run on already-extracted brain | 235K row-writes | 0 row-writes | true no-op |
| Tests | 1297 unit / 105 E2E | 1412 unit / 119 E2E | +115 unit / +14 E2E |
created counter on re-runs |
"5000 created" (lie) | "0 created" (truth) | accurate |
Per-batch round-trip math: a re-extract on a 47K-page brain with ~5 links per page used to do 235K sequential round-trips over the Supabase pooler. With 100-row batched INSERTs it does ~2,400. The hang came from the read pre-load (47K serial getLinks() calls), which is now gone entirely. The DB enforces uniqueness via ON CONFLICT DO NOTHING.
What this means for GBrain users
If you've been afraid to re-run gbrain extract because it might never finish, that's over. The command starts producing output immediately, batch-writes 100 rows per round-trip, and reports a truthful insert count even on re-runs. If your v0.12.0 upgrade got stuck on the timeline migration (or you had to manually run CREATE TABLE ... AS SELECT DISTINCT ON ... to unblock it), the next gbrain init --migrate-only is sub-second. Run gbrain extract all on your largest brain and watch it actually work.
Itemized changes
Performance
gbrain extractno longer pre-loads the dedup set. Removed the N+1 read loop inextractLinksFromDir,extractTimelineFromDir,extractLinksFromDB, andextractTimelineFromDBthat calledengine.getLinks(slug)(orgetTimeline) once per page acrossengine.listPages({ limit: 100000 }). On a 47K-page brain that was 47K serial network round-trips before the first file was even read. Both engines already enforced uniqueness at the SQL layer (UNIQUE(from_page_id, to_page_id, link_type)onlinks,idx_timeline_dedupontimeline_entries); the in-memory dedupSetwas redundant insurance that turned into the bottleneck.- Batched multi-row INSERTs replace per-row writes. All four extract paths now buffer 100 candidates and flush via new
addLinksBatch/addTimelineEntriesBatchengine methods. Round-trips drop ~100x: ~235K → ~2,400 per full re-extract. Each batch usesINSERT ... SELECT FROM unnest($1::text[], $2::text[], ...) JOIN pages ON CONFLICT DO NOTHING RETURNING 1— 4 (links) or 5 (timeline) array-typed bound parameters regardless of batch size, sidestepping Postgres's 65535-parameter cap entirely. PGLite uses the same SQL shape with manual$Nplaceholders.
Correctness
createdcounter is now truthful on re-runs. Returns count of rows actually inserted (viaRETURNING 1row count), not "calls that didn't throw." A re-run on a fully-extracted brain printsDone: 0 links, 0 timeline entries from 47000 pages. Before this release it would printDone: 5000 linkswhile inserting zero new rows.--dry-rundeduplicates candidates across files. A link extracted from 3 different markdown files now prints exactly once in--dry-runoutput, matching what the batch insert would actually create. Before this release the dedup was tied to the now-deleted DB pre-load, so dry-run would over-print.- Whole-batch errors are visible in both JSON and human modes. When a batch flush fails (DB connection drop, malformed row), the error prints to stderr in JSON mode AND to console in human mode, with the lost-row count. No more silent loss of 100 rows because of one bad row.
Schema migrations — v0.12.0 upgrade is now sub-second on duplicate-heavy brains
- Migration v9 (timeline_entries) and v8 (links) pre-create a btree helper index on the dedup columns before the
DELETE ... USINGself-join runs. Turns the O(n²) sequential-scan dedup into O(n log n) index-backed dedup. On 80K+ duplicate rows the migration completes in well under a second instead of timing out at 60s. The helper index is dropped after dedup, leaving the original schema unchanged. Same fix applied defensively to migration v8 — Garry's brain didn't trip it (links had fewer duplicates) but the same trap was loaded. phaseASchematimeout in the v0.12.0 orchestrator bumped 60s → 600s. Belt-and-suspenders: the helper-index fix should make dedup sub-second on most brains, but the outer wall-clock budget shouldn't be the failure mode for unforeseen slowness.
New engine API
addLinksBatch(LinkBatchInput[]) → Promise<number>andaddTimelineEntriesBatch(TimelineBatchInput[]) → Promise<number>on bothPostgresEngineandPGLiteEngine. Returns count of actually-inserted rows (excluding ON CONFLICT no-ops and JOIN-dropped rows whose slugs don't exist). Per-rowaddLink/addTimelineEntryare unchanged — all 10 existing call sites compile and behave identically. Plugin authors building agent integrations onBrainEnginecan adopt the batch methods at their own pace.
Tests
- Migration regression tests guard the fix structurally + behaviorally. New
test/migrate.test.tscases assert the v8 + v9 SQL literally contains the helperCREATE INDEX IF NOT EXISTS ... DROP INDEX IF EXISTSsequence in the right order (deterministic, fast, catches a regression even at 0-row scale where wall-clock can't distinguish O(n²) from O(1)) AND that the migration completes under wall-clock cap on 1000-row fixtures. test/extract-fs.test.ts(new file) covers the FS-source extract path end-to-end on PGLite: first-run inserts, second-run reports zero, dry-run dedups duplicate candidates across 3 files into one printed line, second-run perf regression guard.- 9 new E2E tests for the postgres-engine batch methods in
test/e2e/mechanical.test.ts. The postgres-js bind path is structurally different from PGLite's (array params viaunnest()vs manual$Nplaceholders) and gets its own coverage against real Postgres+pgvector. - 11 new PGLite batch method tests in
test/pglite-engine.test.ts(empty batch, missing optionals normalize to empty strings, within-batch dedup via ON CONFLICT, missing-slug rows dropped by JOIN, half-existing batch returns count of new only, batch of 100).
Pre-ship review
This release was reviewed by /plan-eng-review (5 issues, all addressed including a P0 plan reshape that dropped a redundant orchestrator phase in favor of fixing migration v9 directly), /codex outside-voice review on the plan (15 findings, all P1 + P2 incorporated — most consequential: forced a cleaner separation between per-row API stability and new batch APIs so all 10 existing addLink callers stay untouched), and 5 specialist subagents (testing, maintainability, performance, security, data-migration) at ship time. The testing specialist caught a real bug in the postgres-engine batch SQL: postgres-js's sql(rows, ...) helper doesn't compose with (VALUES) AS v(...) JOIN syntax the way originally written. Switched to the cleaner unnest() array-parameter pattern in both engines, verified end-to-end against a real Postgres+pgvector container.
[0.12.0] - 2026-04-18
The graph wires itself.
Your brain stops being grep.
GBrain v0.12.0 ships a self-wiring knowledge graph. Every put_page extracts entity references and creates typed links automatically (attended, works_at, invested_in, founded, advises) with zero LLM calls. New gbrain graph-query for typed-edge traversal. Backlink-boosted hybrid search. Auto-link reconciliation on every edit. The brain stops being a text store you grep through and starts being a knowledge graph you query.
The benchmark numbers that matter
Headline from BrainBench v1, a 240-page rich-prose corpus generated by Claude Opus, run on PGLite in-memory. Same data, same queries, before vs after PR #188. No API keys at run time. Reproducible: bun run eval/runner/all.ts, ~3 min.
| Metric | BEFORE PR #188 | AFTER PR #188 | Δ |
|---|---|---|---|
| Precision@5 (top-5 hits) | 39.2% | 44.7% | +5.4 pts |
| Recall@5 (correct in top-5) | 83.1% | 94.6% | +11.5 pts |
| Correct in top-5 (total) | 217 | 247 | +30 |
| Graph-only F1 (ablation) | 57.8% (grep) | 86.6% | +28.8 pts |
Per-link-type precision (graph-only, where the typed graph is the answer):
| Link type | Expected | BEFORE precision | AFTER precision | Δ |
|---|---|---|---|---|
| works_at | 120 | 21% | 94% | +73 pts |
| invested_in | 79 | 32% | 90% | +58 pts |
| advises | 61 | 10% | 78% | +68 pts |
| attended | 153 | 75% | 72% | -3 pts |
30 more correct answers in the top-5 the agent actually reads. 53% fewer total results to wade through. "Who works at Acme?" jumps from 21% precision (grep returns every page mentioning Acme: investors, advisors, concept pages, other companies) to 94% (graph returns just the employees).
What this means for GBrain users
The brain is no longer a text store with hybrid search bolted on. It's a queryable knowledge graph that ALSO has hybrid search. Six categories of orthogonal capability (identity resolution, temporal queries, performance at 10K-page scale, robustness to malformed input, MCP operation contract) all pass. Every page write is a graph mutation. Every query gets graph-first ranking. Auto-wire on upgrade ... gbrain post-upgrade runs the v0_12_0 orchestrator (schema, config check, backfill links, backfill timeline, verify), idempotent, ~30s on a 30K-page brain. Plus the v0.11 Minions runtime is fully merged: durable background agents + the graph layer in one release.
Itemized changes
Knowledge Graph Layer
Your brain now wires itself. Every page write automatically extracts entity references and creates typed links between pages. The links table goes from a manually-populated convention to a real, queryable knowledge graph that compounds over time.
- Auto-link on every page write. When you
gbrain puta page that mentions[Alice](people/alice)or[Acme](companies/acme), those links land in the graph automatically. Stale links (refs no longer in the page text) are removed in the same call. Run a quickgbrain putand the brain knows who's connected to whom. To opt out:gbrain config set auto_link false. - Typed relationships. Inferred from context using deterministic regex (zero LLM calls):
attended(meeting -> person),works_at(CEO of, VP at, joined as),invested_in(invested in, backed by),founded(founded, co-founded),advises(advises, board member),source(frontmatter),mentions(default). On a 80-page benchmark brain: 94% type accuracy. gbrain extract --source db. New mode for the existinggbrain extract <links|timeline|all>command that walks pages from the engine instead of from disk. Works for live brains backed by Postgres or PGLite without a local markdown checkout — exactly what an MCP-driven OpenClaw setup needs. Filesystem mode (--source fs) is unchanged and still the default.gbrain graph-query <slug>for relationship traversal. "Who works at Acme?" →gbrain graph-query companies/acme --type works_at --direction in. "Who attended meetings with Alice?" →gbrain graph-query people/alice --type attended --depth 2. Returns typed edges with depth, not just nodes. Backed by a newtraversePaths()engine method on both PGLite and Postgres with cycle prevention (no exponential blowup on cyclic subgraphs).- Graph-powered search ranking. Hybrid search now applies a small backlink boost after cosine re-scoring (
score *= 1 + 0.05 * log(1 + backlink_count)). Well-connected entities surface higher in results. Works in both keyword-only and full hybrid paths. Tested on the newtest/benchmark-graph-quality.ts(80 pages, 35 queries, A/B/C comparison) — relational query recall jumps from ~30% (search alone) to 100% (graph traversal). - Graph health metrics in
gbrain health. Newlink_coverageandtimeline_coveragepercentages on entity pages (person/company), plusmost_connectedtop-5 list. Thedead_linksfield is dropped (always 0 under ON DELETE CASCADE — was a phantom metric). Thebrain_scorecomposite formula stays but now reflects a sharper graph signal.
Schema migrations
Three new migrations apply automatically on gbrain init:
- v5 widens the
linksUNIQUE constraint to(from, to, link_type). The same person can now bothworks_atANDadvisesthe same company as separate rows, instead of one type clobbering the other. - v6 adds a UNIQUE index on
timeline_entries(page_id, date, summary)plusON CONFLICT DO NOTHINGinaddTimelineEntry. Idempotent inserts at the DB level — runninggbrain extract timeline --source dbtwice is safe. - v7 drops the
trg_timeline_search_vectortrigger that updatedpages.updated_aton every timeline insert. Structured timeline entries are now graph data only, not search text. The markdown timeline section inpages.timelinestill feeds search via the pages trigger. Side benefit: extraction pagination is no longer self-invalidating.
Security hardening (caught during pre-ship review)
traverse_graphMCP depth is hard-capped at 10. Without this, a remote MCP caller could passdepth=1e6and burn database memory/CPU on the recursive CTE.- Auto-link is disabled for remote MCP callers (
ctx.remote=true). Bare-slug regex matchespeople/Xanywhere in page text including code fences and quoted strings. Without this gate, an untrusted MCP caller could plant arbitrary outbound links by writing pages with intentional slug references; combined with the new backlink boost, attacker-placed targets would surface higher in search. runAutoLinkreconciliation runs inside a transaction. Without it, two concurrentput_pagecalls on the same slug would race: each reads staleexistingKeysand recreates links the other side just removed.--sincevalidates date format upfront. Invalid dates (--since yesterday) used to silently no-op the filter and reprocess the whole brain. Now: hard error with a clear message.
Tests
- 1151 unit tests pass (was 891 → +260 new)
- 105 E2E tests pass against PostgreSQL
- New
test/benchmark-graph-quality.tsruns the 80-page A/B/C comparison and gates on real thresholds (link_recall > 90%, type_accuracy > 80%, idempotency true). Currently passing all 9 thresholds. - BrainBench v1 (Cat 1+2 + 3, 4, 7, 10, 12) at 240-page Opus rich-prose corpus: Recall@5 83% → 95%, Precision@5 39% → 45%, +30 correct in top-5. Graph-only F1 86.6% vs grep 57.8%. See
docs/benchmarks/2026-04-18-brainbench-v1.md.
Schema migration renumber
The graph layer migrations (originally v5/v6/v7 on the link-timeline-extract branch) were renumbered to v8/v9/v10 to land cleanly on top of master's v5/v6/v7 (Minions: minion_jobs_table, agent_orchestration_primitives, agent_parity_layer). All v8/v9/v10 SQL is idempotent — fresh installs apply the full sequence cleanly; existing v0.11.x installs apply only the new v8/v9/v10. Branch installs that pre-dated this merge (very rare) need to drop and re-init their PGLite db to pick up master's v5/v6/v7 minion_jobs schema.
[0.11.1] - 2026-04-18
Fixed — the v0.11.0 migration mega-bug
Your v0.11.0 upgrade shipped the Minions schema, worker, queue, and migration skill. It didn't ship the actual migration running on upgrade. If you upgraded and ended up with no ~/.gbrain/preferences.json, autopilot still running inline, and cron jobs still hitting agentTurn's 300s timeout — that's the bug. This release fixes it and auto-repairs on your next gbrain upgrade.
gbrain apply-migrationsis the canonical repair. Reads~/.gbrain/migrations/completed.jsonl, diffs against the TS migration registry, runs any pending orchestrators. Idempotent: rerunning on a healthy install is cheap and silent.gbrain upgradeandpostinstallnow invoke it.runPostUpgradetail-callsapply-migrations --yesunconditionally (Codex caught that the earlier early-return on missing upgrade-state.json left broken-v0.11.0 installs broken forever).package.json's newpostinstallhook runs it afterbun update gbrain/npm i gbrain. First-install guard keeps postinstall silent when no brain is configured yet.- Stopgap for v0.11.0 binaries without this release: paste
curl -fsSL https://raw.githubusercontent.com/garrytan/gbrain/v0.11.1/scripts/fix-v0.11.0.sh | bash. It writespreferences.json+ astatus: "partial"record so the eventualapply-migrations --yesrun picks up where it left off — the stopgap does not poison the permanent migration path.
Added — autopilot supervises Minions itself, one install step
Before this release, autopilot + gbrain jobs work were two separate processes you had to manage. Now autopilot is the one install step, and it forks the Minions worker as a child with 10s-backoff restart + 5-crash cap + async SIGTERM drain that waits up to 35s for the worker to commit in-flight work before SIGKILL.
- Autopilot dispatches each cycle as a single
autopilot-cycleMinion job withidempotency_key: autopilot-cycle:<slot>. A 5-min autopilot + 8-min embed no longer stacks 4 overlapping runs — the queue's unique partial index dedupes at the DB layer. Codex caught that the earlier "parent/child DAG" plan was a category error (parent/child in Minions flips the parent towaiting-children, not the child towaiting-for-parent, so extract would have run before sync). - Per-step partial-failure handling. Each of sync / extract / embed / backlinks is wrapped in its own try/catch. Handler returns
{ partial: true, failed_steps: [...] }when any step fails; never throws. An intermittent extract bug no longer blocks every future cycle via Minion retry. - Env-aware
gbrain autopilot --installpicks the right supervisor: launchd on macOS, systemd user unit on Linux-with-systemd (with a strictersystemctl --user is-system-runningprobe — the naive/run/systemd/systemcheck was a false-positive magnet), bootstrap hook on ephemeral containers (Render / Railway / Fly / Docker — auto-injects into OpenClaw'shooks/bootstrap/ensure-services.shwhen detected, use--no-injectto opt out), crontab otherwise.--targetoverrides detection. Uninstall mirrors all four targets. - Worker child spawn uses
resolveGbrainCliPath()— never blindly usesprocess.execPath(on source installs that's the Bun runtime, notgbrain). Resolution tries argv[1], then execPath ending/gbrain, thenwhich gbrain.
Added — library-level Core fns so handlers don't kill workers
Reusing CLI entry-point functions (runExtract, runEmbed, etc.) as Minion handler bodies was wrong — any process.exit(1) on bad args would kill the entire worker process and every in-flight job. New Core fns throw instead:
runExtractCore(engine, opts)— wraps extract-links + extract-timeline.runEmbedCore(engine, opts)— accepts{ slug, slugs, all, stale }.runBacklinksCore(opts)—{ action: 'check' | 'fix', dir, dryRun }.runLintCore(opts)— returns counts, doesn't print human detail (CLI wrapper does that).
CLI wrappers (runExtract, runEmbed, etc.) stay as thin arg-parsers that catch + process.exit(1). Handlers in jobs.ts import the Core fns directly.
Added — skillify ships as a first-class gbrain skill
Ported from Garry's OpenClaw, proven in production. Paired with gbrain check-resolvable gives a user-controllable equivalent of Hermes' auto-skill-creation — you decide when and what, the tooling keeps the 10-item checklist honest.
skills/skillify/SKILL.md— the meta skill. Triggers: "skillify this", "is this a skill?", "make this proper".scripts/skillify-check.ts— machine-readable audit.--jsonfor CI,--recentto check files modified in the last 7 days.- README now has a short section explaining the Skillify + check-resolvable pair and why user-controlled beats auto-generated.
Added — host-agnostic plugin contract (replaces handlers.json)
An earlier design draft shipped ~/.claude/gbrain-handlers.json where each entry was a shell command the worker would exec. Codex flagged this as a durable RCE surface. Dropped in favor of a code-level plugin contract:
docs/guides/plugin-handlers.md— the full contract. Host importsgbrain/minions, constructs aMinionWorker, callsworker.register(name, fn)for every custom handler, callsworker.start(). Ships the bootstrap as code in the host repo, same trust model as any other code.skills/conventions/cron-via-minions.md— the rewrite convention for cron manifests. PGLite branch keeps--follow(inline); Postgres branch drops--follow+ uses--idempotency-keyon the cycle slot.skills/migrations/v0.11.0.md— body restored as the host-agent instruction manual. Walks the host through every JSONL TODO using the 10-item skillify checklist.
Added — gbrain init --migrate-only (the Codex H1 fix)
Running bare gbrain init with no flags defaulted to PGLite and called saveConfig — silently clobbering any existing Postgres config. The migration orchestrator now calls gbrain init --migrate-only which only applies the schema against the configured engine and NEVER writes a new config. Apply-migrations + stopgap + postinstall all use this flag. Bare gbrain init still exists and still defaults to PGLite when you want a fresh install.
Changed
runPostUpgradeis now async + runsapply-migrations --yesunconditionally (Codex H8).gbrain upgrade's subprocess timeout forpost-upgradebumped 30s → 300s so the migration has room to do real work like autopilot install (Codex H7).- Migration enumeration uses a TS registry at
src/commands/migrations/index.tsinstead of walkingskills/migrations/*.mdon disk — compiled binaries see the same set source installs do (Codex K). - Migration diff rule: apply when no
status: "complete"entry exists incompleted.jsonlANDversion ≤ installed VERSION. Earlier proposed "version > currentVersion" would have SKIPPED v0.11.0 when running v0.11.1 (Codex H9). - Autopilot refreshes its lock-file mtime every cycle so a long-lived autopilot doesn't get declared "stale" by the next cron-fired invocation after 10 minutes (Codex C).
- CLAUDE.md gained a new "Migration is canonical, not advisory" section pinning the design principle.
Tests
34 new unit tests across preferences, init-migrate-only, apply-migrations, v0.11.0 orchestrator, handlers, autopilot-resolve-cli, autopilot-install, skillify-check. All 1177 existing tests still green.
[0.11.0] - 2026-04-18
Added — Minions (agent orchestration primitives)
Minions was a job queue. Now it's an agent runtime. Everything your orchestrator needs to fan out work across sub-agents without turning them into orphans or rate-limit disasters.
-
Depth tracking and
max_spawn_depth. Runaway recursion is a real prod failure. Children inheritdepth = parent.depth + 1and submit rejects past a configurable cap (default 5). Your orchestrator can no longer spawn itself into an infinite tree by accident. -
Per-parent child cap (
max_children). Stop spawn storms before they hit OpenAI's rate limit. Setmax_children: 10on a parent job and the 11th submit throws. Enforced viaSELECT ... FOR UPDATEon the parent row so concurrent submits can't both slip through. -
Per-job wall-clock timeout (
timeout_ms). The #2 daily OpenClaw pain is "agent stops responding" ... long handler, token bloat, no clock. Now every job can declare a ceiling.handleTimeouts()dead-letters expired rows; a per-jobsetTimeoutfires AbortSignal as a best-effort handler interrupt. No retry on timeout, terminal by design. -
Cascade cancel via recursive CTE.
cancelJob()walks the full descendant tree in a single statement and cancels everything. Grandchild orphan bug is gone. Re-parented descendants (viaremoveChildDependency) are naturally excluded. Depth cap of 100 on the CTE as runaway safety. -
Idempotency keys. Add
idempotency_key: 'sync:2026-04-18'to your submit and only one job per key ever runs. PG unique partial index enforces it at the DB layer, two concurrent pods submitting the same key collapse to one row. No more "did my cron fire twice?" anxiety. -
Child to parent
child_doneinbox. When a child completes, the parent gets{type:'child_done', child_id, job_name, result}posted to its inbox in the same transaction as the token rollup. Fan-in for free.readChildCompletions(parent_id)filters the inbox by message type with an optionalsincecursor. Works as the primitive for futurewaitForChildren(n)helpers. -
removeOnComplete/removeOnFail. BullMQ convenience. Completed jobs don't bloat yourminion_jobstable forever. Opt in per-job, thechild_donemessage survives because it lives in the parent's inbox, not the child's. -
Attachment manifest. New
minion_attachmentstable for binary payloads attached to jobs. Validation catches path traversal (../,/,\, null byte), oversize (5 MiB default, raiseable), invalid base64, and duplicate filenames per job. DB-levelUNIQUE (job_id, filename)defends against concurrent addAttachment races.storage_uri TEXTcolumn forward-compat for future S3 offload. -
Cooperative AbortSignal. Pause or cascade-cancel clears the job's
lock_token, the running handler's next lock renewal fails and firesctx.signal.abort(). Handlers that respect AbortSignal stop cleanly. Handlers that ignore it get dead-lettered by the DB-sidehandleTimeouts, either way, the row status is correct. -
Transactional correctness fixes.
completeJob()andfailJob()now wrap inengine.transaction(). Parent hook invocations (resolveParent,failParent,removeChildDependency) fold into the same transaction so a process crash between child-update and parent-update can't strand the parent inwaiting-children. Fixed a pre-existing bug whereadd()was inverting child/parent status (child gotwaiting-children, parent stayedwaiting, making the child unclaimable until a manual UPDATE). Tests that worked around it are now cleaned up. -
Migration v7 (
agent_parity_layer). Additive schema: new columns onminion_jobs(all defaulted, nullable where appropriate), newminion_attachmentstable, 3 partial indexes for bounded scans (idx_minion_jobs_timeout,idx_minion_jobs_parent_status,uniq_minion_jobs_idempotency). Existing installs pick it up on nextgbrain init, no manual action required.
Fixed
-
JSONB double-encode bug. When writing to JSONB columns via
engine.executeRaw(sql, params), postgres.js auto-JSON-encodes parameters. CallingJSON.stringify(obj)first stored a JSON string literal, makingjsonb_typeof = stringand breakingpayload->>'key'queries silently. Fixed in three call sites (child_doneinbox post,updateProgress,sendMessage). PGLite tolerated both forms so the unit tests missed it, only a real-Postgres E2E with thepayload->>operator caught it. -
Sibling completion race. Under READ COMMITTED, two grandchildren completing concurrently each saw the other as still-active in their pre-commit snapshot, so neither flipped the parent out of
waiting-children. Fixed by takingSELECT ... FOR UPDATEon the parent row at the start ofcompleteJobandfailJobtransactions. Siblings now serialize on the parent lock, second commit sees the first as completed and correctly advances the parent.
Tests
-
~33 new tests in
test/minions.test.tscovering depth cap, per-parent child cap, timeout dead-letter, cascade cancel (including the re-parent edge case),removeOnComplete/removeOnFail, idempotency (single + concurrent),child_doneinbox (posted in txn + survives child removeOnComplete + since cursor), attachment validation (oversize, path traversal, null byte, duplicates, base64), AbortSignal firing on pause mid-handler, catch-block skippingfailJobwhen aborted, worker in-flight bookkeeping, token-rollup guard when parent already terminal, setTimeout safety-net cleanup. -
test/e2e/minions-concurrency.test.ts... two worker instances against real Postgres, 20 jobs, zero double-claims. The only test that actually verifiesFOR UPDATE SKIP LOCKEDunder real concurrency. PGLite can't prove this. -
test/e2e/minions-resilience.test.ts... 5 tests covering the 6 OpenClaw daily pains: spawn storms, agent stall, forgotten dispatches, cascade cancel, deep tree fan-in with grandchild completions. Every pain has a test that fails if the primitive regresses. -
1066 unit + 105 E2E = 1171 tests passing before this ship. The parity layer isn't just planned, it's pinned down.
[0.10.2] - 2026-04-17
Security — Wave 3 (9 vulnerabilities closed)
This wave closes a high-severity arbitrary-file-read in file_upload, fixes a fake trust boundary that let any cwd-local recipe execute arbitrary commands, and lays down real SSRF defense for HTTP health checks. If you ran gbrain in a directory where someone could drop a recipes/ folder, this matters.
- Arbitrary file read via
file_uploadis closed. Remote (MCP) callers were able to read/etc/passwdor any other host file. Path validation now usesrealpathSync+path.relativeto catch symlinked-parent traversal, plus an allowlist regex for slugs and filenames (control chars, backslashes, RTL-override Unicode all rejected). Local CLI users still upload from anywhere — only remote callers are confined. Fixes Issue #139, contributed by @Hybirdss; original fix #105 by @garagon. - Recipe trust boundary is real now.
loadAllRecipes()previously marked every recipe asembedded=true, including ones from./recipes/in your cwd or$GBRAIN_RECIPES_DIR. Anyone who could drop a recipe in cwd could bypass every health-check gate. Now only package-bundled recipes (source install + global install) are trusted. Original fixes #106, #108 by @garagon. - String health_checks blocked for untrusted recipes. Even with the recipe trust fix, the string health_check path ran
execSyncbefore reaching the typed-DSL switch — a malicious "embedded" recipe couldcurl http://169.254.169.254/metadataand exfiltrate cloud credentials. Non-embedded recipes are now hard-blocked from string health_checks; embedded recipes still get theisUnsafeHealthCheckdefense-in-depth guard. - SSRF defense for HTTP health_checks. New
isInternalUrl()blocks loopback, RFC1918, link-local (incl. AWS metadata 169.254.169.254), CGNAT, IPv6 loopback, and IPv4-mapped IPv6 ([::ffff:127.0.0.1]canonicalized to hex hextets — both forms blocked). Bypass encodings handled: hex IPs (0x7f000001), octal (0177.0.0.1), single decimal (2130706433). Scheme allowlist rejectsfile:,data:,blob:,ftp:,javascript:.fetchruns withredirect: 'manual'and re-validates every Location header up to 3 hops. Original fix #108 by @garagon. - Prompt injection hardening for query expansion. Restructured the LLM prompt with a system instruction that declares the query as untrusted data, plus an XML-tagged
<user_query>boundary. Layered with regex sanitization (strips code fences, tags, injection prefixes) and output-side validation on the model'salternative_queriesarray (cap length, strip control chars, dedup, drop empties). Theconsole.warnon stripped content never logs the query text itself. Original fix #107 by @garagon. list_pagesandget_ingest_logactually cap now. Wave 3 found thatclampSearchLimit(limit, default)was always allowing up to 100 — the second arg was the default, not the cap. Added a thirdcapparameter solist_pagescaps at 100 andget_ingest_logcaps at 50. Internal bulk commands (embed --all, export, migrate-engine) bypass the operation layer entirely and remain uncapped. Original fix #109 by @garagon.
Added
OperationContext.remoteflag distinguishes trusted local CLI callers from untrusted MCP callers. Security-sensitive operations (currentlyfile_upload) tighten their behavior whenremote=true. Defaults to strict (treat as remote) when unset.- Exported security helpers for testing and reuse:
validateUploadPath,validatePageSlug,validateFilename,parseOctet,hostnameToOctets,isPrivateIpv4,isInternalUrl,getRecipeDirs,sanitizeQueryForPrompt,sanitizeExpansionOutput. - 49 new tests covering symlink traversal, scheme allowlist, IPv4 bypass forms, IPv6 mapped addresses, prompt injection patterns, and recipe trust boundaries. Plus an E2E regression proving remote callers can't escape cwd.
Contributors
Wave 3 fixes were contributed by @garagon (PRs #105-#109) and @Hybirdss (Issue #139). The collector branch re-implemented each fix with additional hardening for the residuals Codex caught during outside-voice review (parent-symlink traversal, fake isEmbedded boundary, redirect-following SSRF, scheme bypasses, clampSearchLimit semantics).
[0.10.1] - 2026-04-15
Fixed
-
gbrain sync --watchactually works now. The watch loop existed but was never called because the CLI routed sync through the operation layer (single-pass only). Now sync routes through the CLI path that knows about--watchand--interval. Your cron workaround is no longer needed. -
Sync auto-embeds your pages. After syncing, gbrain now embeds the changed pages automatically. No more "I synced but search can't find my new page." Opt out with
--no-embed. Large syncs (100+ pages) defer embedding togbrain embed --stale. -
First sync no longer repeats forever.
performFullSyncwasn't saving its checkpoint. Fixed: sync state persists after full import so the next sync is incremental. -
dead_linksmetric is consistent across engines. Postgres was counting empty-content chunks instead of dangling links. Now both engines count the same thing: links pointing to non-existent pages. -
Doctor recommends the right embed command. Was suggesting
gbrain embed refresh(doesn't exist). Now correctly saysgbrain embed --stale.
Added
-
gbrain extract links|timeline|allbuilds your link graph and structured timeline from existing markdown. Scans for markdown links, frontmatter fields (company, investors, attendees), and See Also sections. Infers link types from directory structure. Parses both bullet (- **YYYY-MM-DD** | Source — Summary) and header (### YYYY-MM-DD — Title) timeline formats. Runs automatically after every sync. -
gbrain features --json --auto-fixscans your brain and tells you what you're not using, with your own numbers. Priority 1 (data quality): missing embeddings, dead links. Priority 2 (unused features): zero links, zero timeline, low coverage, unconfigured integrations. Agents run--auto-fixto handle everything automatically. -
gbrain autopilot --installsets up a persistent daemon that runs sync, extract, and embed in a continuous loop. Health-based scheduling: brain score >= 90 slows down, < 70 speeds up. Installs as a launchd service (macOS) or crontab entry (Linux). One command, brain maintains itself forever. -
Brain health score (0-100) in
gbrain healthandgbrain doctor. Weighted composite of embed coverage, link density, timeline coverage, orphan pages, and dead links. Agents use it as a health gate. -
gbrain embed --slugsembeds specific pages by slug. Used internally by sync auto-embed to target just the changed pages. -
Instruction layer for agents. RESOLVER.md routing entries, maintain skill sections, and setup skill phase for extract, features, and autopilot. Without these, agents would never discover the new commands.
[0.10.0] - 2026-04-14
Added
-
Background jobs that don't die. Minions is a BullMQ-inspired job queue built directly into GBrain. No Redis. No external dependencies. Submit
gbrain jobs submit embed --followand it runs with automatic retry, exponential backoff, and stall detection. Kill the process mid-job? Stall detection catches it and requeues. Rungbrain jobs workto start a persistent worker daemon that processes jobs from the queue. Jobs are first-class: submit, list, cancel, retry, prune, stats, all from the CLI or MCP. Your agent can now run long operations (14K+ page embeds, bulk enrichment) as durable background jobs instead of fragile inline commands. -
Your agent now has 24 skills, not 8. 16 new brain skills generalized from a production deployment with 14,700+ pages. Signal detection, brain-first lookup, content ingestion (articles, video, meetings), entity enrichment, task management, cron scheduling, reports, and cross-modal review. All shipped as fat markdown files your agent reads on demand.
-
Signal detector fires on every message. A cheap sub-agent spawns in parallel to capture original thinking and entity mentions. Ideas get preserved with exact phrasing. Entities get brain pages. The brain compounds on autopilot.
-
RESOLVER.md routes your agent to the right skill. Modeled on a 215-line production dispatcher. Categorized routing table: always-on, brain ops, ingestion, thinking, operational. Your agent reads it, matches the user's intent, loads the skill. No slash commands needed.
-
Soul-audit builds your agent's identity. 6-phase interactive interview generates SOUL.md (who the agent is), USER.md (who you are), ACCESS_POLICY.md (who sees what), and HEARTBEAT.md (operational cadence). Re-runnable anytime. Ships with minimal defaults so first boot is instant.
-
Access control out of the box. 4-tier privacy policy (Full/Work/Family/None) enforced by skill instructions before every response. Template-based, configurable per user.
-
Conventions directory codifies operational discipline. Brain-first lookup protocol, citation quality standards, model routing table, test-before-bulk rule, and cross-modal review pairs. These are the hard-won patterns that prevent bad bulk runs and silent failures.
-
gbrain initdetects GStack and reports mod status. After brain setup, init now shows how many skills are loaded, whether GStack is installed, and where to get it. GStack detection usesgstack-global-discoverwith fallback to known host paths. -
Conformance standard for all skills. Every skill now has YAML frontmatter (name, version, description, triggers, tools, mutating) plus Contract, Anti-Patterns, and Output Format sections. Two new test files validate conformance across all 25 skills.
-
Existing 8 skills migrated to conformance format. Frontmatter added, Workflow renamed to Phases, Contract and Anti-Patterns sections added. Ingest becomes a thin router delegating to specialized ingestion skills.
The 16 new skills
| Skill | What it does | Why it matters |
|---|---|---|
| signal-detector | Fires on every message. Spawns a cheap model in parallel to capture original thinking and entity mentions. | Your brain compounds on autopilot. Every conversation is an ingest event. Miss a signal and the brain never learns it. |
| brain-ops | Brain-first lookup before any external API. The read-enrich-write loop that makes every response smarter. | Without this, your agent reaches for Google when the answer is already in the brain. Wastes tokens, misses context. |
| idea-ingest | Links, articles, tweets go into the brain with analysis, author people pages, and cross-linking. | Every article worth reading is worth remembering. The author gets a people page. The ideas get cross-linked to what you already know. |
| media-ingest | Video, audio, PDF, books, screenshots, GitHub repos. Transcripts, entity extraction, backlink propagation. | One skill handles every media format. Absorbs what used to be 3 separate skills (video-ingest, youtube-ingest, book-ingest). |
| meeting-ingestion | Transcripts become brain pages. Every attendee gets enriched. Every company discussed gets a timeline entry. | A meeting is NOT fully ingested until every entity is propagated. This is the skill that turns a transcript into 10 updated brain pages. |
| citation-fixer | Scans brain pages for missing or malformed [Source: ...] citations. Fixes formatting to match the standard. |
Without citations, you can't trace facts back to where they came from. Six months later, "who said this?" has an answer. |
| repo-architecture | Where new brain files go. Decision protocol: primary subject determines directory, not format or source. | Prevents the #1 misfiling pattern: dumping everything in sources/ because it came from a URL. |
| skill-creator | Create new skills following the conformance standard. MECE check against existing skills. Updates manifest and resolver. | Users who need a capability GBrain doesn't have can create it themselves. The skill teaches the agent how to extend itself. |
| daily-task-manager | Add, complete, defer, remove, review tasks with priority levels (P0-P3). Stored as a searchable brain page. | Your tasks live in the brain, not a separate app. The agent can cross-reference tasks with meeting notes and people pages. |
| daily-task-prep | Morning preparation. Calendar lookahead with brain context per attendee, open threads from yesterday, active task review. | Walk into every meeting with full context on every person in the room, automatically. |
| cross-modal-review | Spawn a different AI model to review the agent's work before committing. Refusal routing: if one model refuses, silently switch. | Two models agreeing is stronger signal than one model being thorough. Refusal routing means the user never sees "I can't do that." |
| cron-scheduler | Schedule staggering (5-min offsets), quiet hours (timezone-aware with wake-up override), thin job prompts. | 21 cron jobs at :00 is a thundering herd. Staggering prevents it. Quiet hours mean no 3 AM notifications. Wake-up override releases the backlog. |
| reports | Timestamped reports with keyword routing. "What's the latest briefing?" maps to the right report directory. | Cheap replacement for vector search on frequent queries. Don't embed. Load the file. |
| testing | Validates every skill has SKILL.md with frontmatter, manifest coverage, resolver coverage. The CI for your skill system. | 3 skills and you need validation. 24 skills and you need it yesterday. Catches dead references, missing sections, MECE violations. |
| soul-audit | 6-phase interview that generates SOUL.md, USER.md, ACCESS_POLICY.md, HEARTBEAT.md. Your agent's identity, built from your answers. | What makes your OpenClaw feel like yours. Without personality and access control, every agent feels the same. |
| webhook-transforms | External events (SMS, meetings, social mentions) converted into brain pages with entity extraction. Dead-letter queue for failures. | Your brain ingests signals from everywhere. Not just conversations, but every webhook, every notification, every external event. |
Infrastructure (new in v0.10.0)
-
Your brain now self-validates its own skill routing.
checkResolvable()verifies every skill is reachable from RESOLVER.md, detects MECE overlaps, flags missing triggers, and catches DRY violations. Runs frombun test,gbrain doctor, and the skill-creator skill. Every issue comes with a machine-readable fix object the agent can act on. -
gbrain doctorgot serious. 8 health checks now (up from 5), plus a composite health score (0-100). Filesystem checks (resolver, conformance) run even without a database.--fastskips DB checks.--jsonoutput includes structuredissuesarray with action strings so agents can parse and auto-fix. -
Batch operations won't melt your machine anymore. Adaptive load-aware throttling checks CPU and memory before each batch item. Exponential backoff with a 20-attempt safety cap. Active hours multiplier slows batch work during the day. Two concurrent batch process limit.
-
Your agent's classifiers get smarter automatically. Fail-improve loop: try deterministic code first, fall back to LLM, log every fallback. Over time, the logs reveal which regex patterns are missing. Auto-generates test cases from successful LLM results. Tracks deterministic hit rate in
gbrain doctoroutput. -
Voice notes just work. Groq Whisper transcription (with OpenAI fallback) via
transcribe_audiooperation. Files over 25MB get ffmpeg-segmented automatically. Transcripts flow through the standard import pipeline, entities get extracted, back-links get created. -
Enrichment is now a global service, not a per-skill skill. Every ingest pathway can call
extractAndEnrich()to detect entities and create/update their brain pages. Tier auto-escalation: entities start at Tier 3, auto-promote to Tier 1 based on mention frequency across sources. -
Data research: one skill for any email-to-tracker pipeline. New
data-researchskill with parameterized YAML recipes. Extract investor updates (MRR, ARR, runway, headcount), expense receipts, company metrics from email. Battle-tested regex patterns, extraction integrity rule (save first, report second), dedup with configurable tolerance, canonical tracker pages with running totals.
For contributors
test/skills-conformance.test.tsvalidates every skill has valid frontmatter and required sectionstest/resolver.test.tsvalidates RESOLVER.md coverage and routing consistencyskills/manifest.jsonnow hasconformance_versionfield and lists all 24 skills- Identity templates in
templates/(SOUL.md, USER.md, ACCESS_POLICY.md, HEARTBEAT.md)
[0.9.3] - 2026-04-12
Added
- Search understands what you're asking. +21% page coverage, +29% signal, 100% source accuracy. A zero-latency intent classifier reads your query and picks the right search mode. "Who is Alice?" surfaces your compiled truth assessment. "When did we last meet?" surfaces timeline entries with dates. No LLM call, just pattern matching. Your agent sees 8.7 relevant pages per query instead of 7.2, and two thirds of returned chunks are now distilled assessments instead of half. Entity lookups always lead with compiled truth. Temporal queries always find the dates. Benchmarked against 29 pages, 20 queries with graded relevance (run
bun run test/benchmark-search-quality.tsto reproduce). Inspired by Ramp Labs' "Latent Briefing" paper (April 2026). gbrain query --detail low/medium/high. Agents can control how deep search goes.lowreturns compiled truth only.medium(default) returns everything with dedup.highreturns all chunks uncapped. Auto-escalates from low to high if no results found. MCP picks it up automatically.gbrain evalmeasures search quality. Full retrieval evaluation harness with P@k, R@k, MRR, nDCG@k metrics. A/B comparison mode for parameter tuning:gbrain eval --qrels queries.json --config-a baseline.json --config-b boosted.json. Contributed by @4shut0sh.- CJK queries expand correctly. Chinese, Japanese, and Korean text was silently skipping query expansion because word count used space-delimited splitting. Now counts characters for CJK. Contributed by @YIING99.
- Health checks speak a typed language now. Recipe
health_checksuse a typed DSL (http,env_exists,command,any_of) instead of raw shell strings. No moreexecSync(untrustedYAML). Your agent runsgbrain integrations doctorand gets structured results, not shell injection risk. All 7 first-party recipes migrated. String health checks still work (with deprecation warning) for backward compat.
Fixed
- Your storage backend can't be tricked into reading
/etc/passwd.LocalStoragenow validates every path stays within the storage root.../../etc/passwdgets "Path traversal blocked" instead of your system files. All 6 methods covered (upload, download, delete, exists, list, getUrl). - MCP callers can't read arbitrary files via
file_url.resolveFile()now validates the requested path stays within the brain root before touching the filesystem. Previously,../../etc/passwdwould read any file the process could access. .supabasemarker files can't escape their scope. Marker prefix validation now rejects../, absolute paths, and bare... A crafted.supabasefile in a shared brain repo can't make storage requests outside the intended prefix.- File queries can't blow up memory. The slug-filtered
file_listMCP operation now has the sameLIMIT 100as the unfiltered branch. Also fixed the CLIgbrain files listandgbrain files verifycommands. - Symlinks in brain directories can't exfiltrate files. All 4 file walkers in
files.tsplus theinit.tssize counter now uselstatSyncand skip symlinks. Broken symlinks andnode_modulesdirectories are also skipped. - Recipe health checks can't inject shell commands. Non-embedded (user-created) recipes with shell metacharacters in health_check strings are blocked. First-party recipes are trusted but migrated to the typed DSL.
[0.9.2] - 2026-04-12
Fixed
- Fresh local installs initialize cleanly again.
gbrain initnow creates the local PGLite data directory before taking its advisory lock, so first-run setup no longer misreports a missing directory as a lock timeout.
[0.9.1] - 2026-04-11
Fixed
- Your brain can't be poisoned by rogue frontmatter anymore. Slug authority is now path-derived. A file at
notes/random.mdcan't declareslug: people/adminand silently overwrite someone else's page. Mismatches are rejected with a clear error telling you exactly what to fix. - Symlinks in your notes directory can't exfiltrate files. The import walker now uses
lstatSyncand refuses to follow symlinks, blocking the attack where a contributor plants a link to~/.zshrcin the brain directory. Defense-in-depth:importFromFileitself also checks. - Giant payloads through MCP can't rack up your OpenAI bill.
importFromContentnow checksBuffer.byteLengthbefore any processing. 10 MB of emoji throughput_page? Rejected before chunking starts. - Search can't be weaponized into a DoS.
limitis clamped to 100 across all search paths (keyword, vector, hybrid).statement_timeout: 8son the Postgres connection as defense-in-depth. Requestinglimit: 10000000now gets you 100 results and a warning. - PGLite stops crashing when two processes touch the same brain. File-based advisory lock using atomic
mkdirwith PID tracking and 5-minute stale detection. Clear error messages tell you which process holds the lock and how to recover. - 12 data integrity fixes landed. Orphan chunks cleaned up on empty pages. Write operations (
addLink,addTag,addTimelineEntry,putRawData,createVersion) now throw when the target page doesn't exist instead of silently no-opping. Health metrics (stale_pages,dead_links,orphan_pages) now measure real problems instead of always returning 0. Keyword search moved from JS-side sort-and-splice to a SQL CTE withLIMIT. MCP server validates params before dispatch. - Stale embeddings can't lie to you anymore. When chunk text changes but embedding fails, the old vector is now NULL'd out instead of preserved. Previously, search could return results based on outdated vectors attached to new text.
- Embedding failures are no longer silent. The
catch { /* non-fatal */ }is gone. You now get[gbrain] embedding failed for slug (N chunks): error messagein stderr. Still non-fatal, but you know what happened. - O(n^2) chunk lookup in
embedPageis gone. Replacedfind() + indexOf()with a singleMaplookup. Matches the patternembedAllalready uses. - Stdin bombs blocked.
parseOpArgsnow caps stdin at 5 MB before the full buffer is consumed.
Added
gbrain embed --allis 30x faster. Sliding worker pool with 20 concurrent workers (tunable viaGBRAIN_EMBED_CONCURRENCY). A 20,000-chunk corpus that took 2.5 hours now finishes in ~8 minutes.- Search pagination. Both
searchandquerynow accept--offsetfor paginating through results. Combined with the 100-result ceiling, you can now page through large result sets. gbrain askis an alias forgbrain query. CLI-only, doesn't appear in MCP tools-json.- Content hash now covers all page fields. Title, type, and frontmatter changes trigger re-import. First sync after upgrade will re-import all pages (one-time, expected).
- Migration file for v0.9.1. Auto-update agent knows to expect the full re-import and will run
gbrain embed --allafterward. pgcryptoextension added to schema. Fallback forgen_random_uuid()on Postgres < 13.
Changed
- Search type and exclude_slugs filters now work. These were advertised in the API but never implemented. Both
searchKeywordandsearchVectornow respecttypeandexclude_slugsparams. - Hybrid search no longer double-embeds the query.
expandQueryalready includes the original, so we use it directly instead of prepending.
[0.9.0] - 2026-04-11
Added
-
Large files don't bloat your git repo anymore.
gbrain files upload-rawauto-routes by size: text and PDFs under 100 MB stay in git, everything larger (or any media file) goes to Supabase Storage with a.redirect.yamlpointer left in the repo. Files over 100 MB use TUS resumable upload (6 MB chunks with retry and backoff) so a flaky connection doesn't lose a 2 GB video upload.gbrain files signed-urlgenerates 1-hour access links for private buckets. -
The full file migration lifecycle works end to end.
mirroruploads to cloud and keeps local copies.redirectreplaces local files with.redirect.yamlpointers (verifies remote exists first, won't delete data).restoredownloads back from cloud.cleanremoves pointers when you're sure.statusshows where you are. Three states, zero data loss risk. -
Your brain now enforces its own graph integrity. The Iron Law of Back-Linking is mandatory across all skills. Every mention of a person or company creates a bidirectional link. This transforms your brain from a flat file store into a traversable knowledge graph.
-
Filing rules prevent the #1 brain mistake. New
skills/_brain-filing-rules.mdstops the most common error: dumping everything intosources/. File by primary subject, not format. Includes notability gate and citation requirements. -
Enrichment protocol that actually works. Rewritten from a 46-line API list to a 7-step pipeline with 3-tier system, person/company page templates, pluggable data sources, validation rules, and bulk enrichment safety.
-
Ingest handles everything. Articles, videos, podcasts, PDFs, screenshots, meeting transcripts, social media. Each with a workflow that uses real gbrain commands (
upload-raw,signed-url) instead of theoretical patterns. -
Citation requirements across all skills. Every fact needs inline
[Source: ...]citations. Three formats, source precedence hierarchy. -
Maintain skill catches what you missed. Back-link enforcement, citation audit, filing violations, file storage health checks, benchmark testing.
-
Voice calls don't crash on em dashes anymore. Unicode sanitization for Twilio WebSocket, PII scrub, identity-first prompt, DIY STT+LLM+TTS pipeline option, Smart VAD default, auto-upload call audio via
gbrain files upload-raw. -
X-to-Brain gets eyes. Image OCR, Filtered Stream real-time monitoring, 6-dimension tweet rating rubric, outbound tweet monitoring, cron staggering.
-
Share brain pages without exposing the brain.
gbrain publishgenerates beautiful, self-contained HTML from any brain page. Strips private data (frontmatter, citations, confirmations, brain links, timeline) automatically. Optional AES-256-GCM password gate with client-side decryption, no server needed. Dark/light mode, mobile-optimized typography. This is the first code+skill pair: deterministic code does the work, the skill tells the agent when and how. See the Thin Harness, Fat Skills thread for the architecture philosophy.
Changed
-
Supabase Storage now auto-selects upload method by file size: standard POST for < 100 MB, TUS resumable for >= 100 MB. Signed URL generation for private bucket access (1-hour expiry).
-
File resolver supports both
.redirect.yaml(v0.9+) and legacy.redirect(v0.8) formats for backward compatibility. -
Redirect format upgraded from
.redirect(5 fields) to.redirect.yaml(10 fields: target, bucket, storage_path, size, size_human, hash, mime, uploaded, source_url, type). -
All skills updated to reference actual
gbrain filescommands instead of theoretical patterns. -
Back-link enforcer closes the loop.
gbrain check-backlinks checkscans your brain for entity mentions without back-links.gbrain check-backlinks fixcreates them. The Iron Law of Back-Linking is in every skill, now the code enforces it. -
Page linter catches LLM slop.
gbrain lintflags "Of course! Here is..." preambles, wrapping code fences, placeholder dates, missing frontmatter, broken citations, and empty sections.gbrain lint --fixauto-strips the fixable ones. Every brain that uses AI for ingestion accumulates this. Now it's one command. -
Audit trail for everything.
gbrain report --type enrichment-sweepsaves timestamped reports tobrain/reports/{type}/YYYY-MM-DD-HHMM.md. The maintain skill references this for enrichment sweeps, meeting syncs, and maintenance runs. -
Publish skill added to manifest (8th skill). First code+skill pair.
-
Skills version bumped to 0.9.0.
-
67 new unit tests across publish, backlinks, lint, and report. Total: 409 pass.
[0.8.0] - 2026-04-11
Added
- Your AI can answer the phone now. Voice-to-brain v0.8.0 ships 25 production patterns from a real deployment. WebRTC works in a browser tab with just an OpenAI key, phone number via Twilio is optional. Your agent picks its own name and personality. Pre-computed engagement bids mean it greets you with something specific ("dude, your social radar caught something wild today"), not "how can I help you?" Context-first prompts, proactive advisor mode, caller routing, dynamic noise suppression, stuck watchdog, thinking sounds during tool calls. This is the "Her" experience, out of the box.
- Upgrade = feature discovery. When you upgrade to v0.8.0, the CLI tells you what's new and your agent offers to set up voice immediately. WebRTC-first (zero setup), then asks about a phone number. Migration files now have YAML frontmatter with
feature_pitchso every future version can pitch its headline feature through the upgrade flow. - Remote MCP simplified. The Supabase Edge Function deployment is gone. Remote MCP now uses a self-hosted server + ngrok tunnel. Simpler, more reliable, works with any AI client. All
docs/mcp/guides updated to reflect the actual production architecture.
Changed
- Voice recipe is now 25 production patterns deep. Identity separation, pre-computed bid system, context-first prompts, proactive advisor mode, conversation timing (the #1 fix), no-repetition rule, radical prompt compression (13K to 4.7K tokens), OpenAI Realtime Prompting Guide structure, auth-before-speech, brain escalation, stuck watchdog, never-hang-up rule, thinking sounds, fallback TwiML, tool set architecture, trusted user auth, caller routing, dynamic VAD, on-screen debug UI, live moment capture, belt-and-suspenders post-call, mandatory 3-step post-call, WebRTC parity, dual API event handling, report-aware query routing.
- WebRTC session pseudocode updated. Native FormData,
toolsin session config,type: 'realtime'on all session.update calls. WebRTC transcription NOT supported over data channel (use Whisper post-call). - MCP docs rewritten. All per-client guides (Claude Code, Claude Desktop, Cowork, Perplexity) updated from Edge Function URLs to self-hosted + ngrok pattern.
Removed
- Supabase Edge Function MCP deployment.
scripts/deploy-remote.sh,supabase/functions/gbrain-mcp/,src/edge-entry.ts,.env.production.example,docs/mcp/CHATGPT.mdall removed. The Edge Function never worked reliably. Self-hosted + ngrok is the path.
[0.7.0] - 2026-04-11
Added
- Your brain now runs locally with zero infrastructure. PGLite (Postgres 17.5 compiled to WASM) gives you the exact same search quality as Supabase, same pgvector HNSW, same pg_trgm fuzzy matching, same tsvector full-text search. No server, no subscription, no API keys needed for keyword search.
gbrain initand you're running in 2 seconds. - Smart init defaults to local.
gbrain initnow creates a PGLite brain by default. If your repo has 1000+ markdown files, it suggests Supabase for scale.--supabaseand--pgliteflags let you choose explicitly. - Migrate between engines anytime.
gbrain migrate --to supabasetransfers your entire brain (pages, chunks, embeddings, tags, links, timeline) to remote Postgres with manifest-based resume.gbrain migrate --to pglitegoes the other way. Embeddings copy directly, no re-embedding needed. - Pluggable engine factory.
createEngine()dynamically loads the right engine from config. PGLite WASM is never loaded for Postgres users. - Search works without OpenAI.
hybridSearchnow checks forOPENAI_API_KEYbefore attempting embeddings. No key = keyword-only search. No more crashes when you just want to search your local brain. - Your brain gets new senses automatically. Integration recipes teach your agent how to wire up voice calls, email, Twitter, and calendar into your brain. Run
gbrain integrationsto see what's available. Your agent reads the recipe, asks for API keys, validates each one, and sets everything up. Markdown is code -- the recipe IS the installer. - Voice-to-brain: phone calls create brain pages. The first recipe: Twilio + OpenAI Realtime voice agent. Call a number, talk, and a structured brain page appears with entity detection, cross-references, and a summary posted to your messaging app. Opinionated defaults: caller screening, brain-first lookup, quiet hours, thinking sounds. The smoke test calls YOU (outbound) so you experience the magic immediately.
gbrain integrationscommand. Six subcommands for managing integration recipes:list(dashboard of senses + reflexes),show(recipe details),status(credential checks with direct links to get missing keys),doctor(health checks),stats(signal analytics),test(recipe validation).--jsonon every subcommand for agent-parseable output. No database connection needed.- Health heartbeat. Integrations log events to
~/.gbrain/integrations/<id>/heartbeat.jsonl. Status checks detect stale integrations and include diagnostic steps. - 17 individually linkable SKILLPACK guides. The 1,281-line monolith is now broken into standalone guides at
docs/guides/, organized by category. Each guide is individually searchable and linkable. The SKILLPACK index stays at the same URL (backward compatible). - "Getting Data In" documentation. New
docs/integrations/with a landing page, recipe format documentation, credential gateway guide, and meeting webhook guide. Explains the deterministic collector pattern: code for data, LLMs for judgment. - Architecture and philosophy docs.
docs/architecture/infra-layer.mddocuments the shared foundation (import, chunk, embed, search).docs/ethos/THIN_HARNESS_FAT_SKILLS.mdis Garry's essay on the architecture philosophy with an agent decision guide.docs/designs/HOMEBREW_FOR_PERSONAL_AI.mdmaps the 10-star vision.
Changed
- Engine interface expanded. Added
runMigration()(replaces internal driver access for schema migrations) andgetChunksWithEmbeddings()(loads embedding data for cross-engine migration). - Shared utilities extracted.
validateSlug,contentHash, and row mappers moved frompostgres-engine.tstosrc/core/utils.ts. Both engines share them. - Config infers engine type. If
database_pathis set butengineis missing, config now inferspgliteinstead of defaulting topostgres. - Import serializes on PGLite. Parallel workers are Postgres-only. PGLite uses sequential import (single-connection architecture).
[0.6.1] - 2026-04-10
Fixed
- Import no longer silently drops files with "..." in the name. The path traversal check rejected any filename containing two consecutive dots, killing 1.2% of files in real-world corpora (YouTube transcripts, TED talks, podcast titles). Now only rejects actual traversal patterns like
../. Community fix wave, 8 contributors. - Import no longer crashes on JavaScript/TypeScript projects. The file walker crashed on
node_modulesdirectories and broken symlinks. Now skipsnode_modulesand handles broken symlinks gracefully with a warning. gbrain initexits cleanly after setup. Previously hung forever because stdin stayed open. Now pauses stdin after reading input.- pgvector extension auto-created during init. No more copy-pasting SQL into the Supabase editor.
gbrain initnow runsCREATE EXTENSION IF NOT EXISTS vectorautomatically, with a clear fallback message if it can't. - Supabase connection string hint matches current dashboard UI. Updated navigation path to match the 2026 Supabase dashboard layout.
- Hermes Agent link fixed in README. Pointed to the correct NousResearch GitHub repo.
Changed
- Search is faster. Keyword search now runs in parallel with the embedding pipeline instead of waiting for it. Saves ~200-500ms per hybrid search call.
- .mdx files are now importable. The import walker, sync filter, and slug generator all recognize
.mdxalongside.md.
Added
- Community PR wave process documented in CLAUDE.md for future contributor batches.
Contributors
Thank you to everyone who reported bugs, submitted fixes, and helped make GBrain better:
- @orendi84 — slug validator ellipsis fix (PR #31)
- @mattbratos — import walker resilience + MDX support (PRs #26, #27)
- @changergosum — init exit fix + auto pgvector (PRs #17, #18)
- @eric-hth — Supabase UI hint update (PR #30)
- @irresi — parallel hybrid search (PR #8)
- @howardpen9 — Hermes Agent link fix (PR #34)
- @cktang88 — the thorough 12-bug report that drove v0.6.0 (Issue #22)
- @mvanhorn — MCP schema handler fix (PR #25)
[0.6.0] - 2026-04-10
Added
- Access your brain from any AI client. Deploy GBrain as a serverless remote MCP endpoint on your existing Supabase instance. Works with Claude Desktop, Claude Code, Cowork, and Perplexity Computer. One URL, bearer token auth, zero new infrastructure. Clone the repo, fill in 3 env vars, run
scripts/deploy-remote.sh, done. - Per-client setup guides in
docs/mcp/for Claude Code, Claude Desktop, Cowork, Perplexity, and ChatGPT (coming soon, requires OAuth 2.1). Also documents Tailscale Funnel and ngrok as self-hosted alternatives. - Token management via standalone
src/commands/auth.ts. Create, list, revoke per-client bearer tokens. Includes smoke test:auth.ts test <url> --token <token>verifies the full pipeline (initialize + tools/list + get_stats) in 3 seconds. - Usage logging via
mcp_request_logtable. Every remote tool call logs token name, operation, latency, and status for debugging and security auditing. - Hardened health endpoint at
/health. Unauthenticated: 200/503 only (no info disclosure). Authenticated: checks postgres, pgvector, and OpenAI API key status.
Fixed
- MCP server actually connects now. Handler registration used string literals (
'tools/list' as any) instead of SDK typed schemas. Replaced withListToolsRequestSchemaandCallToolRequestSchema. Without this fix,gbrain servesilently failed to register handlers. (Issue #9) - Search results no longer flooded by one large page. Keyword search returned ALL chunks from matching pages. Now returns one best chunk per page via
DISTINCT ON. (Issue #22) - Search dedup no longer collapses to one chunk per page. Layer 1 kept only the single highest-scoring chunk per slug. Now keeps top 3, letting later dedup layers (text similarity, cap per page) do their job. (Issue #22)
- Transactions no longer corrupt shared state. Both
PostgresEngine.transaction()anddb.withTransaction()swapped the shared connection reference, breaking under concurrent use. Now uses scoped engine viaObject.createwith no shared state mutation. (Issue #22) - embed --stale no longer wipes valid embeddings.
upsertChunks()deleted all chunks then re-inserted, writing NULL for chunks without new embeddings. Now uses UPSERT (INSERT ON CONFLICT UPDATE) with COALESCE to preserve existing embeddings. (Issue #22) - Slug normalization is consistent.
pathToSlug()preserved case whileinferSlug()lowercased. NowvalidateSlug()enforces lowercase at the validation layer, covering all entry points. (Issue #22) - initSchema no longer reads from disk at runtime. Both schema loaders used
readFileSyncwithimport.meta.url, which broke in compiled binaries and Deno Edge Functions. Schema is now embedded at build time viascripts/build-schema.sh. (Issue #22) - file_upload actually uploads content. The operation wrote DB metadata but never called the storage backend. Fixed in all 3 paths (operation, CLI upload, CLI sync) with rollback semantics. (Issue #22)
- S3 storage backend authenticates requests.
signedFetch()was just unsignedfetch(). Replaced with@aws-sdk/client-s3for proper SigV4 signing. Supports R2/MinIO viaforcePathStyle. (Issue #22) - Parallel import uses thread-safe queue.
queue.shift()had race conditions under parallel workers. Now uses an atomic index counter. Checkpoint preserved on errors for safe resume. (Issue #22) - redirect verifies remote existence before deleting local files. Previously deleted local files unconditionally. Now checks storage backend before removing. (Issue #22)
gbrain callrespects dry_run.handleToolCall()hardcodeddryRun: false. Now reads from params. (Issue #22)
Changed
- Added
@aws-sdk/client-s3as a dependency for authenticated S3 operations. - Schema migration v2: unique index on
content_chunks(page_id, chunk_index)for UPSERT support. - Schema migration v3:
access_tokensandmcp_request_logtables for remote MCP auth.
[0.5.1] - 2026-04-10
Fixed
- Apple Notes and files with spaces just work. Paths like
Apple Notes/2017-05-03 ohmygreen.mdnow auto-slugify to clean slugs (apple-notes/2017-05-03-ohmygreen). Spaces become hyphens, parens and special characters are stripped, accented characters normalize to ASCII. All 5,861+ Apple Notes files import cleanly without manual renaming. - Existing brains auto-migrate. On first run after upgrade, a one-time migration renames all existing slugs with spaces or special characters to their clean form. Links are rewritten automatically. No manual cleanup needed.
- Import and sync produce identical slugs. Both pipelines now use the same
slugifyPath()function, eliminating the mismatch where sync preserved case but import lowercased.
[0.5.0] - 2026-04-10
Added
- Your brain never falls behind. Live sync keeps the vector DB current with your brain repo automatically. Set up a cron, use
--watch, hook into GitHub webhooks, or use git hooks. Your agent picks whatever fits its environment. Edit a markdown file, push, and within minutes it's searchable. No more stale embeddings serving wrong answers. - Know your install actually works. New verification runbook (
docs/GBRAIN_VERIFY.md) catches the silent failures that used to go unnoticed: the pooler bug that skips pages, missing embeddings, stale sync. The real test: push a correction, wait, search for it. If the old text comes back, sync is broken and the runbook tells you exactly why. - New installs set up live sync automatically. The setup skill now includes live sync (Phase H) and full verification (Phase I) as mandatory steps. Agents that install GBrain will configure automatic sync and verify it works before declaring setup complete.
- Fixes the silent page-skip bug. If your Supabase connection uses the Transaction mode pooler, sync silently skips most pages. The new docs call this out as a hard prerequisite with a clear fix (switch to Session mode). The verification runbook catches it by comparing page count against file count.
[0.4.2] - 2026-04-10
Changed
- All GitHub Actions pinned to commit SHAs across test, e2e, and release workflows. Prevents supply chain attacks via mutable version tags.
- Workflow permissions hardened:
contents: readon test and e2e workflows limits GITHUB_TOKEN blast radius. - OpenClaw CI install pinned to v2026.4.9 instead of pulling latest.
Added
- Gitleaks secret scanning CI job runs on every push and PR. Catches accidentally committed API keys, tokens, and credentials.
.gitleaks.tomlconfig with allowlists for test fixtures and example files.- GitHub Actions SHA maintenance rule in CLAUDE.md so pins stay fresh on every
/shipand/review. - S3 Sig V4 TODO for future implementation when S3 storage becomes a deployment path.
[0.4.1] - 2026-04-09
Added
gbrain check-updatecommand with--jsonoutput. Checks GitHub Releases for new versions, compares semver (minor+ only, skips patches), fetches and parses changelog diffs. Fail-silent on network errors.- SKILLPACK Section 17: Auto-Update Notifications. Full agent playbook for the update lifecycle: check, notify, consent, upgrade, skills refresh, schema sync, report. Never auto-upgrades without user permission.
- Standalone SKILLPACK self-update for users who load the skillpack directly without the gbrain CLI. Version markers in SKILLPACK and RECOMMENDED_SCHEMA headers, with raw GitHub URL fetching.
- Step 7 in the OpenClaw install paste: daily update checks, default-on. User opts into being notified about updates, not into automatic installs.
- Setup skill Phase G: conditional auto-update offer for manual install users.
- Schema state tracking via
~/.gbrain/update-state.json. Tracks which recommended schema directories the user adopted, declined, or added custom. Future upgrades suggest new additions without re-suggesting declined items. skills/migrations/directory convention for version-specific post-upgrade agent directives.- 20 unit tests and 5 E2E tests for the check-update command, covering version comparison, changelog extraction, CLI wiring, and real GitHub API interaction.
- E2E test DB lifecycle documentation in CLAUDE.md: spin up, run tests, tear down. No orphaned containers.
Changed
detectInstallMethod()exported fromupgrade.tsfor reuse bycheck-update.
Fixed
- Semver comparison in changelog extraction was missing major-version guard, causing incorrect changelog entries to appear when crossing major version boundaries.
[0.4.0] - 2026-04-09
Added
gbrain doctorcommand with--jsonoutput. Checks pgvector extension, RLS policies, schema version, embedding coverage, and connection health. Agents can self-diagnose issues.- Pluggable storage backends: S3, Supabase Storage, and local filesystem. Choose where binary files live independently of the database. Configured via
gbrain initor environment variables. - Parallel import with per-worker engine instances. Large brain imports now use multiple database connections concurrently instead of a single serial pipeline.
- Import resume checkpoints. If
gbrain importis interrupted, it picks up where it left off instead of re-importing everything. - Automatic schema migration runner. On connect, gbrain detects the current schema version and applies any pending migrations without manual intervention.
- Row-Level Security (RLS) enabled on all tables with
BYPASSRLSsafety check. Every query goes through RLS policies. --jsonflag ongbrain initandgbrain importfor machine-readable output. Agents can parse structured results instead of scraping CLI text.- File migration CLI (
gbrain files migrate) for moving files between storage backends. Two-way-door: test with--dry-run, migrate incrementally. - Bulk chunk INSERT for faster page writes. Chunks are inserted in a single statement instead of one-at-a-time.
- Supabase smart URL parsing: automatically detects and converts IPv6-only pooler URLs to the correct connection format.
- 56 new unit tests covering doctor, storage backends, file migration, import resume, slug validation, setup branching, Supabase admin, and YAML parsing. Test suite grew from 9 to 19 test files.
- E2E tests for parallel import concurrency and all new features.
Fixed
validateSlugnow accepts any filename characters (spaces, unicode, special chars) instead of rejecting non-alphanumeric slugs. Apple Notes and other real-world filenames import cleanly.- Import resilience: files over 5MB are skipped with a warning instead of crashing the pipeline. Errors in individual files no longer abort the entire import.
gbrain initdetects IPv6-only Supabase URLs and adds the requiredpgvectorcheck during setup.- E2E test fixture counts, CLI argument parsing, and doctor exit codes cleaned up.
Changed
- Setup skill and README rewritten for agent-first developer experience.
- Maintain skill updated with RLS verification, schema health checks, and
nohuphints for large embedding jobs.
[0.3.0] - 2026-04-08
Added
- Contract-first architecture: single
operations.tsdefines ~30 shared operations. CLI, MCP, and tools-json all generated from the same source. Zero drift. OperationErrortype with structured error codes (page_not_found,invalid_params,embedding_failed, etc.). Agents can self-correct.dry_runparameter on all mutating operations. Agents preview before committing.importFromContent()split fromimportFile(). Both share the same chunk+embed+tag pipeline, butimportFromContentworks from strings (used byput_page). Wrapped inengine.transaction().- Idempotency hash now includes ALL fields (title, type, frontmatter, tags), not just compiled_truth + timeline. Metadata-only edits no longer silently skipped.
get_pagenow supports optionalfuzzy: truefor slug resolution. Returnsresolved_slugso callers know what happened.queryoperation now supportsexpandtoggle (default true). Both CLI and MCP get the same control.- 10 new operations wired up:
put_raw_data,get_raw_data,resolve_slugs,get_chunks,log_ingest,get_ingest_log,file_list,file_upload,file_url. - OpenClaw bundle plugin manifest (
openclaw.plugin.json) with config schema, MCP server config, and skill listing. - GitHub Actions CI: test on push/PR, multi-platform release builds (macOS arm64 + Linux x64) on version tags.
gbrain init --non-interactiveflag for plugin mode (accepts config via flags/env vars, no TTY required).- Post-upgrade version verification in
gbrain upgrade. - Parity test (
test/parity.test.ts) verifies structural contract between operations, CLI, and MCP. - New
setupskill replacinginstall: auto-provision Supabase via CLI, AGENTS.md injection, target TTHW < 2 min. - E2E test suite against real Postgres+pgvector. 13 realistic fixtures (miniature brain with people, companies, deals, meetings, concepts), 14 test suites covering all operations, search quality benchmarks, idempotency stress tests, schema validation, and full setup journey verification.
- GitHub Actions E2E workflow: Tier 1 (mechanical) on every PR, Tier 2 (LLM skills via OpenClaw) nightly.
docker-compose.test.ymland.env.testing.examplefor local E2E development.
Fixed
- Schema loader in
db.tsbroke on PL/pgSQL trigger functions containing semicolons inside$$blocks. Replaced per-statement execution with singleconn.unsafe()call. traverseGraphquery failed with "could not identify equality operator for type json" when usingSELECT DISTINCTwithjson_agg. Changed tojsonb_agg.
Changed
src/mcp/server.tsrewritten from ~233 to ~80 lines. Tool definitions and dispatch generated from operations[].src/cli.tsrewritten. Shared operations auto-registered from operations[]. CLI-only commands (init, upgrade, import, export, files, embed) kept as manual registrations.tools-jsonoutput now generated FROM operations[]. Third contract surface eliminated.- All 7 skills rewritten with tool-agnostic language. Works with both CLI and MCP plugin contexts.
- File schema:
storage_urlcolumn dropped,storage_pathis the only identifier. URLs generated on demand viafile_urloperation. - Config loading: env vars (
GBRAIN_DATABASE_URL,DATABASE_URL,OPENAI_API_KEY) override config file values. Plugin config injected via env vars.
Removed
- 12 command files migrated to operations.ts: get.ts, put.ts, delete.ts, list.ts, search.ts, query.ts, health.ts, stats.ts, tags.ts, link.ts, timeline.ts, version.ts.
storage_urlcolumn from files table.
[0.2.0.2] - 2026-04-07
Changed
- Rewrote recommended brain schema doc with expanded architecture: database layer (entity registry, event ledger, fact store, relationship graph) presented as the core architecture, entity identity and deduplication, enrichment source ordering, epistemic discipline rules, worked examples showing full ingestion chains, concurrency guidance, and browser budget. Smoothed language for open-source readability.
[0.2.0.1] - 2026-04-07
Added
- Recommended brain schema doc (
docs/GBRAIN_RECOMMENDED_SCHEMA.md): full MECE directory structure, compiled truth + timeline pages, enrichment pipeline, resolver decision tree, skill architecture, and cron job recommendations. The OpenClaw paste now links to this as step 5.
Changed
- First-time experience rewritten. "Try it" section shows your own data, not fictional PG essays. OpenClaw paste references the GitHub repo, includes bun install fallback, and has the agent pick a dynamic query based on what it imported.
- Removed all references to
data/kindling/(a demo corpus directory that never existed).
[0.2.0] - 2026-04-05
Added
- You can now keep your brain current with
gbrain sync, which uses git's own diff machinery to process only what changed. No more 30-second full directory walks when 3 files changed. - Watch mode (
gbrain sync --watch) polls for changes and syncs automatically. Set it and forget it. - Binary file management with
gbrain filescommands (list, upload, sync, verify). Store images, PDFs, and audio in Supabase Storage instead of clogging your git repo. - Install skill (
skills/install/SKILL.md) that walks you through setup from scratch, including Supabase CLI magic path for zero-copy-paste onboarding. - Import and sync now share a checkpoint. Run
gbrain import, thengbrain sync, and it picks up right where import left off. Zero gap. - Tag reconciliation on reimport. If you remove a tag from your markdown, it actually gets removed from the database now.
gbrain config showredacts database passwords so you can safely share your config.updateSlugengine method preserves page identity (page_id, chunks, embeddings) across renames. Zero re-embedding cost.sync_brainMCP tool returns structured results so agents know exactly what changed.- 20 new sync tests (39 total across 3 test files)
[0.1.0] - 2026-04-05
Added
- Pluggable engine interface (
BrainEngine) with full Postgres + pgvector implementation - 25+ CLI commands: init, get, put, delete, list, search, query, import, export, embed, stats, health, link/unlink/backlinks/graph, tag/untag/tags, timeline/timeline-add, history/revert, config, upgrade, serve, call
- MCP stdio server with 20 tools mirroring all CLI operations
- 3-tier chunking: recursive (delimiter-aware), semantic (Savitzky-Golay boundary detection), LLM-guided (Claude Haiku topic shifts)
- Hybrid search with Reciprocal Rank Fusion merging vector + keyword results
- Multi-query expansion via Claude Haiku (2 alternative phrasings per query)
- 4-layer dedup pipeline: by source, cosine similarity, type diversity, per-page cap
- OpenAI embedding service (text-embedding-3-large, 1536 dims) with batch support and exponential backoff
- Postgres schema with pgvector HNSW, tsvector (trigger-based, spans timeline_entries), pg_trgm fuzzy slug matching
- Smart slug resolution for reads (fuzzy match via pg_trgm)
- Page version control with snapshot, history, and revert
- Typed links with recursive CTE graph traversal (max depth configurable)
- Brain health dashboard (embed coverage, stale pages, orphans, dead links)
- Stale alert annotations in search results
- Supabase init wizard with CLI auto-provision fallback
- Slug validation to prevent path traversal on export
- 6 fat markdown skills: ingest, query, maintain, enrich, briefing, migrate
- ClawHub manifest for skill distribution
- Full design docs: GBRAIN_V0 spec, pluggable engine architecture, SQLite engine plan