OpenExpertise

🧱

Code-as-Law

YAML schema validates the graph structure before runtime. LLMs only fill the gaps inside nodes — they can't rewrite the flow at runtime. No drift, no surprises, the same DAG every run.

🧩

6 node kinds, one graph

tool (deterministic code) · agent (LLM with structured output) · skill (SKILL.md packages) · dataset (file / SQLite / HTTP) · experience (nested) · cli-agent (Claude Code / Codex / Gemini).

💾

Persistent SQLite state

Every node's writes land in a typed blackboard. oe state findings works hours later. Resume with oe resume <run-id> and replay cached steps.

🧬

Self-improving

oe evolve <run-id> reads the events + state diff and proposes graph upgrades as git apply-ready diffs. oe evolve --runs a,b,c finds stable patterns across runs, not one-off blips. The author → run → evolve loop closes.

🔗

Two-way agentic-CLI integration

Outbound — delegate a node to Claude Code / Codex / Gemini. Inbound — oe-mcp exposes 8 OE tools (incl. oe_graph) so the same CLIs can run, render, and evolve experiences from their own sessions.

⚡

Parallel + 429-aware

--concurrency N runs independent nodes (and for_each iterations) in parallel. Both Anthropic and OpenAI clients retry on HTTP 429 with exponential backoff.

🎯

Quality-loop authoring

oe ultra "<task>" synthesizes a complete experience.yaml + tool stubs + prompts, then runs an internal critique→revise loop — a critic scores the draft and an incremental reviser fixes it, keeping the best round. oe ultra-revise applies your feedback to an existing draft.

📊

Visualize & share runs

oe graph renders the DAG as a Mermaid flowchart you can paste into a README. oe inspect --html emits a self-contained run report — the graph colored by per-node status, an events timeline, and per-node tokens & duration.

🪟

htop-grade observability

--tui shows each node's live status, current activity ("calling claude-sonnet-4-6"), accumulated per-node tokens, and a run-level total in the header.

✨

Editor autocomplete

oe init wires a JSON Schema + $schema header into every new experience, so VS Code / any yaml-language-server editor gives autocomplete, hover docs, and inline validation. Existing projects — oe schema --write.

📦

Multi-LLM provider

Anthropic + OpenAI out of the box. OPENAI_BASE_URL redirects OpenAI calls to any compatible endpoint — vLLM, Ollama, LM Studio, your own internal API.

Example	What it shows	Featuring
`hello-tool`	Smallest possible flow	`tool`
`agent-echo`	Single LLM agent with structured output	`agent`
`dataset-aggregate`	Load CSV → aggregate	`dataset` + `tool`
`review-branch` ★	The hero demo — multi-dim review + verifier + score + evolution	`tool` + `agent` ×3
`oncall-runbook`	Fan out an investigation across 3 dimensions	`for_each`
`issue-triage`	Classify → search dupes → conditional dedup → route	`when:` edges
`release-gates`	License + changelog + coverage + Claude-Code security scan → gate	`tool` + `cli-agent` + `agent`
`cli-orchestration`	Claude Code summarizes; Codex critiques	`cli-agent` ×2
`tri-cli-orchestration` ★	Claude → Codex → Gemini in one DAG	`cli-agent` ×3
`deep-research`	Multi-source research with cross-referencing	`agent` fan-in
`systematic-debugging`	Hypothesize → localize → fix → verify loop	`tool` + `agent`
`brainstorming`	Diverge → cluster → critique → synthesize top 3	`cli-agent` fan-out + `agent`

OpenExpertiseAI-era Makefile

Code-as-Law

6 node kinds, one graph

Persistent SQLite state

Self-improving

Two-way agentic-CLI integration

Parallel + 429-aware

Quality-loop authoring

Visualize & share runs

htop-grade observability

Editor autocomplete

Multi-LLM provider

What is it, really?

The 60-second story

Three rival AI coding CLIs talking to each other, in one graph

Pick the example closest to your use case

When should I reach for OpenExpertise?

OpenExpertiseAI-era Makefile

Code-as-Law

6 node kinds, one graph

Persistent SQLite state

Self-improving

Two-way agentic-CLI integration

Parallel + 429-aware

Quality-loop authoring

Visualize & share runs

htop-grade observability

Editor autocomplete

Multi-LLM provider

What is it, really? ​

The 60-second story ​

Three rival AI coding CLIs talking to each other, in one graph ​

Pick the example closest to your use case ​

When should I reach for OpenExpertise? ​

What is it, really?

The 60-second story

Three rival AI coding CLIs talking to each other, in one graph

Pick the example closest to your use case

When should I reach for OpenExpertise?