Building Your Own AI-Powered CMS (2026) — A Stack-Agnostic Architecture & Blueprint

Chapter 10: Agentic Editorial Workflows

Chapter 10 of 22 · ~17 min read

Overview

This chapter is about turning a content idea into a published, fact-checked, SEO-ready page using a chain of cooperating LLM "agents" — and doing it without losing your sanity, your audit trail, or your editorial standards. We cover how to decompose the editorial lifecycle (research → draft → edit → fact-check → SEO → publish) into discrete steps, how to choose an orchestration substrate (n8n, queues, LangGraph, Temporal/Inngest, or hand-rolled), where to place human-in-the-loop (HITL) checkpoints, the "five LLM jobs" discipline that keeps each model call sane, and the non-negotiable production concerns — idempotency, durability, and audit. The argument throughout: an agentic editorial pipeline is a workflow with some LLM-shaped nodes, not a magical autonomous writer. Reliability comes from the workflow, not the model.

Content

10.1 The editorial lifecycle as a directed graph

The classic content lifecycle — brief, research, outline, draft, edit, fact-check, optimize for search, schedule, publish, distribute — maps cleanly onto a directed graph where each node is a bounded task and each edge is a state transition with a precondition. Modeling it as a graph rather than a single "write me an article" prompt is the central design decision of this chapter, because it lets you (a) put a deterministic check between every probabilistic step, (b) retry or roll back individual nodes, and (c) insert humans exactly where editorial judgment is irreplaceable.
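
The graph idea can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a reference implementation: `Node`, `run_graph`, and the two `fake_*` functions are hypothetical names, and the `fake_*` functions stand in for real LLM calls. It shows a deterministic check gating each node's output and a per-node retry budget.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

State = Dict[str, object]

@dataclass
class Node:
    name: str
    run: Callable[[State], State]     # the bounded task (LLM call or plain code)
    check: Callable[[State], bool]    # deterministic gate applied to the node's output
    next: Optional[str] = None        # edge followed when the check passes
    max_attempts: int = 2             # per-node retry budget (illustrative value)

def run_graph(nodes: Dict[str, Node], start: str, state: State) -> State:
    current: Optional[str] = start
    while current is not None:
        node = nodes[current]
        for _ in range(node.max_attempts):
            # Pass a shallow copy so a rejected attempt never replaces accepted state.
            candidate = node.run(dict(state))
            if node.check(candidate):
                state = candidate
                break
        else:
            # Every attempt failed the check: stop here instead of passing bad output on.
            raise RuntimeError(f"node {node.name!r} failed its check")
        current = node.next
    return state

# Hypothetical stand-ins for LLM-backed nodes.
def fake_outline(state: State) -> State:
    return {**state, "outline": ["Intro", "Method", "Results"]}

def fake_draft(state: State) -> State:
    return {**state, "draft": " ".join(["word"] * 120)}

nodes = {
    "outline": Node("outline", fake_outline,
                    lambda s: len(s["outline"]) >= 3, next="draft"),
    "draft": Node("draft", fake_draft,
                  lambda s: 100 <= len(s["draft"].split()) <= 200),
}
result = run_graph(nodes, "outline", {"brief": "Explain idempotency keys"})
```

A human approval gate fits the same shape: it is a node whose `check` reads a recorded editor decision rather than inspecting model output.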

A reference editorial graph looks like this:

Stage	Node type	Primary LLM job	Deterministic check after	HITL?
Brief intake	Tool/code	— (parse form/webhook)	Schema validation	Optional approve
Research	Agent (tool-using)	none-as-writer; gather	Source count ≥ N, dedup, domain allowlist	—
Source vetting	LLM	classify (reliable/biased/dated)	Drop below-threshold sources	—
Outline	LLM	draft (structure only)
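
The deterministic check listed for the Research row (source count ≥ N, dedup, domain allowlist) might look like the sketch below. The allowlist contents, the value of `MIN_SOURCES`, and the URL normalization rule are all assumptions chosen for illustration; a real pipeline would define its own.

```python
from urllib.parse import urlsplit

ALLOWED_DOMAINS = {"example.org", "example.edu"}  # hypothetical allowlist
MIN_SOURCES = 2                                   # the "N" in the table; illustrative

def host_of(url: str) -> str:
    # Lowercased hostname without a leading "www." (requires Python 3.9+).
    return (urlsplit(url).hostname or "").lower().removeprefix("www.")

def dedup_key(url: str) -> str:
    # Treat URLs that differ only by scheme, "www.", query, or trailing slash as one source.
    return host_of(url) + urlsplit(url).path.rstrip("/")

def check_sources(urls):
    seen = set()
    kept = []
    for url in urls:
        if host_of(url) not in ALLOWED_DOMAINS:
            continue                  # outside the allowlist: drop
        key = dedup_key(url)
        if key in seen:
            continue                  # duplicate of an earlier source: drop
        seen.add(key)
        kept.append(url)
    return kept, len(kept) >= MIN_SOURCES

kept, passed = check_sources([
    "https://www.example.org/a/",
    "https://example.org/a",
    "https://spam.test/x",
    "https://example.edu/b",
])
```

Here the second URL is dropped as a duplicate of the first and the third fails the allowlist, so two sources remain and the check passes.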

10.3 Orchestration choices

Option	Control-flow author	Durability / retries	State & HITL	Best for editorial when…
n8n (queue mode)	Visual, low-code	Per-execution retries; queue mode separates main + worker instances	Manual approval nodes; wait-for-webhook	Non-engineers own the pipeline; you want 500+ integrations (CMS, Slack, Sheets) out of the box; ~220 exec/s single instance
Job queue + workers (BullMQ/Redis, SQS, Celery)	Your code	You build retries/dedup yourself	You build state (DB rows)	You already run a backend and want full control with minimal new infra
LangGraph	Your code (Python/JS, graph DSL)	First-class checkpointers; "time-travel" replay	First-class `interrupt()` for HITL; persistent state	The pipeline is agentic/branchy and you want pause/resume + debugging via LangSmith
Temporal / Inngest / Restate (durable execution)	Your code (workflow functions)	Effectively-once steps (completed results are journaled and not re-run; failed steps retry, so side effects still need idempotency keys), automatic resumption	Durable timers, signals for HITL	Reliability is paramount; long-running (hours/days waiting on human approval)
Custom / bespoke	Your code, fully	You own everything	You own everything	You have unusual constraints and a strong platform team
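
To make the durability column concrete, here is a hand-rolled sketch of the core idea behind durable execution: record each completed step so that a restarted run replays the stored result instead of repeating the work. It uses only the standard-library `sqlite3` module; the table name `step_journal` and the functions `open_journal` and `durable_step` are hypothetical, and this is not the API of Temporal, Inngest, Restate, or LangGraph.

```python
import json
import sqlite3

def open_journal(path=":memory:"):
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS step_journal ("
        " run_id TEXT, step TEXT, result TEXT,"
        " PRIMARY KEY (run_id, step))"
    )
    return db

def durable_step(db, run_id, step, fn):
    row = db.execute(
        "SELECT result FROM step_journal WHERE run_id = ? AND step = ?",
        (run_id, step),
    ).fetchone()
    if row is not None:
        return json.loads(row[0])   # step already completed: replay the stored result
    result = fn()                   # if this raises, nothing is recorded and a retry re-runs it
    db.execute(
        "INSERT INTO step_journal (run_id, step, result) VALUES (?, ?, ?)",
        (run_id, step, json.dumps(result)),
    )
    db.commit()
    return result

calls = []

def research():
    calls.append("research")        # counts how often the real work runs
    return {"sources": ["s1", "s2"]}

db = open_journal()
first = durable_step(db, "run-1", "research", research)
second = durable_step(db, "run-1", "research", research)  # simulated restart of the same run
```

A crash between `fn()` returning and the journal insert would cause the step to run again, so this sketch gives at-least-once execution. That is why steps with external side effects still need their own idempotency keys.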

10.7 Worked pipeline: research → publish

Intake (code). Webhook/form → validate against a brief schema → create article_id and an agent_run audit row.
Research (agent, orchestrator-worker). A research orchestrator spawns worker searches; each worker fetches and summarizes. This is the one genuinely agentic node — subtasks aren't predefined. Output: a deduped source set behind a domain allowlist. Check: ≥ N independent sources.
Source vetting (LLM — classify). Score each source for reliability/recency/bias as a constrained enum. Drop below threshold. Cheap model.
Outline (LLM — draft, structure only). Produce H2/H3 structure and the argument. HITL gate: editor approves the angle. (Cheapest correction point.)
Draft (LLM — draft, frontier model). Write the body grounded in the vetted sources, with inline source IDs. Checks: word-count band, banned-phrase regex, reading-level target.
Line-edit (LLM — critique → rewrite; evaluator-optimizer loop). A critic returns structured style/clarity feedback; an optimizer applies it; loop until the critic's score clears the bar or max iterations hit. Different model from the drafter.
Fact-check (agent — extract → decide). Extract every checkable claim; for each, retrieve evidence and decide supported / unsupported / needs-source, each with a citation and rationale. Check: every claim has ≥ 1 grounded citation. HITL exception: only flagged/unsupported claims go to a human editor. (RAG-grounded verification is reported to cut hallucination rates up to ~71% vs. ungrounded generation.)
SEO / GEO (LLM — draft + code). Draft title, meta description, and FAQ candidates; code emits JSON-LD Article/BlogPosting, Organization, and BreadcrumbList schema. Check: title/meta length, schema.org validation. (In 2026, structured data is primarily an AI-citation trust signal after Google narrowed FAQ/HowTo rich results — pages with proper schema are reported ~2.5× likelier to surface in AI answers.) Optionally update llms.txt.
Publish (code — idempotent). Render a preview; HITL gate: final publish approval. On approval, call the CMS publish API with idempotency key publish:{article_id}:{content_hash}; record the revision; mark the run complete. A re-run with the same hash is a no-op.
Distribute (code + LLM — draft). Generate per-channel social variants (length/format checked) and enqueue them, each with its own idempotency key.
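
The idempotent publish step can be illustrated with a small sketch. The key format `publish:{article_id}:{content_hash}` comes from the pipeline above; everything else is an assumption: `FakeCMS` is a hypothetical in-memory stand-in for a CMS publish API, and SHA-256 is one reasonable choice of content hash, not a requirement.

```python
import hashlib

class FakeCMS:
    """Hypothetical in-memory stand-in for a CMS publish API."""

    def __init__(self):
        self.applied = {}     # idempotency key -> revision number
        self.revisions = []   # (article_id, body) in publish order

    def publish(self, article_id, body, idempotency_key):
        if idempotency_key in self.applied:
            # Same key seen before: return the earlier revision and do nothing.
            return self.applied[idempotency_key], False
        self.revisions.append((article_id, body))
        revision = len(self.revisions)
        self.applied[idempotency_key] = revision
        return revision, True

def publish_key(article_id, body):
    content_hash = hashlib.sha256(body.encode("utf-8")).hexdigest()
    return f"publish:{article_id}:{content_hash}"

cms = FakeCMS()
body = "Final approved body"
key = publish_key("art-42", body)

rev1, created1 = cms.publish("art-42", body, key)
rev2, created2 = cms.publish("art-42", body, key)   # re-run with the same hash: no-op
edited = "Edited body"
rev3, created3 = cms.publish("art-42", edited, publish_key("art-42", edited))
```

Because the key includes the content hash, a retried or duplicated job cannot publish twice, while a genuine edit produces a new key and a new revision.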
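
The evaluator-optimizer loop in the line-edit step can be sketched as follows. The function name `evaluator_optimizer`, the pass threshold, and the iteration cap are assumptions, and `toy_critic` and `toy_optimizer` are deliberately trivial stand-ins for two different models. The point is the control flow: critique, stop if the score clears the bar, otherwise rewrite and critique again, up to a fixed budget.

```python
FILLER_WORDS = ["very", "really"]   # hypothetical style rule for the toy critic

def toy_critic(text):
    # Returns (score, notes); notes are the filler words found in the text.
    hits = [w for w in FILLER_WORDS if w in text.split()]
    return 1.0 - 0.5 * len(hits), hits

def toy_optimizer(text, notes):
    # Applies only the first note per pass, so several rounds may be needed.
    return " ".join(w for w in text.split() if w != notes[0])

def evaluator_optimizer(draft, critic, optimizer, pass_score=0.8, max_iterations=3):
    history = []
    for _ in range(max_iterations):
        score, notes = critic(draft)
        history.append(score)
        if score >= pass_score:
            return draft, history, True      # cleared the bar
        draft = optimizer(draft, notes)      # apply feedback and try again
    # Budget exhausted: score the last rewrite so the caller knows where it ended up.
    score, _ = critic(draft)
    history.append(score)
    return draft, history, score >= pass_score

text, scores, passed = evaluator_optimizer(
    "This is very really good", toy_critic, toy_optimizer
)
```

When the loop ends without passing, returning the final score and the `passed` flag lets the workflow route the draft to a human editor rather than silently accepting it.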
