Building Your Own AI-Powered CMS (2026) — A Stack-Agnostic Architecture & Blueprint

EN · 22 ch

Chapter 8: The Authoring Experience (Making It Easy to Use)

Chapter 8 of 22 · ~17 min read

Overview

A content model can be flawless, an API blazing fast, and an AI pipeline state-of-the-art — and the whole CMS will still fail if the people who write in it dread opening it every morning. This chapter makes the vague "must be easy to use" requirement concrete. It covers the three editing paradigms (structured, WYSIWYG, and visual), the four rich-text frameworks you will actually choose between in 2026 (Tiptap/ProseMirror, Lexical, Plate/Slate, and BlockNote), live preview and inline validation, media insertion, AI-in-the-editor, the role and permission model that protects non-technical editors from themselves, and the central engineering decision: build the editor or adopt one.

Content

8.1 The three editing paradigms — and why an AI-native CMS leans structured

Editors come in three families, and the choice of family is more consequential than the choice of library inside it.

Paradigm	What the editor sees	Stored as	Strength	Weakness
Structured / field-based	Discrete, typed fields and blocks (title, hero, body, CTA)	JSON tree (e.g., Portable Text), typed records	Clean data, multi-channel reuse, AI-readable	Editors can't "see the page" while writing
WYSIWYG rich text	A document that looks roughly like the output	HTML or a rich-text JSON document	Familiar (Word/Google Docs feel)	Mixes content with presentation; hard to reuse
Visual / on-page	The actual rendered page, click-to-edit	Whatever the CMS stores, mapped back via source links	Marketers love it; instant context	Couples content to one layout; brittle

For an AI-native CMS, the bias should be toward structured content with a rich-text field inside it, not a single free-form WYSIWYG blob. Sanity's own guidance captures why: "When an AI agent reads or writes content, unstructured HTML or Markdown is genuinely harder to reason about. Portable Text's JSON structure gives agents something they can work with precisely — the semantic intent of each block is explicit, not implied by markup conventions" (Sanity, , 2025). An LLM asked to "rewrite the second paragraph and leave the pull-quote alone" can address a structured block by id; against an HTML soup it must parse, guess, and re-serialize, losing fidelity each round-trip.

Framework	Foundation	Bundle (core)	Model	Best for	License
Tiptap	ProseMirror	~moderate, modular	Headless, 50+ extensions	The default "fastest path to production"	MIT core; paid Cloud/Pro features
Lexical	Built by Meta, standalone	~22 KB core	Headless, performance-first	Huge docs, mobile, scale	MIT
Plate	Slate	Component-heavy	Headless + shadcn/ui components	React + shadcn/ui + "own the components"	MIT
BlockNote	ProseMirror + Tiptap	Batteries-included	Block-based, Notion-style, styled UI out of the box	Fastest Notion-style editor with minimal code	MPL-2.0 core; GPL-3.0 "XL" packages
ProseMirror (raw)	itself	small	Low-level toolkit	Maximum control, custom schemas	MIT

Feature	Risk	Notes
Inline autocomplete (Tab)	Low	Editor stays in control; never auto-commits
Tone/grammar/readability suggestions	Low	Advisory marks, accept/reject
"Improve / shorten / expand selection"	Medium	Operate on selection, show as suggestion
AI-generated alt text & summaries/SEO	Medium	Always human-reviewed before publish
Agentic multi-step edits (AI Toolkit / agent)	Higher	Require track-changes + explicit approval
Full-draft generation	Highest	Useful for scaffolding; never auto-publish

Role	Can	Cannot
Contributor / Author	Create and edit drafts	Publish; change settings
Editor	Review, edit, approve, publish; manage editorial workflow	Change schema, roles, or site config
Publisher (optional split)	Push approved content live	Author from scratch
Admin / Developer	Schema, roles, integrations, config	—

Posture	What you do	When it fits	Risk
Adopt a CMS's editor wholesale	Use Sanity Studio / Storyblok / Contentful's editor as-is	You also adopted that CMS; team is small	You inherit their UX and pace
Adopt a framework, build the editor UI	Tiptap/Lexical/Plate/BlockNote + your own toolbar/blocks	Custom CMS, custom design system (most readers of this report)	You own UX bugs; framework churn (e.g., Tiptap AI deprecations)
Build from raw ProseMirror/Slate	Custom schema and rendering	The editor is a core differentiator	High cost; you maintain a hard problem

Building Your Own AI-Powered CMS (2026) — A Stack-Agnostic Architecture & Blueprint

Chapter 8: The Authoring Experience (Making It Easy to Use)

Overview

Content

8.1 The three editing paradigms — and why an AI-native CMS leans structured

8.2 The rich-text field: the four frameworks you'll actually choose between

8.3 Headless vs. all-in-one editors

8.4 Live preview: parallel, inline, and source-mapped

8.5 Inline validation: catch errors where they happen

8.6 Media insertion: the most-used, most-neglected workflow

8.7 AI in the editor — the 2026 state of the art

8.8 Roles for non-technical editors — protecting people from sharp edges

8.9 Accessibility of the editor itself

8.10 Build vs. adopt the editor

Key Takeaways

Key References