Blog Redesign: Agent-Native Design System

Problem

kyle.pericak.com works but looks dated and is built on an incoherent styling foundation. Verified from the repo (apps/blog/blog/):

Three styling systems coexist. Material UI 5 + Emotion is the real one (a createTheme in pages/_app.js with bespoke typography variants and four hardcoded hex colors). On top of that, [email protected] is installed but inert — tailwind.config.ts scans ./src/** globs that don't exist, so it emits no utilities — and styled-components@6 sits in dependencies as further dead weight.
No design tokens. Colors are hardcoded hex (#337ab7, #333333, …) inside the MUI theme. There is no single source of truth; changing the brand color means hunting through sx props and _app.js.
No dark mode. Single light theme only.
No component library or component documentation. Six ad-hoc components (SiteLayout, IndexPage, BlogPostContentPage, BlogSidebar, WikiPage, Pagination) with styling inline via sx. Nothing renders a component in isolation; there is no catalogue.
Visual quality is "2014 Bootstrap." The look does not represent the technical sophistication of the content or the agent tooling behind it.

The blog is the public face of a heavily AI-native engineering practice. Right now the content demonstrates that sophistication and the design undercuts it. Kyle wants the design to become a showpiece in its own right — something he can put in front of other full-stack experts as a proof of how a small team (one human + an agent fleet) builds a first-class, modern frontend.

Goal

Give the blog a clearly-defined, modern, demonstrable design system — token-driven, documented in a real component library, built on mainstream building blocks chosen to last 3–5+ years — and ship a visibly nicer redesign on top of it, executed mostly autonomously by the agent team.

Success Metrics

Single source of visual truth: 100% of the redesigned UI's colors, spacing, type, radius, and shadows resolve to named design tokens. A grep guard finds zero raw hex / raw px (except 0/1px) in component code. MUI, Emotion, dead Tailwind v3, and styled-components are fully removed from dependencies.
Demonstrable component library: Every shipped component has a Storybook story showing its canonical states; the static Storybook builds clean and is the artifact Kyle shows other engineers.
Autonomy: ≥80% of the implementation tasks (by task count in the design doc) are completed by the agent pipeline / Claude Code sessions with Kyle reviewing PRs rather than writing code.
Quality bar held: Lighthouse/axe accessibility ≥ the current site on every redesigned page (target: zero serious/critical axe violations), and the production static export still builds and deploys unchanged.

Non-Goals

Not a framework migration. Stay on Next.js 15 Pages Router with output: 'export'. No App Router, no SSR, no re-platform.
Not a content migration. The markdown posts, the remark→HTML pipeline, Prism syntax highlighting, and the .html internal-link convention stay as-is. The design system styles the rendered output; it does not change how posts are authored.
Not a full TypeScript migration of the blog. New design-system code is authored in TypeScript (the tooling is already present), but legacy pages/*.js are not force-converted beyond what a component swap requires.
Not a rebrand of identity (logo, name, voice). This is a visual system refresh, not a brand exercise — though a refreshed, tokenized color palette and type scale are in scope.
Not a public component registry / npm package in v1. The library is internal to apps/blog. (Publishing a shadcn-compatible registry is a noted future option, not a v1 deliverable.)

User Stories

Story: Kyle demos the design system to other full-stack experts

As the blog's owner, I want a browsable component library with a documented token system, so that I can show another senior engineer "here's the design system" and have it stand up to scrutiny.

Acceptance criteria:

A Storybook instance renders every component in its canonical states (default, variants, dark mode), buildable with one command and as a static site.
Tokens (color, space, type, radius, shadow) are visible and documented — a "foundations" section in Storybook plus the token source files.
The stack is composed of named, mainstream tools a reviewer would recognize as current best practice (Tailwind, Storybook, an accessible primitive layer, a token pipeline).

Story: Kyle gets a visibly modern, nicer-looking blog

As a reader-facing outcome, I want the blog to look modern and polished, so that the design matches the sophistication of the content.

Acceptance criteria:

Home, post, wiki, and about pages are visually refreshed against the new token system (improved type scale, spacing rhythm, color, and component polish).
Dark mode is supported and toggles without a flash or layout shift.
Redesigned pages are responsive and verified at mobile / tablet / desktop breakpoints via the Playwright MCP screenshot loop.

Story: The agent team builds and maintains it with Kyle mostly out of the loop

As the operator of an agent fleet, I want the redesign executed and maintained primarily by agents, so that I review and steer rather than hand-code.

Acceptance criteria:

The work is decomposed into dependency-ordered tasks (in the design doc) that a Claude Code /goal session can pick up and execute one at a time.
A machine-checkable verification gate (bin/verify-design-system.sh or equivalent: build + lint + Storybook build + Playwright/axe) lets an agent self-verify a change before opening a PR.
A short apps/blog/blog/design-system/AGENTS.md (mirrored through .ruler/) tells any agent how to add, change, or remove a component so future maintenance is "ask an agent," not "relearn the system."

Story: A future component change is straightforward through AI

As the maintainer, I want adding or changing a component to be a simple, well-paved request to an agent, so that maintenance stays cheap.

Acceptance criteria:

Component source lives as readable, checked-in files (not opaque node_modules), so an agent can open and edit them directly.
There is a documented, repeatable path ("add a component", "change a token") an agent can follow end to end, ending in a green verification gate.

Scope

In scope (v1):

A token layer (color, spacing, typography, radius, shadow, breakpoints) as the single source of truth, surfaced as CSS variables.
Tailwind (upgraded and correctly wired) as the styling layer consuming those tokens.
An accessible component library covering the blog's real surface: layout primitives, site header/footer/nav, post card + post layout, code block, callout, tag/category, pagination, breadcrumbs, table of contents, and markdown element styling (the <Prose> wrapper).
Storybook as the component workshop, documentation surface, and visual
- a11y test host.
Dark mode via token swap.
Full removal of MUI, Emotion, dead Tailwind v3 config, and styled-components by the end.
Agent-facing docs + a verification gate that make the system autonomously buildable and maintainable.

Deferred:

Publishing a public/shared component registry or npm package.
Figma / Tokens Studio design-tool integration (only if Kyle starts designing in Figma).
Hosted visual-review SaaS (Chromatic) — adopt only if local visual regression proves too noisy at scale.
Any redesign of non-blog apps in the monorepo.

Decisions (resolved 2026-06-13)

Kyle's calls on the original open questions:

Component model: shadcn/ui-seeded owned .tsx. Kept small and Storybook-documented so "maintain via AI" stays tractable. (Not a versioned npm library.)
Visual direction: agents generate 3 mockups; Kyle picks one before mass component work. This is the one intentional human gate.
React: bias to newest → upgrade to React 19 as part of the work (shadcn/ui targets 19; aligns with the "modern, 3–5yr" goal).
TypeScript depth: new DS code in .tsx, legacy pages stay .js.
Token format: Tailwind v4 @theme is the load-bearing source of truth; DTCG export optional and non-blocking (removes the young-spec risk Kyle flagged).
Tailwind: upgrade the inert v3 to v4 for the @theme token model.
Autonomy routing: a /goal session, not autolearn Linear sub-issues. Kyle pastes a goal that drives Claude Code through the design doc's task breakdown until the verification gate is green.

Open Questions

None blocking. Remaining choices (exact palette/type for the 3 mockups, whether to publish a registry later) are deferred and surface during implementation.

Risks

Three-way styling coexistence inflates the bundle mid-migration. MUI + Tailwind running together temporarily is expected; the metric (remove MUI/Emotion/styled-components) only lands at the end. Mitigation: page-by-page migration with the old system removed last.
Visual regression is noisy (anti-aliasing, fonts, emoji across machines). Mitigation: run snapshots in the existing Docker/Cloud Build environment, set diff thresholds, and lean on Storybook's isolated rendering rather than full-page diffs.
output: 'export' constraints. No SSR theming; dark mode must be a no-flash inline script + CSS-variable swap, not a server decision. Mitigation: standard class/data-theme on <html> with a tiny blocking script.
Autonomy depends on a trustworthy gate. If the verification gate is weak, autonomous PRs ship visual regressions. Mitigation: the gate (build + lint + Storybook build + Playwright/axe) is itself a v1 deliverable, landed before mass migration.
Scope creep into a full TS / App Router rewrite. Explicitly a non-goal; the design doc must keep tasks additive.

Kyle Pericak

Problem

Goal

Success Metrics

Non-Goals

User Stories

Story: Kyle demos the design system to other full-stack experts

Story: Kyle gets a visibly modern, nicer-looking blog

Story: The agent team builds and maintains it with Kyle mostly out of the loop

Story: A future component change is straightforward through AI

Scope

Decisions (resolved 2026-06-13)

Open Questions

Risks