Two complete :15 spec campaigns — fourteen final frames and two finished films — produced solo through an AI-assisted production pipeline under human art direction. Stills and motion generated via API, beat-cut in code with Remotion.
Spec work — not commissioned by or affiliated with Crocs, Inc.The test: take a real brand — tone, product line, visual language studied from the live platform — and deliver two distinct, finished social campaigns using current-generation image and video models, with every creative decision made by a human art director and every frame measured against a single bar: it has to pass as real, or it doesn’t ship.
The hard part of AI production isn't generating images. It's consistency — the same camera, the same product colorways, the same character, frame after frame, until nine separate generations read as one campaign. That's a discipline problem, not a prompting problem, and it's what this project was built to prove.
One fixed top-down camera. Feet in Crocs, dead center, every frame — everything else changes. Nine ordinary moments from one life, beat-cut at 124 BPM into a :15 loop. The discipline of the locked composition is what makes nine generations read as one campaign.









One character, one fixed frame, five looks — the Crocs change with every outfit. Hard cuts land mid-gesture; the styling coordination is the content. Character consistency held across every still and clip from a single locked face reference.






Brand study from the live platform; tone pillars and a hard avoid-list set before any generation.
Product reference library built from catalog photography — colorways pixel-verified, geometry treated as law.
One hero frame iterated to approval, then locked as the composition reference for every frame after it.
Generated against the locks, zoom-QA’d per select; anything that doesn’t pass as real gets cut.
Micro-motion only — subjects frozen, environments alive. Identity refs ride along on every clip.
Cut grids built in code from each track’s measured BPM; one global grade across every select.
Anyone can generate a good-looking AI image. The difference between a demo and a campaign is whether frame seven matches frame one — same camera, same colorway, same charms in the same holes. That takes reference discipline, a hard bar — every frame has to pass as real — and an art director willing to throw out good frames that don't match.
This project demonstrates a repeatable brief-to-campaign pipeline: brand study, reference locks, generation under constraint, zoom-level QA, and a code-driven edit — with a full generation log behind every select. The pipeline produced two campaigns for one brand. The process generalizes to any brand.