Tips and Techniques

How to Prompt AI Typography That Actually Works.

By Adam Morgan20 May 202612 min read

AI can generate stunning type treatments, but vague prompts produce unusable results. Here's how to describe typography with enough precision to get output you can actually use.

Why Typography Prompts Fail (and What They're Missing)

Most failed typography prompts describe a feeling. "Elegant serif." "Clean and modern." "Bold but refined." These phrases communicate atmosphere, not structure, and a language model has no more to work with than a typographer handed a mood board with no brief. The output reflects that absence.

The gap is rooted in how designers think versus how models process language. Working typographers reason in x-height ratios, optical sizing, tracking values, weight contrast, and the relationship between display and text cuts. Models respond to reference-anchored, role-specific language because their understanding of visual concepts is built from co-occurring words in training data. "Elegant serif" maps to an enormous semantic neighbourhood. "Didone-style high-contrast serif, display optical size, influenced by late-Modernist editorial covers" concentrates the model's attention in a narrower, better-documented region where better outputs live.

Three failure modes appear consistently across disciplines. A graphic designer asking for a logotype direction receives a render with letterforms that would never survive vector redraw. A motion designer gets a typeface description with no account of how weight behaves across animation states. A game developer receives a UI type recommendation with no hierarchy, no contrast logic, no acknowledgement of the four or five distinct label roles a HUD actually requires. In each case, the prompt delivered one dimension of the brief and left the rest to chance.

The fix operates at three levels simultaneously:

Classification: What kind of type is it? Display, text, label, logotype, wayfinding?
Context: Where does it live? A printed exhibition panel, a mobile game HUD, a paid social carousel, an automotive instrument cluster?
Constraint: What must it not do? No decorative terminals. Must remain legible reversed on dark surfaces. Cannot exceed two weights.

Research from the Nielsen Norman Group on AI prototyping workflows found that vague descriptors like "clean, modern dashboard" consistently produced cluttered, unfocused results. The fix, which applies directly to type prompting, was to specify named visual styles, attach real content rather than placeholder copy, and reference documented design systems rather than adjectives. That finding holds whether you are designing a packaging label, a brand identity, or a campaign landing page.

Mood words describe the effect you want. Structural descriptors describe the cause. Models need the cause.

The Anatomy of a Type Prompt That Delivers

A type prompt that reliably produces useful output follows a consistent structure. It is not a sentence. It is a stack of decisions, written in order of specificity.

The working formula: [type classification] + [weight and proportion descriptor] + [historical or stylistic reference] + [surface and scale context] + [hard constraint]

Run that against a real brief. A product designer specifying type for a premium skincare label might write: "Humanist serif text face, regular weight, moderate x-height, proportions influenced by early-twentieth-century book typography, set at 8pt on uncoated stock, no swash alternates, no condensed width." Compare that to "something elegant for a luxury label." The first prompt gives the model a documented visual language and a set of boundaries. The second gives it an aspiration and nothing else.

Classification terms that work across disciplines:

Graphic design / branding: "geometric sans display logotype", "slab serif wordmark", "transitional serif at editorial scale"
Web and UX: "variable weight UI label stack", "humanist text face for body copy, 400 and 600 weights, 16px base size, WCAG AA contrast compliant"
Game development: "condensed sans HUD label, legible at 11px on 1080p, no thin strokes, high x-height"
Motion design: "grotesque display headline, alternating between light and black weight across keyframe states, no optical size shifting between states"
Exhibition design: "wayfinding sans at 48pt minimum, bilingual spacing considered, legible under variable gallery lighting"

Historical references outperform mood words for a specific reason. "Influenced by Swiss International Typographic Style" gives the model access to a documented visual language with known characteristics: grid-based composition, flush-left setting, restrained weight contrast, and typefaces associated with a specific era of practice. "Clean and professional" gives it nothing beyond a vague preference shared by every brief ever written.

The constraint layer is where many prompts stop short. Adding a negative instruction, even a single one, significantly narrows the output space. "No decorative terminals" rules out an entire category of display faces. "Legible reversed on dark surfaces" rules out anything with hairline strokes. "Must read at 12px on screen" eliminates anything with tight apertures or low x-height. Each constraint removes a class of wrong answers before the model generates a single word.

Discipline-specific context closes the brief. A web/UX designer should name the viewport size and the weight range available in the chosen typeface. A product designer specifying type for physical packaging should name the print process, because uncoated offset demands different optical compensation than digital print. An automotive designer working on HMI type for an instrument cluster should note the screen resolution, the viewing angle range, and whether the type must comply with any regulatory legibility standards. None of this is obvious to a model. All of it is obvious to you. Put it in.

One hard constraint in a prompt does more work than three additional adjectives. Always include at least one "must not".

Model Selection for Type Work on Stensyl

Model choice inside Stensyl's Write studio and Canvas LLM Chat node directly affects the quality of type-related reasoning. The difference is not subtle. Smaller, faster models handle quick-turn ideation well. Larger models reason more consistently across complex, multi-role type systems. Matching the model to the task avoids burning credits on capability you do not need, or cutting corners on work that actually requires depth.

Task type	Recommended model	Tier
Quick headline copy with basic type direction	GPT-5.4 mini, Gemini Flash	Lite (£10/mo)
Type system documentation, multi-page design briefs	GPT-5.5, Gemini Pro	Starter (£22/mo)
Editorial work requiring tonal coherence across long outputs	Claude Sonnet 4.6, Claude Opus 4.7	Pro (£42/mo)

For rapid ideation, GPT-5.4 mini and Gemini Flash handle quick-turn prompts without burning credits on complexity. If you are a content or social designer generating headline options with broad type direction for a campaign carousel, these models return usable output fast. The reasoning is not deep, but for this class of task, depth is not the requirement.

For detailed type system documentation, brand voice guidelines that include typographic rationale, or multi-page design briefs, GPT-5.5 or Gemini Pro reason more consistently across longer structured outputs. A product design team building a component library needs a model that can hold the entire hierarchy in working memory and reason about how display, body, caption, and label roles relate to one another. The Starter tier at £22/month gives you access to both.

For nuanced editorial work where typographic tone must match a brand's existing voice, Claude Sonnet 4.6 or Claude Opus 4.7 handle conceptual coherence better across extended outputs. Consider an exhibition catalogue where type rationale must align with curatorial voice, or an automotive brand standards document where the reasoning behind typeface selection needs to hold up across a hundred pages. These are briefs where a model that loses the thread halfway through creates real rework. Both Claude models are available on the Pro tier at £42/month.

Before committing credits to a generation task, use Stensyl's Ray surface to get a model recommendation. Describe your typography task and Ray will point you toward the right model for the job. It takes thirty seconds and runs on a fast Anthropic Haiku model, so it does not cost you a meaningful credit spend.

Match the model to the depth of the type task. Quick ideation and detailed system documentation are different briefs and they need different models.

Prompting Type Across Different Output Formats

Typography behaves differently depending on where it appears, and Stensyl's generation surfaces each have different requirements for how you brief type within them. A prompt that works in Generate will not map cleanly to Motion or Graphics. The format must be part of the brief.

Generate surface: image-based type

When prompting for type inside generated images, the first decision is whether the type is the subject or a design element. A poster where the headline is the visual centrepiece needs a different prompt structure than a product render where a label appears as a secondary detail. You also need to decide whether legibility is required. A marketing key visual for a fashion campaign might accept decorative ambiguity in a background typographic element. A social graphic for a product launch cannot. State that decision explicitly, because the model will not infer it.

Current image generation models, including those available through Stensyl's Generate surface, still struggle with precise letterforms. Outputs often require vector redraw for any serious identity or packaging work. Prompt for type direction and atmosphere, then plan for manual refinement downstream.

Motion surface: type across time

The Motion surface uses Remotion as its foundation, which means the output is code-driven and the type needs to behave across animation states, not just look right as a still. Prompts for motion type should describe the weight contrast between keyframe start and end conditions. "Grotesque headline animating from light weight at 0% opacity to black weight at full opacity over 24 frames" gives the model structural information. "Animate the title elegantly" does not.

For motion designers building title sequences or lower-thirds, also specify whether the type is entering, exiting, or transforming, and whether it must hold legibility throughout the animation or only at a defined point. Letters that look sharp in a freeze frame often deform during transitions in video generation models, so distinguishing between code-driven motion and generative video motion is important when writing the brief.

Graphics surface: vector and print-oriented type

In the Graphics surface, name the intended output behaviour. "Clean paths suitable for expansion to large-format print" tells the model the type needs to survive scaling without aliasing artefacts. "Icon-weight stroke at 24px grid" tells it the type is operating at small scale in a constrained pixel environment. These are structurally different briefs even if the visual style overlaps.

Social surface: platform-specific legibility

For carousel headlines and paid ad copy generated in the Social surface, legibility requirements at thumbnail scale differ significantly from editorial layouts. A headline that reads clearly at full width on desktop will become unreadable compressed to a 1:1 Instagram post preview. Name the platform, name the crop dimensions, and specify whether the type needs to carry the message at small scale or whether supporting copy handles that load. A marketing team running Meta ads and a content designer building an organic LinkedIn carousel have different constraints even if both are working with the same brand typeface.

Web surface: hierarchy and accessibility

For the Web surface, specify the typeface role in the hierarchy explicitly. Primary display, secondary body, tertiary label. These are not interchangeable positions and a model that does not know which role it is designing for will conflate them. If accessibility is a constraint, include the contrast ratio requirement. WCAG 2.1 AA for body copy. AAA for small-scale labels if the platform serves users with visual impairments. These are not details to add in review. They are part of the brief.

Building a Repeatable Type Prompt Library

A single well-crafted prompt has value once. A library of tested prompts has value every time someone on your team opens a brief. The investment in building one pays back immediately once a second person uses it.

Store proven prompts in Stensyl's Projects surface alongside the brand or campaign they belong to. When a team member opens a new deliverable for an established client, they pull from tested language rather than starting from scratch and rediscovering the same wrong turns. The prompt becomes part of the brand asset, not just an ephemeral input.

Create a prompt template per discipline context. A product design team's packaging type prompt will differ structurally from a game developer's HUD type prompt even if both use geometric sans-serif type. The packaging brief must reference the print process, the stock, and the minimum legible size at production scale. The game UI brief must reference the screen resolution, the platform (console versus mobile), and the number of distinct label roles in the hierarchy. The underlying type style might converge. The contextual constraints will not.

Version your prompts. When a prompt produces a strong output, note which model produced it and what the exact prompt string was. Small wording changes, swapping "influenced by" for "in the tradition of", adding or removing a constraint, changing the order of descriptors, shift results in ways that are not always predictable. A version log costs nothing and prevents you rebuilding from memory six months later when a similar brief arrives.

Use Moodboards in Stensyl to attach visual reference to a saved prompt set. The written language and the visual anchor work together in ways neither achieves alone. When a new team member or a freelance collaborator joins a project, they can read the prompt, look at the moodboard, and understand the brief without a lengthy handover call.

Treat the prompt library as a living document inside Write. Claude Opus 4.7 on the Pro tier is well-suited to drafting and refining the rationale text that explains why each template is structured the way it is. That rationale is not decorative. It is what allows someone else to adapt the template intelligently rather than copy it blindly when the brief shifts.

Common Mistakes to Stop Making Right Now

These are not edge cases. They appear in most first attempts at AI-assisted type work, across every discipline.

Using font names as shorthand

Asking for "something like Helvetica" primes the model to approximate rather than reason about the underlying design properties you actually want. Helvetica is a name, not a description. What you probably mean is: neutral grotesque, even stroke contrast, closed apertures, moderate x-height, optimised for signage and UI applications. Write that instead. The model reasons better from described properties than from brand associations.

Skipping the negative constraint

Always include at least one "must not" instruction. For a luxury automotive campaign, "no condensed widths, no novelty letterforms" prevents the two most predictable wrong turns before the model takes them. For an exhibition wayfinding system, "no thin strokes under 1pt at production scale" prevents a class of legibility failures that would require a full redraw. The negative constraint is the most efficient edit you can make to a prompt.

Ignoring rendering context entirely

A typeface described for a printed exhibition panel will not automatically translate to a web microsite or a game HUD. The optical properties that make a face work beautifully at large scale in print, high contrast, fine detail, can actively harm legibility at small scale on screen. Name the medium explicitly every time, not as a formality but as a structural input that changes the output.

Treating the first output as final

Strong type prompts are iterative. The first output identifies which variables the model interpreted differently from your intent. If the model returned a heavy display face when you wanted a text-weight solution, the word "display" was probably missing or buried. If it returned a decorative direction when you wanted a functional one, the constraint layer was absent. Use the first output as a diagnostic, tighten the specific descriptors that drifted, and run a second pass. Research on prompt failure consistently supports iterative refinement over one-shot prompting for any task with real complexity.

Conflating a type style with a type system

A single headline treatment and a full typographic hierarchy are different briefs. A brand identity studio developing a complete type system for a product company needs to specify every role in the stack: the display face, the body face, the UI label face, the caption style, and how they relate to one another in weight and proportion. Prompting for "the headline typeface" and expecting the model to infer the rest is not a prompt failure. It is a brief failure.

NN/g's research on AI prototyping found that using AI to critique prompts, before running the main generation, produced better-structured briefs and less rework downstream. Build that step into your workflow: write the prompt, then ask the model to identify what is ambiguous or missing before you commit the full generation.

The prompt library, the versioning habit, and the critique step are not process overhead. They are what separates a team that uses AI type tools well from one that uses them often.

Typography is a discipline built on precision. The prompts that serve it well are precise in the same way: specific classification, documented references, explicit context, and at least one hard constraint. Every element of the type brief that you leave to inference is a decision you have handed to probability. The best type prompts leave nothing to inference that you already know.

Keep reading.

Negative Prompting Done Right: What to Cut for Cleaner Renders

8 min read

AI Prompt Chaining for Multi-Step Design Workflows

7 min read

Which AI Image Model Should You Use? A Discipline-by-Discipline Breakdown

11 min read

Try Stensyl for yourself

Image, video, 3D, chat, and document drafting. Every AI model, one studio. Plans from $11/month.

Explore the Studio

← Back to all articles