StoryVoxStoryVox

Field Notes

How to Add a Pronunciation Guide to Your Audiobook

·audiobook production · self-publishing · tutorials

Mispronounce a character's name once in an audiobook, and listeners notice. Mispronounce it 200 times across a six-hour recording, and you'll find those one-star reviews waiting for you. For indie authors, a pronunciation guide isn't a nice-to-have — it's the difference between a professional production and an embarrassing one. Whether you're working with a human narrator or an AI voice platform, knowing how to add a pronunciation guide to your audiobook is one of the most practical skills you can develop before you hit record.

Why Pronunciation Guides Matter More Than You Think

According to a Reddit thread on r/audiobooks with hundreds of upvotes, high-end audiobook productions routinely obtain pronunciation guides directly from living authors — professional narrators don't invent pronunciations, they research them. That research process can take hours per book. When you provide a clear guide upfront, you eliminate that guesswork entirely.

The stakes are higher than they might seem. Fantasy and science fiction authors often invent names with no obvious phonetic cues — is "Caelan" KAY-lan or SEE-lan? Historical fiction writers use place names from languages their narrator may never have studied. Even contemporary fiction can trip up a narrator with a surname like "Beauchamp" (which is, confusingly, pronounced "BEE-chum" in British English). A single ambiguous name, repeated across dozens of chapters, compounds the problem with every listen.

For AI narration specifically, pronunciation guides are even more critical. AI voices don't have the common sense to look up a word or ask a colleague. They render what they're given, using their training data as a reference. If your invented city "Vyreth" doesn't appear in any training corpus, the AI will make its best guess — and that guess may not match what you heard in your head when you wrote it.

Author's pronunciation guide document open beside a manuscript with character names and phonetic spellings highlighted
Author's pronunciation guide document open beside a manuscript with character names and phonetic spellings highlighted

What to Include in Your Audiobook Pronunciation Guide

A good pronunciation guide is a simple reference document — usually a table or list — that maps every potentially tricky word to its correct pronunciation. You don't need a linguistics degree to write one. Here's what to cover:

Characters and Proper Nouns

This is the most important category. List every character name, including minor ones. A narrator who correctly pronounces your protagonist's name but stumbles over a side character's will still pull listeners out of the story. Include:

  • Main and supporting characters — full names, including surnames
  • Place names — cities, countries, fictional geography
  • Organization and brand names — especially invented ones
  • Titles and honorifics — particularly if they're from non-English traditions

Foreign Language Words and Phrases

If your book includes French, Latin, Mandarin, or any other language, spell out the pronunciation phonetically. Don't assume a narrator — human or AI — is fluent in the language your character switches into for a dramatic moment.

Technical and Domain-Specific Terminology

Medical thrillers, legal dramas, and hard science fiction all carry specialized vocabulary. Words like "tachycardia," "amicus curiae," or "baryogenesis" have established pronunciations that a narrator needs to get right for credibility. Include these even if you think they're obvious — what's obvious to a cardiologist isn't obvious to a voice actor.

Invented Words and Neologisms

This is where indie authors, especially in speculative fiction, have a unique challenge. You invented these words. No dictionary exists for them. Write down exactly how you want them said, and don't leave room for interpretation.

How to Write Phonetic Spellings (Without Learning IPA)

The International Phonetic Alphabet (IPA) is the gold standard for pronunciation notation, but most authors — and many narrators — aren't fluent in it. The good news is that plain-English phonetic spelling works perfectly well for most purposes.

The method is straightforward: break the word into syllables, spell each syllable the way it sounds using common English letter combinations, and mark the stressed syllable in capital letters. A few examples:

  • Caelan → KAY-lan
  • Beauchamp → BEE-chum
  • Vyreth → VY-reth
  • Niamh → NEEV
  • Ptolemy → TOL-uh-mee

If you want to go further, the IPA is worth learning at a basic level — resources like Merriam-Webster's pronunciation key can help you decode it, even if you never write it yourself. For narrators who prefer IPA, sites like Forvo and YouGlish let you hear real human pronunciations of words in context.

If you're not sure how to format the rest of your manuscript for production, our guide on audiobook formatting: chapters, credits, and file structure covers everything from file naming conventions to front matter — a useful companion to your pronunciation work.

A Step-by-Step Workflow for Building Your Guide

Here's a repeatable process that works whether you're producing your first audiobook or your fifteenth:

  1. Do a full manuscript read-through specifically for pronunciation. Don't try to catch these during your editing pass — give it a dedicated session.
  2. Flag every name, place, foreign word, and technical term as you go. Use a simple highlight color or a running list in a separate document.
  3. Decide on the canonical pronunciation for each flagged word. For real words, look them up. For invented words, write down what you've always said in your head.
  4. Format your guide as a table with at least three columns: the word as written, the phonetic spelling, and any contextual notes (e.g., "this character's name changes pronunciation in Chapter 14 when she adopts a new identity").
  5. Add audio samples where possible. If you can record yourself saying the word correctly — even on your phone — include a link or attach the file. This is especially useful for AI platforms that accept audio references.
  6. Put the guide in your project folder alongside your manuscript. Don't let it live only in your email drafts or a sticky note on your monitor.

How Pronunciation Guides Work with AI Narration Platforms

AI narration changes the mechanics of pronunciation correction significantly. With a human narrator, you hand over your guide before recording begins and trust them to internalize it. With AI, you have more direct control — but also more responsibility.

Most AI audiobook platforms handle pronunciation in one of two ways. The first is a pronunciation dictionary (sometimes called a lexicon or custom dictionary), where you input word-to-phoneme mappings directly into the platform. The second is phonetic substitution in the text itself — replacing a difficult word with a phonetic spelling that the AI renders correctly, then reverting to the original spelling in the final manuscript.

StoryVox uses a built-in pronunciation dictionary system that lets you define exactly how character names and specialized terms should be spoken, without altering your manuscript text. This means your guide translates directly into production settings — no workarounds required.

For a broader look at the full production process, including how to choose voices, structure your project, and export ACX-compliant files, our complete guide to AI audiobooks walks through every stage from manuscript to finished file.

Common Mistakes Authors Make with Pronunciation Guides

Even well-intentioned guides can fall short. Watch out for these:

  • Only guiding the hard words. Authors often skip words they consider obvious — but "obvious" is subjective. Include anything with an unusual spelling-to-sound relationship.
  • Forgetting consistency across a series. If you're producing multiple books in a series, your pronunciation guide should be cumulative. A narrator (or AI) shouldn't have to re-learn "Vyreth" for Book 3.
  • Using different phonetic systems in the same document. Pick one approach — plain English phonetics or IPA — and stick to it throughout.
  • Not updating the guide during revisions. If you add a new character in your final edit, add them to the guide. A guide that covers 90% of your names is only 90% useful.
  • Assuming the platform will figure it out. Whether you're working with a human or an AI, assume zero prior knowledge. The guide is your insurance policy.

Where to Store and Share Your Pronunciation Guide

Your guide should live in at least two places: your project folder and your manuscript's front matter. Including it as a page before Chapter 1 — clearly labeled "Pronunciation Guide" or "Author's Note on Pronunciation" — means it travels with the manuscript no matter where it goes. If you ever license your audiobook rights, sell foreign translation rights, or hand the project to a new producer, the guide is already embedded.

For AI platforms, you'll typically upload or input your guide directly into the project settings. Keep a master copy in a format you can edit — a simple spreadsheet works well, with columns for the written form, phonetic spelling, and notes. As your catalog grows, this master file becomes invaluable.

A pronunciation guide is ultimately a form of creative control. You spent months or years choosing exactly the right names for your characters and worlds. A few hours building a clear, thorough guide ensures those choices survive the journey from page to ear — exactly as you intended them.

← Back to Field Notes