Microsoft Word has included a dictation feature for several years, but many users are unaware of it or frustrated by its inconsistent behavior across platforms. If you're trying to use speech to text in Word — whether for faster writing, accessibility needs, or repetitive strain prevention — this guide explains what's available, how it works, and where the limitations lie.

How to Use Speech to Text in Microsoft Word

On Windows (Microsoft 365)

The Dictate button appears in the Home ribbon in Microsoft 365 on Windows. Click it (or press Alt+`) and Word starts listening. Speak naturally and text appears in your document. Word supports voice commands like "new line," "delete that," and punctuation commands like "comma" or "exclamation mark."

The feature uses Azure Cognitive Services for transcription, so an internet connection is required. Accuracy is solid for standard English, and it handles basic formatting commands reasonably well.

On Mac (Microsoft 365)

The Dictate button is also present in Word for Mac, but behavior can be less consistent. On Mac, Word's Dictate feature competes with — and sometimes conflicts with — macOS's own built-in dictation. Some users find that enabling macOS Enhanced Dictation interferes with Word's dictation, and vice versa.

Additionally, Word for Mac's dictation requires an active Microsoft 365 subscription and an internet connection. The offline mode available on Windows isn't fully replicated on Mac.

On iPad and iPhone

On iOS, Word relies on the system keyboard's built-in dictation microphone rather than a separate Dictate button. Tap the microphone on the iOS keyboard and speak. This is Apple's speech recognition, not Microsoft's, so the behavior may differ from the desktop experience.

Voice Commands in Word Dictation

Word supports a useful set of voice commands when Dictate is active:

These commands work in the Windows version consistently. The Mac version supports most of them but has occasional recognition failures where voice commands are transcribed as literal text instead of being interpreted as commands.

Accuracy and Performance

Word's Dictate feature performs well for casual dictation at moderate speaking speeds. It handles common English vocabulary reliably and does acceptably with proper nouns. Where it struggles:

Compared to dedicated dictation tools, Word's built-in feature is adequate but not exceptional. It's convenient precisely because it's already in the app you're using, not because it's the most accurate engine available.

The App-Lock-In Problem

The most significant limitation of Word's speech-to-text is that it only works inside Word. The moment you switch to a different app — email, a note-taking tool, Slack, a browser — you lose voice input. You're back to typing.

For users who spend their entire day in Word, this is acceptable. But most knowledge workers write in many different contexts: email clients, project management tools, communication apps, web forms, code editors. A dictation feature locked to one application solves only a fraction of the problem.

System-Wide Dictation as an Alternative

A better approach for most Mac users is a system-wide dictation tool that works across every application, not just Word. Apps like Steno sit in your menu bar and respond to a hotkey regardless of which app is in focus. Hold the hotkey in Word, in Gmail, in Notion, in Slack — the same voice-to-text capability is available everywhere.

This means your dictation workflow is consistent. You learn one set of habits and they apply to your entire computer, not just one application. And because Steno uses higher-accuracy transcription than Word's built-in engine, you get better results everywhere you type.

The question isn't whether Word's dictation is good enough for Word. It's whether locking your voice capability to a single app makes sense when you write in a dozen different places every day.

macOS Built-in Dictation vs. Word Dictation

macOS includes its own system-wide dictation that works in every app including Word. You can enable it in System Settings → Keyboard → Dictation. The advantage: it's free, requires no Microsoft subscription, and works in every app.

The disadvantage: Apple's built-in dictation accuracy, while improved, still falls short of the best third-party tools — particularly for extended dictation sessions and specialized vocabulary. It's a good free option but has a lower ceiling than dedicated apps.

Choosing the Right Approach for Your Workflow

Here's a simple framework:

Final Thoughts

Microsoft Word's speech-to-text is a useful built-in feature for Word users, particularly on Windows. On Mac, it's more inconsistent. For users who write across many applications, the real productivity gain comes from system-wide dictation tools that work everywhere — not just in one app. The key is matching the tool to how you actually work, not how you imagine you work.