All posts

Voice to text dictation is the practice of converting spoken language into written text as you work. Unlike transcribing a recording after the fact, dictation happens live: you speak, the words appear at your cursor, and you continue composing. For Mac users who spend hours every day writing — emails, documents, messages, notes, code comments — dictation represents one of the most significant productivity improvements available.

This guide covers everything you need to know: how modern dictation works, what tools are available on Mac, and how to build a lasting dictation habit.

Why Voice to Text Dictation Matters

The core argument for dictation is simple: most people speak at 120 to 150 words per minute but type at 40 to 60 words per minute. That gap represents a significant productivity opportunity. If you could express your thoughts at your natural speaking rate, you would produce the same amount of text in roughly one-third of the time.

But speed is not the only benefit. Many people find that speaking their thoughts produces more natural, less stilted prose than typing. When you type, you are simultaneously composing and manually encoding each word character by character. That dual cognitive load can interrupt the flow of ideas. When you speak, the verbal expression happens more automatically, leaving more mental bandwidth for the ideas themselves.

Additionally, typing for hours contributes to repetitive strain injury, carpal tunnel syndrome, and chronic wrist and shoulder pain. Voice to text dictation reduces keyboard time without reducing output — often increasing it.

How Modern Mac Dictation Works

Modern voice to text dictation on Mac works through one of two mechanisms:

Native macOS Dictation

Apple includes dictation in macOS, accessible through System Settings. You can trigger it with a keyboard shortcut and speak into any native text field. Recent versions include an enhanced dictation mode that runs on Apple's neural processing hardware and supports a broader range of vocabulary. This is a reasonable starting point for casual use.

Third-Party Apps with Global Hotkeys

Apps like Steno intercept a global hotkey (a key or key combination you configure), capture audio from your microphone while the key is held, send that audio to a high-accuracy transcription service, and insert the resulting text at your cursor using keyboard simulation. This approach works in any application — native apps, Electron apps, web browsers, and terminals — because it does not rely on native text field APIs.

The hold-to-speak model — where you hold a key while speaking and release to stop — is significantly more ergonomic than toggle-based dictation. You never accidentally leave the microphone on, never transcribe ambient noise, and always have precise control over what gets dictated.

Choosing a Dictation Tool for Mac

For Casual Use: Apple Dictation

If you dictate occasionally and primarily in native Mac apps (Mail, Notes, Pages), Apple's built-in dictation is free and good enough. Set it up in System Settings → Keyboard → Dictation and assign a hotkey. Start there.

For Regular Use: Steno

If you dictate frequently or need dictation to work in every application including Slack, Notion, VS Code, and Chrome, Steno is the better choice. Download it at stenofast.com. It offers higher accuracy on technical and specialized vocabulary, works in every app on your Mac, and uses a hold-to-speak interface that integrates naturally into a fast typing workflow.

Setting Up Your Dictation Environment

Choosing a Hotkey

The hotkey choice matters more than it seems. You want a key that is easy to hold while speaking but unlikely to be pressed accidentally during normal typing. Popular choices include:

Microphone Setup

The MacBook's built-in microphone works adequately in a quiet home office or private room. For open offices, noisy environments, or maximum accuracy, invest in a quality microphone. A set of AirPods Pro provides excellent speech capture because the microphone is close to your mouth and benefits from Apple's noise cancellation. A USB headset with a boom mic positioned next to your mouth is the best option for pure accuracy.

Your Physical Environment

Dictation works in any environment but is most comfortable in private. If you share an office, you can absolutely use voice to text dictation — many people do — but it requires a level of comfort with speaking while others are nearby. Working from home or having a private office removes this barrier entirely.

Building a Dictation Habit

Week One: Email Only

Start by using dictation exclusively for email replies. Email is lower stakes than formal documents, you write a lot of it, and the conversational tone of email suits dictation perfectly. Do not worry about perfection in the first week. Focus on getting comfortable with the mechanic of holding the key, speaking a sentence, and releasing.

Week Two: Slack and Messages

Add chat applications. Dictating a Slack message is fast, the messages are short, and you get many repetitions in a single day. By the end of week two, the hold-to-speak interaction should feel automatic.

Week Three: Documents

Start dictating first drafts of documents. Do not try to dictate a polished document — dictate a rough draft and then edit it. The editing pass will be faster than writing from scratch, and the draft will contain more of your natural voice than a laboriously typed version would.

Week Four and Beyond: Notes and Everything Else

By week four, most users find that they reach for voice to text dictation automatically for any text longer than a sentence. The habit has formed. Continue expanding into meeting notes, research notes, code comments, and anywhere else you write.

Common Dictation Challenges and Solutions

"I don't know what to say"

This is different from writer's block — it is a planning problem. Before you hold the key, think for a moment about what you want to express. You do not need to plan the exact words, just the idea. Then speak it.

"The accuracy is not good enough"

Test with a different microphone first. Microphone quality accounts for a large portion of accuracy variation. Also check that you are using a high-accuracy tool (Steno uses a best-in-class transcription model) and add domain-specific terms to your custom vocabulary.

"I keep getting errors on [specific word]"

Add that word to your custom vocabulary in the app settings. Most recurring errors on specific words are fixed by adding them once.

Voice to text dictation is not a replacement for thinking — it is a way to get your thinking into text faster, with less friction, and less wear on your hands.