One of the most common questions people ask about voice dictation on Mac is: "Will it work in my app?" The answer, at least with Steno, is yes. Every app. Every text field. No exceptions. And unlike Apple's built-in Dictation, which requires specific framework support and sometimes fails silently in certain applications, Steno takes a fundamentally different approach that guarantees universal compatibility.
This guide walks you through setting up system-wide dictation on your Mac, from installation to your first spoken sentence, and then dives into app-specific tips that will make you more effective in the tools you already use.
Why Most Dictation Tools Fail in Some Apps
Before explaining how Steno works everywhere, it helps to understand why other dictation tools do not. Apple's built-in Dictation feature uses the macOS text input system, which relies on each application implementing the NSTextInputClient protocol correctly. Most native Mac apps do, but many Electron-based apps (Slack, Discord, VS Code), web browsers, and custom UI frameworks handle text input differently. This leads to frustrating inconsistencies — dictation works perfectly in Notes but garbles text in your browser-based email client.
Steno sidesteps this problem entirely. Instead of injecting text through the input system, Steno simulates keyboard events at the system level. When you finish speaking, Steno types out the transcribed text character by character, exactly as if you had typed it on your keyboard. If an app accepts keyboard input — and every app does — Steno works in it.
Getting Started: Installation in 30 Seconds
If you have not set up Steno yet, the process takes less than a minute. For the detailed walkthrough with screenshots, see our 30-second setup guide. Here is the quick version:
- Download Steno from stenofast.com. The installer is under 2 MB.
- Open the installer and drag Steno to your Applications folder.
- Launch Steno. You will see the Steno icon appear in your menu bar.
- Grant permissions when prompted. Steno needs Microphone access (to hear you) and Accessibility access (to type text into apps). Both are standard macOS permissions that you can revoke at any time in System Settings.
- Start dictating. Hold the Option key (or your configured hotkey), speak, and release. Your words appear wherever your cursor is.
That is it. No account creation, no API keys to configure, no complex settings. Steno is designed to work out of the box.
The Core Workflow: Hold, Speak, Release
Steno's interaction model is deliberately simple. There is no "start recording" and "stop recording" button to click. Instead, you hold down a hotkey (Option by default), speak for as long as you want, and release the key when you are done. The transcribed text appears at your cursor position within about a second.
This push-to-talk model has several advantages over toggle-based dictation:
- No accidental recordings. You will never find that dictation was "on" when you did not want it.
- Natural boundaries. Holding and releasing creates a clear start and end for each dictation, which improves transcription accuracy.
- Zero mode confusion. You always know whether you are in dictation mode or typing mode.
App-by-App Tips and Tricks
While Steno works in every app, each app has its own quirks and best practices. Here is how to get the most out of voice dictation in the tools you probably use daily.
Web Browsers (Chrome, Safari, Arc, Firefox)
Voice dictation in browsers covers an enormous range of use cases: composing Gmail messages, writing Google Docs content, posting on social media, filling out forms, and contributing to forums. Steno handles all of these seamlessly. A few tips:
- Click into the text field first. Make sure your cursor is in the input where you want text to appear before holding the hotkey.
- Gmail compose windows work flawlessly. Dictate your entire email, then use the keyboard for final formatting.
- Google Docs is one of the best apps for voice dictation because it auto-saves, so you never lose dictated text.
- Social media posts — Twitter/X, LinkedIn, Reddit — are perfect voice dictation targets because they are short-form prose.
Code Editors (VS Code, Xcode, JetBrains IDEs)
Developers often assume voice dictation is useless for coding, but that misses the point. You are probably not going to dictate a for loop, but you will dictate:
- Code comments — explain what a function does in plain English
- Git commit messages — describe your changes quickly and naturally
- Pull request descriptions — write thorough PR descriptions in seconds
- Documentation — README files, API docs, inline documentation
- Slack and Teams messages — quick replies to colleagues without leaving your editor context
For more on how developers integrate voice typing into their workflow, see our guide to app integrations.
Messaging Apps (Slack, Discord, iMessage, WhatsApp)
Messaging is arguably the single best use case for voice dictation. These apps are all about rapid, natural-language communication — exactly what voice input is optimized for. Tips:
- Short messages work great. Hold the key, say "sounds good, let's do the standup at 10," release. Done in two seconds.
- For longer messages, dictate in multiple segments. Say one thought, release, review, then hold and continue. This gives you natural paragraph breaks.
- Thread replies in Slack: Click into the thread first, then dictate. Steno types directly into whatever field has focus.
Writing Apps (Pages, Word, Notion, Obsidian)
Long-form writing is where voice dictation produces the most dramatic time savings. If you are writing a report, a blog post, a memo, or any document longer than a few paragraphs, voice dictation can cut your drafting time by 60-70%.
- Dictate first drafts entirely by voice. Do not worry about perfection — just speak your thoughts. Edit afterward on the keyboard.
- Use natural pauses. Steno handles punctuation automatically. A natural pause becomes a period or comma in the transcript.
- Notion and Obsidian work particularly well because they are Markdown-based, and you can add formatting after your voice draft.
Email (Mail, Outlook, Gmail)
Email is the productivity black hole for most knowledge workers. Voice dictation closes that hole. The average email takes 3-5 minutes to compose by typing. With voice, that drops to 30-60 seconds.
- Reply by voice. Open the email, click Reply, hold Option, speak your response, release. The entire reply flow takes under a minute.
- Compose fresh emails by speaking naturally. "Hi Sarah, following up on our call from yesterday. I reviewed the Q2 projections and have a few suggestions..." — that sentence took three seconds to speak and would have taken 20 seconds to type.
Advanced Techniques
Once you are comfortable with basic dictation, these techniques will help you produce more polished output directly from voice.
Dictating Punctuation
Steno's Whisper-based transcription engine handles most punctuation automatically based on your speech patterns. Pauses become periods or commas. Rising intonation becomes question marks. But you can also speak punctuation explicitly when needed: say "comma" to insert a comma, "period" for a period, "new line" for a line break. For the full list of voice commands, check our voice commands guide.
The Dictate-Then-Edit Workflow
The most efficient approach is to separate creation from editing. Speak your first draft without stopping to correct mistakes. Let the words flow. Then switch to keyboard mode and edit: fix any transcription errors, tighten sentences, rearrange paragraphs. This two-pass approach is faster than trying to produce perfect text in a single pass, whether you are typing or speaking.
Customizing Your Hotkey
If Option does not work well for your workflow — maybe you use it frequently for special characters — you can change Steno's hotkey in Settings. Popular alternatives include Right Command, Fn, or a function key like F5. Choose a key you do not use often and that is easy to hold comfortably.
Troubleshooting Common Issues
If you run into problems, these are the most common causes and fixes:
- "Nothing happens when I hold the hotkey." Check that Steno has Accessibility permission in System Settings > Privacy & Security > Accessibility.
- "Text appears in the wrong app." Make sure the target app's text field has focus before you start dictating. Click into the field first.
- "Transcription is inaccurate." Ensure your microphone is not obstructed. Check System Settings > Sound to confirm the correct input device is selected.
- "There is a long delay before text appears." This is usually a network issue. Steno sends audio to the cloud for transcription, so a stable internet connection produces the fastest results.
Start Dictating Today
The beauty of system-wide dictation is that it meets you where you are. You do not need to switch to a special app or change your workflow. Just hold a key and speak, in whatever app you are already using. Once you build the habit — and most people develop it within a day or two — you will wonder how you ever wrote all those emails and messages by hand.