Most people think of voice dictation and keyboard typing as separate modes — you either type or you talk. In practice, the most productive writers on Mac do neither exclusively. They type and talk, switching fluidly between keyboard and voice depending on what they are doing at any given moment. This hybrid approach is faster than either input method alone, and it is more sustainable over long work sessions.
The key is having a voice tool that makes switching completely frictionless. If toggling to voice requires clicking a button, opening a panel, or changing focus away from your work, the overhead defeats the benefit. What you need is a system where you can type and talk interchangeably without thinking about it at all.
Why Pure Typing is Inefficient for Long-Form Work
Typing is excellent for short, precise input: commands, code, URLs, names. It is less suited to long-form prose where you are generating ideas as you write. The bottleneck when typing a long document is rarely your fingers — it is the cognitive gap between thinking and expressing. Speaking is closer to raw thought speed. When you type and talk, you get to choose the right tool for each part of the task.
Consider a common writing workflow: you are drafting an email. The subject line, the recipient's name, specific figures and dates — these are worth typing carefully. The body of the email, where you are explaining context and making a request, flows faster when spoken. A type-and-talk workflow lets you type the structured parts and voice the expressive parts, combining precision and speed.
The Hold-to-Speak Model
The interaction model that makes type-and-talk work in practice is hold-to-speak. You hold a key while speaking, then release it. The transcribed text appears at your cursor. You then type normally until you want to voice another segment. No mode switching, no UI changes, no interruptions to your flow.
This is exactly how Steno works on Mac. Hold the hotkey (configurable to any key or combination), speak, release. The text lands at your cursor. You keep typing. The interaction is so minimal that after a day of use, switching between voice and keyboard becomes as natural as switching between hands on the keyboard.
What to Type vs. What to Talk
Developing your own type-and-talk intuition takes a little practice. Here are some general principles that work well for most people.
Type When You Need Precision
Proper nouns, technical terms, code snippets, email addresses, numbers, and punctuation-heavy text are usually faster to type. Voice transcription handles these correctly most of the time, but when accuracy is non-negotiable, typing gives you full control. In a type-and-talk workflow, you never have to use voice for anything where you would rather type.
Talk When You Are Generating Content
Paragraph-length prose, explanations, arguments, narratives, meeting notes, brainstorming, and any text where the goal is volume rather than precision — all of these benefit from voice. Speaking at 130 to 180 words per minute is two to three times faster than average typing speed, which means you can generate a first draft far more quickly by talking.
Talk When Your Hands Are Otherwise Occupied
The type-and-talk pattern also works when your hands are doing something else entirely. Reviewing a document with one hand on the mouse, annotating a sketch pad, or managing multiple windows — in all of these situations, voice lets you continue composing without stopping what your hands are doing.
Type and Talk Across All Your Mac Apps
One advantage of using a system-level tool like Steno is that your type-and-talk workflow is identical in every application. Email, Slack, Notion, Google Docs, code editors, terminals — the hotkey works the same way everywhere. You do not need to learn a different voice mode for each app or worry about whether a particular app supports voice input.
This universality is important for real workflows. On a typical workday, a knowledge worker uses ten to fifteen different applications. A voice tool that only works in some of them forces you to constantly remember which mode you are in. Steno's approach — system-level text insertion — means you genuinely never have to think about compatibility.
Building a Type-and-Talk Habit
The fastest way to develop a type-and-talk habit is to pick one application and one task where you will deliberately practice for a week. Email is ideal for most people: it is high-volume, relatively low-stakes, and has both structured parts (recipient, subject) and expressive parts (body). Spend a week typing the structure and voicing the body of every email you write. By the end of the week, the switching will feel automatic.
From email, expand to wherever you write the most. Documents, Slack messages, notes, reports. Each expansion takes less time than the first because you are building on the same core habit: hold the key, speak, release, type, repeat.
The Speed Arithmetic
If you type at 60 words per minute and speak at 150 words per minute, the math is straightforward. For pure prose generation, voice is 2.5 times faster. In a mixed type-and-talk workflow where half your input is typed (structured parts) and half is spoken (prose), your effective throughput is roughly 40 to 50 percent higher than typing alone. Over an eight-hour workday with significant writing, that translates to hours of recovered time.
But the speed gain is not the most important benefit. The more significant change is cognitive. When you are not struggling to keep up with your own thoughts by typing, you think more clearly. The ideas flow better because the friction between thinking and expressing has been reduced. Writers who adopt type-and-talk workflows consistently report that their drafts are better, not just faster.
Getting Started
Steno is a macOS menu bar app that makes type-and-talk effortless. Download it at stenofast.com, assign your preferred hotkey, and start mixing keyboard and voice in your next writing session. The first time you speak a full paragraph without breaking your train of thought, you will understand why so many Mac users have made this their default way of working.
The best writing tools disappear. When your voice and your keyboard feel like one continuous input, you stop thinking about the tools entirely and start thinking only about what you want to say.