All posts

When most people hear "voice assistant," they think of Siri, Alexa, or Google Assistant. They think of asking a device to set a timer, check the weather, or play a song. These are conversational AI tools designed to answer questions and execute commands. Steno is something fundamentally different: a voice assistant for writing that does exactly one thing and does it exceptionally well. It turns your spoken words into typed text, right where your cursor is, in any application on your Mac.

This distinction matters more than it might seem. The history of productivity software is littered with tools that tried to do everything and ended up doing nothing well. Steno succeeds because of its narrow focus. There is no chatbot. There are no voice commands. There is no "Hey Steno, what is the capital of France?" There is only this: hold a hotkey, speak, release, and your words appear as text.

Why Writers Need a Different Kind of Voice Tool

Writers do not need an assistant that talks back. They need an input method that keeps up with their thoughts. The bottleneck in writing has never been the quality of ideas. It is the mechanical process of getting those ideas from your brain onto the screen. A keyboard limits most people to 40-60 words per minute. Speech delivers 120-150 WPM without effort.

This speed difference is not marginal. It is transformative. Consider the math: a 1,000-word blog post takes roughly 20 minutes to type at 50 WPM (not counting pauses for thought). The same post takes about 7 minutes to dictate at 150 WPM. Over the course of a week, a writer producing 5,000 words saves nearly an hour of pure typing time. Over a year, that compounds into weeks of recovered productivity.

But speed is only half the story. The more profound benefit is cognitive. When you type, there is a translation step between thought and output. Your brain formulates a sentence, then your fingers execute it character by character. This mechanical process creates a subtle but real friction that interrupts the flow of ideas. When you speak, that friction disappears. Words arrive on screen at the speed of thought, and many writers report that their first drafts are more natural, more conversational, and require less editing when dictated rather than typed.

How Steno Works as a Writing Assistant

Steno's workflow is deliberately simple because simplicity is what makes it usable in practice, not just in theory.

The Hold-to-Speak Model

You assign a hotkey during setup (most users choose the right Option key or a function key). When you want to dictate, you hold the key, speak naturally, and release when you are done. Steno captures the audio, sends it to the Groq Whisper API for transcription, and inserts the resulting text at your cursor position.

This hold-to-speak model is intentionally different from always-on dictation systems. Always-on tools require you to say "stop listening" or click a button to pause, which creates anxiety about the microphone capturing unintended speech. With Steno, the microphone is only active while you physically hold the key. Release the key, and audio capture stops immediately. There is no ambiguity about when you are being recorded.

Universal Text Insertion

Steno inserts text using macOS Accessibility APIs, which means it works in every application on your Mac. This is a critical difference from browser-based voice tools or even Apple Dictation, which only work in text fields that support specific input protocols. With Steno, you can dictate into Google Docs, VS Code, Terminal, Figma, Notion, Slack, iMessage, or any other app. If you can type there, you can speak there.

No Training Required

Traditional dictation software like Dragon required you to read passages aloud so the system could learn your voice. This training process took 15-30 minutes and had to be repeated if your microphone changed or if the software was installed on a new machine. Steno uses OpenAI's Whisper model, which was trained on 680,000 hours of multilingual audio. It understands your voice out of the box, regardless of your accent, speaking speed, or microphone quality.

Writing Workflows That Voice Transforms

First Drafts

The most natural application of voice-to-text for writers is the first draft. When you are generating ideas, developing arguments, or telling a story, the last thing you want is to be slowed down by typing mechanics. Dictating a first draft lets you focus entirely on content. Many writers describe the experience as closer to thinking aloud than traditional writing, and the resulting text often has a more natural, conversational tone.

The key mindset shift is accepting that a dictated first draft will need editing. This is not a weakness of dictation. It is simply how drafts work. Even typed first drafts need revision. The difference is that dictation gets you to the revision stage much faster.

Email and Professional Communication

The average knowledge worker sends 40 emails per day. If each email takes 3 minutes to type, that is 2 hours of daily typing. With voice input, the same emails take about 1 minute each to dictate, saving over an hour every day. And because spoken language tends to be more direct than written language, dictated emails are often clearer and more concise than their typed counterparts.

Notes and Journaling

Many people want to journal or take notes but find the act of typing tedious. Voice removes that barrier entirely. Open your note-taking app, hold the Steno hotkey, and speak your thoughts. Stream-of-consciousness journaling is particularly well-suited to dictation because the format already mimics natural speech patterns.

Academic and Research Writing

Academic writers often need to produce large volumes of text: literature reviews, methodology sections, discussion chapters. These sections require careful thought but not necessarily careful typing. Dictating these sections and then editing for precision and citation accuracy is significantly faster than typing from scratch, especially for writers who think more clearly when speaking than when staring at a blinking cursor.

Common Objections and Why They Do Not Hold

"I think better when I type"

This is the most common objection, and it deserves a thoughtful response. Many people do feel like they think better when typing, but research suggests this is often a learned association rather than a cognitive reality. You have spent decades typing, so typing feels natural. Dictation feels unfamiliar at first. Give it a week of consistent use, and most people find that speaking becomes equally natural for generating first-draft text. Typing remains superior for precise editing, code, and structured data entry. The optimal workflow uses both.

"What about punctuation?"

Modern speech recognition models like Whisper automatically insert punctuation based on your speech patterns. Pauses become periods. Rising intonation becomes question marks. The accuracy is not perfect, but it is good enough that minor punctuation fixes during editing take seconds rather than minutes.

"I cannot dictate in an open office"

This is a legitimate constraint. Dictation works best in private or semi-private spaces. However, many Mac users work from home at least part of the week, and even in offices, there are private moments throughout the day when dictation is viable: before others arrive, during lunch, in a meeting room. You do not need to dictate everything. Even using voice for 30% of your text input adds up to meaningful time savings.

Getting Started with Voice Writing

If you are a writer who has never tried serious dictation, here is a practical approach. Download Steno from stenofast.com and set up your hotkey. Start with low-stakes writing: emails, Slack messages, quick notes. As dictation becomes comfortable, graduate to longer-form content: blog posts, reports, articles. Within a week, you will have internalized the hold-speak-release rhythm, and voice will feel like a natural extension of your writing process.

Steno's free tier gives you enough daily transcriptions to build the habit. When you find yourself reaching for the hotkey instinctively, Steno Pro at $4.99/month unlocks unlimited dictation for the power user you have become.

The best voice assistant for writing is not one that writes for you. It is one that lets you write faster, with less friction, in any app you choose. That is exactly what Steno does.