All posts

Gmail is the world's most popular email service, with over 1.8 billion users. On a Mac, most people access Gmail through a web browser -- Chrome, Safari, Arc, or Firefox. And on a Mac, there is no built-in speech-to-text feature that works reliably in Gmail's web interface. Apple Dictation is inconsistent with web-based text editors. Google's own voice typing is limited to Google Docs and does not work in Gmail. This leaves Mac users with no native way to dictate emails in Gmail.

Steno fills this gap. It is a native macOS menu bar app that transcribes your voice and pastes the text at your cursor position, which means it works perfectly in Gmail regardless of which browser you use. Here is a complete guide to dictating emails in Gmail on Mac.

The Problem with Dictation in Gmail on Mac

Gmail's compose window uses a contenteditable div -- a web-based rich text editor -- rather than a standard text input field. This matters because Apple Dictation was designed to work with native macOS text fields. When you try to use Apple Dictation in Gmail, you might experience:

These issues are not bugs in Apple Dictation per se. They are the result of a fundamental mismatch between how Apple Dictation inserts text and how Gmail's editor receives it. The result is that most Mac users simply do not use dictation in Gmail, even though email is one of the most natural applications for voice typing.

How Steno Solves This

Steno takes a different approach. Instead of trying to integrate with Gmail's text editor, it uses a three-step process:

  1. Capture audio while you hold the hotkey.
  2. Send the audio to Groq's Whisper API for transcription (returns in under a second).
  3. Paste the transcribed text at your cursor position using the system clipboard.

Because Steno uses paste rather than text input injection, it works with any text editor, including Gmail's contenteditable compose window. The text appears exactly where your cursor is, with no formatting issues, no position errors, and no dropped words.

Step-by-Step: Dictating an Email in Gmail

Setup (One Time)

  1. Download Steno from stenofast.com and install it.
  2. Launch Steno. It appears as an icon in your menu bar.
  3. Choose your preferred hotkey during the onboarding process. This is the key you will hold while dictating.
  4. Grant microphone permission when prompted.

Composing a New Email

  1. Open Gmail in your browser and click Compose.
  2. Fill in the To, CC, and Subject fields as usual (typing is fine for these short fields, though you can dictate the subject line too).
  3. Click in the email body area so your cursor is positioned there.
  4. Hold your Steno hotkey.
  5. Speak your email at natural conversational speed. For example: "Hi Sarah, thanks for sending over the quarterly report. I have reviewed the revenue projections and they look solid. I have two questions about the expense forecasts on page three. First, the travel budget seems significantly higher than last quarter, and I was wondering if that accounts for the new remote work policy. Second, the software licensing line item appears to include the contract that we decided to cancel in January. Could you double check that? I would like to discuss these items at our meeting on Thursday. Let me know if that works for you."
  6. Release the hotkey.
  7. Your transcribed email appears in the compose window within a second.
  8. Review, make any minor edits, and send.

That entire email took about 30 seconds to dictate. Typing it would have taken two to three minutes.

Replying to an Email

Replying follows the same process with one additional tip: read the incoming email fully before you start dictating your reply. Form your complete response in your mind, then hold the hotkey and speak it. Trying to read and dictate simultaneously produces fragmented responses.

Click Reply, click in the reply body (Gmail automatically places your cursor there), hold the hotkey, and speak your response. Steno inserts your text above the quoted original message, exactly where Gmail expects it.

Dictating the Subject Line

You can dictate subject lines too, though they tend to be short enough that typing is equally fast. Click in the Subject field, hold the hotkey, speak a brief subject, release. Steno handles short phrases just as accurately as longer passages.

Tips for Dictating Professional Emails in Gmail

Punctuation Is Automatic

One of the biggest advantages of Steno's Whisper-based transcription is automatic punctuation. You do not need to say "period" or "comma." The AI model infers punctuation from your speech patterns: pauses become periods, brief pauses become commas, rising intonation becomes question marks. This means you can speak naturally without thinking about punctuation at all.

Paragraph Breaks

Whisper handles paragraph breaks intelligently based on longer pauses and topic shifts. For explicit paragraph breaks, you can dictate each paragraph as a separate Steno input: hold the hotkey, speak paragraph one, release. Press Enter twice in Gmail to create the break. Hold the hotkey again, speak paragraph two, release. This gives you precise control over paragraph structure while maintaining the speed of dictation.

Greetings and Sign-offs

You can dictate greetings and sign-offs naturally. Say "Hi John" and Whisper will transcribe it with proper capitalization. Say "Best regards" or "Thanks" and it appears correctly. For your email signature, Gmail handles that automatically, so you do not need to dictate it.

Numbers, Dates, and Names

Whisper handles numbers and dates well. Saying "March 15th" produces "March 15th." Saying "forty-two thousand dollars" produces "$42,000" or "forty-two thousand dollars" depending on context. For specific proper nouns that Whisper might not recognize, you can type those in after dictating the bulk of the message.

Gmail-Specific Workflows

Batch Email Processing

One powerful workflow is batch-processing your inbox with voice typing. Open each email that needs a response, dictate your reply in seconds, send, move to the next. What might take 30 minutes of typing can be accomplished in 10 minutes of dictation. The speed difference becomes most apparent when you have 10 or 15 emails that all need substantive responses.

Email Drafts

Gmail auto-saves drafts as you type. When you dictate with Steno, the text appears all at once (after you release the hotkey), and Gmail auto-saves it as a draft within a few seconds. You can dictate a draft, close the compose window, and come back to it later for editing and sending.

Google Workspace and Multiple Accounts

If you use Gmail through Google Workspace with multiple accounts, Steno works identically across all of them. There is no account-specific setup or configuration because Steno operates at the macOS level, not the Gmail level.

Steno vs. Other Dictation Options for Gmail

Google's own voice typing feature works in Google Docs but not in Gmail. This is a common source of frustration for users who expect Google's tools to work consistently across products. On Mac, Apple Dictation works inconsistently with Gmail's web interface as discussed above. Browser extensions that add dictation to Gmail exist, but they typically require extensive permissions (access to all your Gmail data) and introduce security and privacy concerns.

Steno requires only microphone access. It does not read your emails, does not access your Gmail data, and does not inject code into web pages. It simply transcribes your voice and pastes text -- a minimal, privacy-respecting approach that achieves the desired result without overreaching.

Getting Started

Download Steno free at stenofast.com. Installation takes under a minute, and you can start dictating in Gmail immediately. The free tier lets you experience the workflow before committing. For unlimited daily dictation, Steno Pro is $4.99 per month.

Your inbox is waiting. Hold the hotkey, speak your reply, release. Email will never feel like a chore again.