All posts

The relationship between notes and speech is more circular than it first appears. We take notes to remember things we hear spoken. We speak from notes to share ideas with others. And increasingly, we convert notes to speech so we can absorb written content through our ears while our eyes and hands are busy elsewhere.

This post covers both directions of that loop: how to get the most from notes-to-speech playback technology, and why dictating your notes in the first place produces better content faster than typing ever could.

Why People Convert Notes to Audio

Reading requires your eyes and usually your hands. Listening does not. This asymmetry makes audio versions of written notes valuable in a specific class of situations:

Mac's Built-In Notes to Speech

macOS has had text-to-speech built in for years, and it is accessible from any application. Select the text you want to hear, right-click, choose Speech, then Start Speaking. The voice quality has improved substantially in recent macOS versions, with several natural-sounding voices available.

To customize the voice and speaking rate, go to System Settings, then Accessibility, then Spoken Content. You can choose from a range of voices, set the reading speed, and configure a keyboard shortcut that starts speaking selected text on demand. Once configured, you can listen to any document, note, or web article with a two-key press.

Better Voices for Extended Listening

The default system voices are adequate for short passages but can feel monotonous for long listening sessions. macOS offers enhanced "premium" voices that you can download under the Spoken Content settings. Voices like Siri voice options or third-party TTS apps with neural voice models sound significantly more natural for extended note review sessions.

The Better Approach: Speak Your Notes First

While notes-to-speech playback is genuinely useful, there is an even more powerful insight hiding in the relationship between voice and notes: notes that were spoken first are better notes.

When you dictate your notes rather than typing them, several things happen that improve quality:

Natural Language Over Shorthand

Typed notes tend toward abbreviations, fragments, and shorthand. "Q3 rev - down 12% YoY - inflation + competition" is what typed notes look like. "Third quarter revenue was down twelve percent year over year, primarily due to inflationary pressure on our supplier costs and increased competition in the SMB segment" is what dictated notes sound like. When you read those notes back a month later, the dictated version is usable. The typed shorthand often is not.

Context Embedded in the Language

Speaking forces you to include the context that your brain is holding but your typed shorthand omits. When you say "Sarah mentioned that the timeline is at risk because the integration vendor is behind schedule," you capture attribution, substance, and implication. When you type "integration delayed," you lose almost everything.

Faster Capture, More Complete Records

Dictation is three times faster than typing, which means you can capture more before the moment passes. Meeting notes taken by voice are typically twice as long and three times as useful as typed equivalents because the speed advantage lets you record nuance and detail that typing would force you to omit.

Combining Both: The Voice Note Loop

The most powerful workflow combines both directions. Dictate your notes as they happen — in meetings, during research sessions, while walking between appointments. Then, when you need to review them, use text-to-speech playback while you are commuting or exercising. Your voice-captured notes are already written in a natural, readable style that translates well to spoken audio.

This creates a virtuous cycle: speaking your notes in creates better notes, which means the audio playback of those notes is more valuable, which reinforces the habit of dictating in the first place.

Using Steno for Note Dictation on Mac

Steno is designed for exactly this kind of fast, in-the-moment note capture. Hold the hotkey, speak, release. The advanced transcription engine handles natural conversational speech — including the slightly informal cadence of note-taking — with high accuracy. The text appears wherever your cursor is, whether that is Apple Notes, Obsidian, Notion, or a plain text file.

If you work across multiple note-taking platforms and want to understand how voice dictation integrates with each, the post on voice dictation for Notion and Obsidian covers both platforms in depth.

Notes you can actually listen to are notes you took the time to speak properly. The discipline of dictating well produces records worth having — in text or in audio form.

Download Steno at stenofast.com to start capturing notes by voice today. Between Steno for dictation and macOS's built-in speech playback, your notes become a fully bidirectional audio-text system.