All posts

Journalism has always been a race against the clock. Stories break on deadlines. Interviews happen in noisy environments. Notes get scribbled in margins and on the backs of press releases. And at the end of it all, you sit down at your laptop and type as fast as you can to file before your editor starts sending increasingly urgent messages.

Voice-to-text technology has been a tool in the journalist's arsenal for decades, going back to reporters dictating stories over the phone to typists in the newsroom. What has changed is the quality. Modern AI-powered speech recognition, specifically OpenAI's Whisper model running on Groq's ultra-fast inference hardware, delivers accuracy that makes voice dictation a genuine replacement for typing rather than a clumsy workaround.

Filing Under Deadline Pressure

The most obvious use case for voice-to-text in journalism is writing under deadline. You have just left the press conference, the city council meeting, the courtroom. You know what happened. You know your lede. You know your structure. The only bottleneck is how fast you can get words on the page.

With Steno, you can dictate your story at speaking speed -- 130 to 150 words per minute compared to 40 to 60 words per minute typing. A 600-word breaking news story that would take 10 to 15 minutes to type can be dictated in under 5 minutes. For a journalist on deadline, those saved minutes can mean the difference between being first and being second.

The workflow is simple. Open your CMS, Google Doc, or text editor. Hold the Steno hotkey. Speak your story from beginning to end, as if you were reading it aloud to your editor. Release the hotkey. Your text appears, fully punctuated and ready for a quick editing pass. Steno works in every application because it pastes text at your cursor position, so it does not matter whether your newsroom uses WordPress, Arc, custom CMS software, or plain text files.

The Dictation Mindset

Experienced broadcast journalists will find voice-to-text immediately natural because they already think in spoken sentences. Print and digital journalists may need a brief adjustment period. The key insight is that dictated prose and typed prose are both valid starting points for a polished story. You are not writing final copy by voice. You are producing a first draft at triple speed, then editing it to publication quality.

Most journalists find that their dictated first drafts require less editing than expected. Speaking naturally tends to produce clear, direct prose -- the kind of writing that journalism values. The passive voice, convoluted clauses, and hedging language that creep into typed prose are largely absent from spoken prose because they are unnatural to say aloud.

Post-Interview Note Processing

You have just finished a 45-minute interview. Your notebook has fragments, shorthand, key quotes underlined. You need to process this into usable notes before the details fade from memory. This is where voice-to-text transforms the workflow.

Instead of typing up your notes, speak them. Hold the Steno hotkey and narrate your interview in full sentences: "The mayor confirmed that the budget shortfall is approximately $12 million, which is larger than the $8 million figure that was reported last month. She attributed the increase to revised revenue projections for property tax receipts. Key quote: 'We are going to have to make some very difficult choices about services that residents rely on.' She would not specify which services are at risk but said an announcement would come within two weeks."

This process accomplishes three things simultaneously. It creates a clean, searchable record of the interview. It forces you to organize the information into coherent narratives. And it identifies the key quotes and data points while the interview is still fresh in your mind.

Writing from the Field

Not every story is written at a desk. Feature writers, investigative reporters, and foreign correspondents often need to write in environments where typing is impractical: the back of a car, a noisy cafe, a hotel room with unreliable Wi-Fi and a deadline in 20 minutes.

Steno runs on any Mac -- MacBook Air, MacBook Pro, even older machines. It requires only a brief internet connection to send the audio for transcription. The audio clips are small (seconds to a couple of minutes each), so even a slow or intermittent connection is sufficient. You can dictate paragraphs during windows of connectivity and build your story incrementally.

The hold-to-speak model is particularly valuable in noisy environments. The microphone is only active while you are holding the hotkey, so background noise between dictation bursts does not interfere. And Whisper's noise-robust architecture handles ambient sounds (cafe chatter, street noise, air conditioning) remarkably well.

Story Structuring and Brainstorming

Before writing, many journalists benefit from talking through their story structure. What is the lede? What is the nut graf? What are the key scenes? What is the kicker? Voice-to-text makes this process tangible.

Open a blank document and dictate your story plan: "I am thinking of leading with the scene at the school board meeting where the parents stood up and walked out. That is the most dramatic moment and it captures the conflict. Then I will back up and explain the policy change that triggered the walkout. Third section will be the reaction from the superintendent. Then I want to include the broader context about how this fits into the nationwide debate. I will end with the parent who organized the walkout and what she plans to do next."

You now have a written outline that took 30 seconds to create. Convert it into section headers and start filling in each section, again by voice.

Practical Tips for Journalists

Dictate Quotes Separately

When you are writing a story that includes direct quotes, dictate the narrative sections by voice and then type in the exact quotes from your notes or recording. Quotes require word-perfect accuracy, and it is easier to type them from your source material than to recite them from memory.

Use Steno's History Feature

Steno keeps a history of your last 100 transcriptions. If you dictated a great paragraph but accidentally pasted it in the wrong window or lost it in a CMS crash, you can retrieve it from history. For journalists who work in chaotic, time-pressured environments, this safety net is invaluable.

Develop a Pre-Dictation Habit

Before pressing the hotkey, take five seconds to organize the sentence or paragraph in your mind. The best dictation comes from clear thinking, not from stream-of-consciousness rambling. Think, then speak.

Edit on a Second Pass

Resist the urge to perfect each paragraph as you go. Dictate the entire story first, then go back and edit. The first pass is about getting the content down. The second pass is about getting it right. Separating these two cognitive tasks makes both faster.

Why Steno Over Other Tools

Journalists need a tool that is fast, reliable, and works everywhere. Steno transcribes in under a second using Groq's inference hardware. It works in every CMS, email client, messaging app, and text editor because it pastes at the cursor. It lives in the menu bar and uses minimal system resources. And its hold-to-speak model gives you precise control over when the microphone is active, which matters when you are in a newsroom where multiple conversations are happening simultaneously.

Download Steno free at stenofast.com. Your next deadline will thank you.