Ora is a personal, on-device simultaneous interpreter for macOS that translates speech in real time with streaming partial captions—no external servers. Free.
SpeakMac is an offline speech-to-text dictation app for Mac, converting live voice to text in your active window using on-device processing. 25+ languages.
SpeakON turns iPhone voice into polished text in the app you’re writing in—press one button, speak naturally, and keep working fast.
Harker is a free voice-to-text app for macOS. Dictate anywhere using a global shortcut—upgrade to Premium for AI writing styles, formatting, grammar fixes, and translation.
Grok Speech to Text and Text to Speech APIs by xAI convert audio and text with low-latency REST/WebSocket endpoints, multilingual support, diarization.
Ghost Pepper is a macOS app for voice dictation and meeting transcription, running 100% locally on Apple Silicon and converting speech to text.
Voice and screenshot input for AI builders on Mac. On-device transcription with no cloud upload and no account. One-time $49 download.
Google AI Edge Eloquent turns iPhone dictation into cleaner, context-appropriate text with on-device AI, a personal dictionary, and offline support.
Highlight Studio is a macOS screen recording app to capture, edit, and share polished videos with AI transcription, smart zoom, brand kits, and multi-track editing.
Walkie is a desktop speech-to-text app that inserts dictated text into any app with a hotkey. Choose Fast or Local mode.
MAI-Transcribe-1 is a multilingual speech-to-text model for accurate transcripts across 25 languages, built for batch and low-latency use.
Speak with Claude and listen to voice responses in Claude Voice Mode. Switch between voice and text within the same conversation.
Claras transcribes YouTube videos into instant AI transcripts, adds summaries and a table of contents, and lets you do AI Q&A and export.
Reframe turns your ideas into Instagram and LinkedIn carousel slides with voice dictation and AI—editable design and export for posting.
dictate. for iOS turns voice into text in any app with AI dictation, cloud transcription, multi-language support, and in-keyboard translation.
Search Live is an AI Mode feature in Google Search for interactive voice and camera conversations—available in all languages and locations where AI Mode is offered.
Cohere’s Transcribe converts business audio into precise text for search, analytics, and automation, with structured outputs for RAG pipelines.
Gemini 3.1 Flash Live is Google’s real-time audio and voice model for natural, reliable voice interactions across Google products and developer APIs.
Practice your pitch by recording it and getting instant feedback from Dunky. Refine your delivery with a record → feedback → revise loop.
Type4Me is a macOS speech input tool with real-time transcription and optional LLM prompt-based processing, supporting local offline and cloud streaming.