Signal Recorder SR-7 is a voice recorder app for Mac and iPhone that transcribes audio on device, generates titles and summaries locally, and exports Markdown files.
speech-core is a C++17 on-device voice-agent pipeline engine for VAD, streaming and batch speech-to-text, diarization, and text-to-speech.
Krisp Voice Translation API is a real-time speech-to-speech translation API for live calls and voice apps, with 61 languages, any-to-any pairing, noise cancellation, and custom vocabulary.
Vox is an on-device AI dictation app for Mac and Windows that turns speech into clean text on your clipboard. No account required; works offline.
Wave is a native macOS dictation app that turns speech into text at the cursor, with local Whisper for offline privacy or Groq for faster transcription.
Daisy is an open-source, local-first meeting recorder and dictation app for Mac. It records on-device, saves Markdown transcripts, and works with Claude Desktop or Cursor via local MCP.
LocalClicky is a macOS voice assistant that runs locally to transcribe speech, interpret commands, inspect the screen, and control your Mac offline.
Sun is a realtime voice API for collaborative voice interaction in apps and products, built for live voice experiences beyond one-on-one chat.
Ringg Parrot STT V1 is a low-latency speech-to-text API for real-time and file-based transcription of Hindi, English, and code-mixed speech.
TongueType is a macOS voice dictation app that transcribes speech locally with Whisper AI and inserts text at your cursor. No cloud, accounts, or subscriptions.
Carbon Voice is an asynchronous voice messaging app for teams of people and AI agents, with transcribed updates, voice or text replies, and access from desktop, mobile, watch, and widgets.
Tico is an AI assistant for Windows that listens to your voice questions, understands your screen, and gives spoken guidance with click points.
Snaply records meetings on your Mac and generates a full transcript, clean summary, and action items—processed locally for privacy.
Memoket Gem is a wearable device that records conversations and turns them into usable context for AI tools with a press-once workflow.
AssemblyAI Voice Agent API is an API for building voice agents: stream audio in and receive voice output back. Configure transcription details such as disfluencies, audio tags, and speakers.
Ora is a personal, on-device simultaneous interpreter for macOS that translates speech in real time with streaming partial captions—no external servers. Free.
SpeakMac is an offline speech-to-text dictation app for Mac, converting live voice to text in your active window using on-device processing. 25+ languages.
SpeakON turns iPhone voice into polished text in the app you’re writing in—press one button, speak naturally, and keep working fast.
Harker is a free voice-to-text app for macOS. Dictate anywhere using a global shortcut. Upgrading to Premium adds AI writing styles, formatting, grammar fixes, and translation.
Grok Speech to Text and Text to Speech APIs by xAI convert audio and text with low-latency REST/WebSocket endpoints, multilingual support, and diarization.