Lucidly 是一款面向 iPhone 的 AI 沟通教练应用,通过每日简短开口练习和针对性反馈,帮助你把想法说清楚,适合会议、面试、演示等高压对话场景。
Gemini 3.5 Live Translate is Google’s near real-time speech translation model for developers, Google Meet, and the Google Translate app. It supports 70+ languages and is designed to produce natural-sounding translated audio during live conversations.
speech-core is a C++17 library for on-device speech orchestration, including VAD, streaming and batch STT, diarization, TTS, and a voice-agent pipeline. It runs locally and uses optional ONNX Runtime or LiteRT backends for model inference.
Krisp Voice Translation API is a self-serve real-time speech translation API for developers. It translates live speech, returns translated audio and transcripts, and includes background voice cancellation plus custom vocabulary support.
Vox is an on-device AI dictation app for Mac and Windows. It lets you speak, clean up the output, and paste text locally without a cloud round-trip or account sign-up.
Wave 是一款原生 macOS 语音听写应用,可通过按住说话快捷键将语音转换为文本,支持本地 Whisper 和可选 Groq 模式以加快转写。免费开源,应用本身无需账号。
LocalClicky 是一款适用于 macOS 的本地语音助手,可通过语音控制应用、文件、提醒事项和浏览器操作。支持唤醒词、离线转写、基于 Ollama 的推理及可选屏幕视觉识别,无需云端 API 或订阅。
Clarafy is a browser writing assistant that polishes text directly where you type, with app-aware rewriting, voice dictation, and a synced web hub for account and history management.
Shadow 是一款适用于 Mac 的 AI 界面,可捕捉会议内容和屏幕上下文,并通过可编辑 Skills 生成笔记、回复、转录稿及其他基于提示词的输出。提供免费方案,Plus 可解锁无限 AI 功能。
AutoSubtitles 是一款基于浏览器的 AI 字幕生成器,可为视频生成、样式化并导出带内嵌字幕的成片,支持创作者和团队快速完成无需注册的社媒、教育与客户视频制作。
Trace 是一款适用于 macOS 的会议转录应用,可在本地录制并转写麦克风和系统音频,无需云端账号或会议机器人。生成 Markdown 转录稿、内嵌关键时刻与本地会话文件,适合私密笔记流程。
Ringg Parrot STT V1 is a real-time speech-to-text API for voice AI agents, contact centers, and transcription workflows. It supports Hindi, English, and code-mixed speech, with a playground for evaluation and production access handled by Ringg AI approval.
TongueType is a macOS voice dictation app that turns speech into text with a hold-to-record hotkey and local Whisper AI processing. It supports live dictation, file transcription, and privacy-focused workflows for users who want on-device speech input.
Carbon Voice is an asynchronous voice messaging app for teams and individuals, with transcripts, AI catch-up, and cross-device access. It helps people and agents communicate without needing a live call.
SpeakMac is an offline speech-to-text app for Mac that types what you say into the active cursor location. It works locally on Apple Silicon Macs, with one-time pricing, custom words, and multilingual dictation support.
SpeakON is an iPhone voice typing device and app that turns spoken input into cleaned text inside the active app. It removes fillers and backtracks, supports tone adaptation and translation, and is designed to work offline or on the move.
Harker 是一款适用于 Mac 的免费语音转文字应用,可用全局快捷键在任意应用中口述输入。Premium 还提供用于格式整理、翻译和清理的 AI 写作功能。
xAI’s Grok Speech to Text and Text to Speech APIs let developers add transcription and speech generation to apps through REST and WebSocket endpoints. The product supports multilingual STT, expressive TTS, and usage-based pricing.
Ghost Pepper is a macOS dictation and meeting transcription app that runs entirely on-device. It helps users capture speech, clean it up, and save transcripts locally without sending data to the cloud.
Doing is a local voice input app for Mac that lets AI builders dictate text into the focused app, attach screenshots, and submit prompts faster than typing. It runs on-device with no account required and is sold as a one-time purchase.