Conversational Video Interface (CVI)
Tavus’ CVI is described as an end-to-end pipeline for face-to-face AI, combining perception, dialogue, and real-time rendering so agents can see, hear, and respond in conversation.
Tavus is an AI video platform for building real-time, face-to-face agents, digital twins, and AI companions. It combines APIs, custom replicas, and multilingual conversational workflows for developers and teams.
Tavus is a San Francisco–based AI research lab and developer platform for building human-like video interactions. Its site describes the product as a way to create AI humans that see, hear, and talk face to face in real time, including custom video agents, digital twins, and AI companions.
The core product is the Conversational Video Interface (CVI), an API-first pipeline for real-time AI video conversations. Tavus combines perception, dialogue, and rendering models so teams can build agents that respond with facial behavior, timing, and visual awareness, while also supporting replicas, knowledge sources, and tool use for product workflows.
Tavus’ CVI is described as an end-to-end pipeline for face-to-face AI, combining perception, dialogue, and real-time rendering so agents can see, hear, and respond in conversation.
The platform’s models are split across rendering, perception, and dialogue: Phoenix-4 for facial behavior and animation, Raven-1 for multimodal perception, and Sparrow-1 for conversational timing and turn-taking.
Users can train custom replicas from a short source video, with the pricing page saying custom replica training starts from a two-minute video and includes a custom voice model.
The platform supports uploading knowledge sources such as CSVs, PDFs, TXTs, PPTXs, PNGs, JPGs, or websites so conversations can respond from supplied context rather than only from the base model.
The pricing page lists function calling, memories, objectives and guardrails, and bring-your-own-LLM options so teams can connect Tavus to external tools and adapt the conversation stack.
The site says agents can be deployed in 30+ languages on the home and pricing pages, and 50+ languages on the CVI page, with stock replicas available for quick testing or early builds.
Build a real-time video assistant that can see a user, respond with facial behavior, and maintain a natural conversational flow for product demos or guided interactions.
Create a custom replica from a short source video and use it as a branded digital twin or AI presence in customer-facing workflows.
Use the developer plans to connect conversations to external tools, knowledge bases, and guardrails for flows such as booking meetings, sending quotes, or answering from company documents.
Deploy multilingual agents for audiences that need the same experience across languages, including region-specific support or globally distributed teams.
Use PAL plans for personal or always-on AI companions that support messaging and voice/video calls, with usage levels that scale across free, Plus, and Max tiers.
Tavus provides APIs and a no-code portal for building conversational video experiences. The CVI page says you can start with its defaults and swap in your own LLM, voice, and knowledge stack as you scale.
The site positions Tavus for building video agents, digital twins, and AI companions. The pricing page also separates plans for PALs and for developers building conversations, which suggests both consumer-style and product-facing use cases.
The pricing page lists support for uploading CSVs, PDFs, TXTs, PPTXs, PNGs, JPGs, or websites as conversation context, and it also mentions function calling, memories, and objectives and guardrails.
The site states that Tavus supports 30+ languages on the home and pricing pages, while the CVI page states 50+ languages for agents. The exact language coverage may vary by product area and plan.
The source materials describe pricing tiers and usage limits, but they do not provide full implementation, security, or deployment documentation here. For those details, the product site points users toward docs and enterprise contact paths.
HiringPartner.ai is an autonomous AI recruiting platform for sourcing, screening, and interviewing candidates 24/7. It supports ATS-connected workflows, bulk resume uploads, and reviewable interview outputs for hiring teams.
Lasso is an ecommerce product data platform for enriching catalog records, processing supplier files, generating product content, and monitoring competitors. It combines a web app with a REST API, SDK, and MCP server for teams and developers.
Sanota is an app that turns spoken memories, reflections, and interviews into clear written stories. It supports personal storytelling, family history, and shared memories, with guided prompts and subscription pricing.
AgentMail is an email inbox API for AI agents to create, send, receive, and search messages via REST APIs and SDKs. Built for threaded workflows, verification, support, scheduling, and approvals.
Carbon Voice is an asynchronous voice messaging app for teams and individuals, with transcripts, AI catch-up, and cross-device access. It helps people and agents communicate without needing a live call.
Talkpal is an AI-powered language learning web and mobile app for practicing speaking, listening, writing, and pronunciation. It offers guided courses, roleplays, and call-style conversation practice across 130+ languages.