- The AI Voice Newsletter
- Posts
- Wispr Flow Raises $81M Series A Total
Wispr Flow Raises $81M Series A Total

🔊 Soundcheck
Voice AI startup Wispr secures $81M total funding
Build real‑time voice AI with SIP, RAG, compliance
Hands‑on VibeVoice tutorial: ASR to speech‑to‑speech
Dagbani Voice AI Dataset Workshop Advances Inclusion
Read time: 3 minutes
🔥 Hot Mic
Big moves, deep dives, and standout stories.
Wispr’s voice-first app Flow has closed $81 million in Series A funding to fuel expansion and innovation.
Wispr Flow, the voice-driven productivity tool that turns spoken words into polished writing, has reached a significant milestone: $81 million in cumulative Series A funding. After a $30 million round in mid‑2025, the company raised an additional $25 million later that year, bringing its total capital to $81 million. This infusion of resources signals strong investor confidence in voice‑first computing and Wispr’s broader ambition to build a voice‑native operating system.
Key Points:
Raised $30 M in June 2025 led by Menlo Ventures
Added $25 M Series A extension in Nov 2025 led by Notable Capital
Total funding now stands at $81 M
Fueling expansion across platforms and voice‑model development
Takeaway: Wispr Flow’s $81 million funding highlight points to a growing bet on voice as the future of human‑computer interaction—beyond dictation, toward a full voice‑first interface.
This article walks developers through building real‑time AI voice agents that combine SIP telephony integration, RAG architectures for grounded responses, and compliance guardrails to ensure safe and reliable production use. It outlines end‑to‑end design considerations—from voice input handling and retrieval to compliance enforcement and transactional safety flows. Practical insights help you structure systems that act correctly, stay within regulatory boundaries, and serve real users confidently.
Key Points:
Integrate SIP telephony for live voice input/output
Use RAG to ground agent responses in real data
Embed compliance guardrails to prevent unsafe actions
Design transactional safety flows for accurate execution
Takeaway: A production‑ready voice AI agent needs three pillars—live SIP integration, grounded RAG responses, and strict compliance guardrails—to deliver trustworthy, real‑time service without hallucinations or unsafe behavior.
Guide walks through speaker‑aware ASR, real‑time TTS, and speech‑to‑speech pipelines using VibeVoice.
This walkthrough introduces Microsoft’s VibeVoice in a hands‑on way, using Colab to install dependencies and access the latest models. It covers speaker‑aware transcription, context‑guided ASR, batch audio workflows, expressive real‑time TTS, and a full speech‑to‑speech pipeline. You’ll build everything from environment setup to end‑to‑end voice interaction, gaining practical insight into advanced voice model integration.
Key Points:
Enables speaker‑aware ASR with context guidance
Demonstrates batch audio processing workflows
Shows expressive real‑time text‑to‑speech generation
Builds full speech‑to‑speech pipeline in Colab
Takeaway: This tutorial gives developers a practical, start‑to‑finish code‑first introduction to VibeVoice’s powerful ASR and TTS capabilities, bridging recognition and synthesis in one seamless workflow.
Dagbani community volunteers in Tamale created an open speech dataset via Mozilla Common Voice workshop.
At the end of January 2026, volunteers in Tamale gathered for a two‑day workshop to build voice AI resources for Dagbani speakers. They learned Mozilla Common Voice workflows and contributed annotated and validated recordings of culturally relevant sentences. The initiative emphasizes community involvement, care for linguistic nuance, and building capacity locally. It’s a practical step toward an offline‑capable, mobile‑first speech application in a language often overlooked by major tech.
Key Points:
Workshop held January 31–February 1, 2026 in Tamale for Dagbani voice data creation.
Volunteers trained on annotation, validation, and quality review using Mozilla Common Voice.
Contributions reflect culturally accurate phrasing and dialectal nuance.
Aim is an open dataset feeding an offline‑capable mobile voice application.
Takeaway: This grassroots workshop demonstrates how community‑led efforts can generate high‑quality, culturally rooted speech data for underrepresented languages, fostering both inclusion in AI and local capacity to sustain voice technology efforts.
🎙️ Mic Drop
What else is making noise in voice AI.
DataVisor introduces conversational AI agents to tackle fraud, expanding voice AI’s practical reach in financial services. (joplinglobe.com)
PayU debuts a voice AI assistant to automate merchant onboarding, reducing friction for financial institutions. (ibsintelligence.com)
PartsNow.ai uses AI-powered voice assistants to reshape how shops source truck parts with multimodal interface integration. (overdriveonline.com)
AutoRaptor’s new AI voice agent handles inbound calls for car dealerships, automating lead capture and appointment scheduling. (martechcube.com)
Ship.Cars partners with Axe to deploy AI voice automation, aiming to make logistics communication more efficient. (cbtnews.com)
Hyperfunnel launches a voice AI agent for Samsung smart TVs, bringing conversational sales tech to travel retail. (seatrade-cruise.com)
Google introduces Gemini for Home, offering early access to its latest voice assistant platform in Australia. (blog.google)
Mistral AI announces VoxTral TTS, an open-source, lightweight speech synthesis model aimed at democratizing voice AI. (mlq.ai)
Viettel Telecom and iFLYTEK form partnership to accelerate development and localization of Vietnamese voice AI technologies. (technode.global)
Zebra and Aiva Health team up to deploy voice AI nurse assistants, bringing speech tech into healthcare environments. (investing.com)
Explores whether voice-AI interview platforms can improve candidate experience and trust in automated hiring processes. (benefitnews.com)
Analyzes conversational AI's role in shifting vacation research toward destinationless, highly personalized itinerary creation. (travelandtourworld.com)