The AI Voice Newsletter
Posts
Uber Uplevels App With AI Voice Features

Uber Uplevels App With AI Voice Features

May 05, 2026

🔊 Soundcheck

Uber becomes your one-stop AI voice travel & booking app
Voice AI Investment Skyrockets
Voxtral TTS: Open‑Weight Voice Cloning
Guide makes voice agents from text easy

Read time: 4 minutes

🔥 Hot Mic

Big moves, deep dives, and standout stories.

Uber Uplevels App: Hotels & AI Voice

Uber now lets you book hotels, chat via voice AI, and manage travel—all in one app experience.

Uber just unveiled a major evolution beyond rides: a hotels tab in its app powered by Expedia’s inventory, putting over 700,000 properties at users’ fingertips. The company also introduced AI voice booking and a new travel mode at its GO‑GET 2026 showcase in New York. This move is part of Uber’s bid to become an ‘everything app,’ streamlining travel, rides, and delivery into one seamless experience.

Key Points:

Hotels tab provides access to 700,000+ Expedia-listed properties.
Uber One members get 20% off select hotels plus 10% back in credits.
AI voice-booking tool lets users book hands‑free through conversation.
Travel mode offers tailored ride guidance and curated local tips.

Takeaway: Uber is rapidly reshaping itself into a super‑app, weaving together travel inventory, AI‑powered conversations, and delivery logistics—all designed to reclaim users’ time and reduce app fragmentation.

Voice AI Investment Skyrockets

Q1 2026 saw over $7B invested in voice‑AI startups as enterprise use in healthcare and customer service expands.

Venture capital poured over $7 billion into voice‑AI startups during the first quarter of 2026, shattering previous records. The surge reflects renewed confidence as improved voice recognition tech meets enterprise needs. In healthcare, Abridge rolled out AI‑powered note‑taking across Phoenix care centers, relieving clinicians from charting burdens and keeping patients more engaged. Meanwhile, customer service got a boost: Decagon’s voice agents are now handling after‑hours calls and guiding customers through product details with impressive accuracy. Startups like ElevenLabs, Synthesia, and Runway have all raised fresh rounds this year, signaling investor enthusiasm. The global voice recognition market, valued at about $22 billion in 2026, is forecast to nearly triple in the next five years. This moment brings together innovation, capital, and real-world deployment, shifting voice AI from concept to enterprise necessity.

Key Points:

Over $7 billion in voice‑AI startup funding raised in Q1 2026
Abridge deployed AI note‑taking across Phoenix care centers
Decagon’s voice agents now manage after‑hours customer service
ElevenLabs, Synthesia, and Runway raised new funding rounds
Voice recognition market valued at $22 billion in 2026, set to triple

Takeaway: Voice AI is moving fast from experimentation into enterprise infrastructure: massive funding, high‑impact deployments in healthcare and customer service, and a booming market signal that this technology is becoming a foundational business tool rather than a novelty.

Voxtral TTS: Open‑Weight Voice Cloning

Voxtral TTS is a 4B‑parameter open‑weight TTS model delivering real‑time, multilingual voice cloning from just three seconds of audio.

Voxtral TTS is Mistral AI’s first open‑weight text‑to‑speech system, released March 26, 2026. With 4 billion parameters, it delivers natural‑sounding speech in nine languages and clones voices using only three seconds of sample audio. The model supports ultra‑low latency streaming—about 70–90 ms to first audio and a real‑time factor near 9.7x—and runs on consumer hardware like laptops or a single GPU. It gives developers full control by offering downloadable weights under CC BY‑NC 4.0, with commercial options via Mistral’s API.

Key Points:

Released March 26, 2026 as first open‑weight TTS from Mistral AI
4 billion‑parameter multilingual model supporting nine languages
Clones a new voice from as little as three seconds of reference audio
Ultra‑low latency: ~70–90 ms to first audio, ~9.7× real‑time factor

Takeaway: Voxtral TTS shifts voice AI by offering human‑quality, multilingual, zero‑shot voice cloning in an open‑weight format—fast, flexible, and deployable on everyday hardware, putting powerful speech tech firmly in developers’ hands.

Nova 2 Sonic Transforms Text Agents

AWS shows how to convert text agents into fluid voice assistants using Nova 2 Sonic bidirectional model.

AWS shares a step‑by‑step guide on migrating text‑based agents into authentic voice assistants using Amazon Nova 2 Sonic. It outlines the architectural differences, design shifts, and new interaction models required for real‑time speech. The post emphasizes adapting prompts, handling interruptions, managing latency, and preserving conversational context naturally. It also highlights seamless reuse of existing logic, reducing overhead while improving responsiveness and usability.

Key Points:

Compares text‑agent and voice‑agent requirements and design needs.
Nova 2 Sonic unifies ASR, reasoning, tool use, and TTS in one bidirectional model.
Supports asynchronous tool calls, allowing natural conversation amid background tasks.
Handles turn‑taking and interruptions with built‑in voice activity and turn detection.

Takeaway: Transitioning to voice isn’t just wrapping speech around text agents—Nova 2 Sonic reshapes interaction, latency, and prompts while letting teams reuse core logic without managing separate reasoning models, streamlining voice assistant development.

🎙️ Mic Drop

What else is making noise in voice AI.

Maple and TRAY partner to deploy voice-driven ordering solutions at major restaurant chains, accelerating automation in hospitality. (bdtonline.com )

US restaurants adopt new voice AI for phone-based automated food ordering via Maple and TRAY integration. (verdictfoodservice.com )

Home Depot deploys voice AI agents in U.S. stores to replace traditional phone menus and streamline customer service. (trendhunter.com )

SoundHound shares spike on news of major LivePerson partnership and positive industry earnings for voice AI. (foreignpolicyjournal.com )

AudioCodes reports revenue gains powered by conversational AI and UCaaS managed services, indicating strong enterprise demand. (telecompaper.com )

NordVPN launches browser-based AI voice detector to flag synthetic audio in real time, enhancing user security. (ghacks.net )

Deployment of AI agents boosts call center outreach by 4–5x across Aditya Birla Capital’s seven business units. (crnasia.com )

Generative voice AI reduces India's call center jobs, disrupting the global BPO landscape. (outsourceaccelerator.com )

Vobiz.ai raises $1M to expand global number provisioning and outbound AI voice connectivity across 190 countries. (techinasia.com )

DashLoc debuts multilingual AI voice agents for customer calling, lead qualification, and follow-ups in enterprise workflows. (socialsamosa.com )

Vernon Town Hall launches always-on AI voice agent for residents, enabling 24/7 civic engagement. (fox61.com )

Mistral AI releases Voxtral TTS, a new open-source, lightweight speech model for fast, flexible voice synthesis. (mlq.ai )

Uber Uplevels App With AI Voice Features

🔊 Soundcheck

🔥 Hot Mic

Uber Uplevels App: Hotels & AI Voice

Voice AI Investment Skyrockets

Voxtral TTS: Open‑Weight Voice Cloning

Nova 2 Sonic Transforms Text Agents

🎙️ Mic Drop

Nova 2 Sonic Transforms Text Agents