- The AI Voice Newsletter
- Posts
- Uber Uplevels App With AI Voice Features
Uber Uplevels App With AI Voice Features

đ Soundcheck
Uber becomes your one-stop AI voice travel & booking app
Voice AI Investment Skyrockets
Voxtral TTS: OpenâWeight Voice Cloning
Guide makes voice agents from text easy
Read time: 4 minutes
đĽ Hot Mic
Big moves, deep dives, and standout stories.
Uber now lets you book hotels, chat via voice AI, and manage travelâall in one app experience.
Uber just unveiled a major evolution beyond rides: a hotels tab in its app powered by Expediaâs inventory, putting over 700,000 properties at usersâ fingertips. The company also introduced AI voice booking and a new travel mode at its GOâGET 2026 showcase in New York. This move is part of Uberâs bid to become an âeverything app,â streamlining travel, rides, and delivery into one seamless experience.
Key Points:
Hotels tab provides access to 700,000+ Expedia-listed properties.
Uber One members get 20% off select hotels plus 10% back in credits.
AI voice-booking tool lets users book handsâfree through conversation.
Travel mode offers tailored ride guidance and curated local tips.
Takeaway: Uber is rapidly reshaping itself into a superâapp, weaving together travel inventory, AIâpowered conversations, and delivery logisticsâall designed to reclaim usersâ time and reduce app fragmentation.
Q1 2026 saw over $7B invested in voiceâAI startups as enterprise use in healthcare and customer service expands.
Venture capital poured over $7âŻbillion into voiceâAI startups during the first quarter of 2026, shattering previous records. The surge reflects renewed confidence as improved voice recognition tech meets enterprise needs. In healthcare, Abridge rolled out AIâpowered noteâtaking across Phoenix care centers, relieving clinicians from charting burdens and keeping patients more engaged. Meanwhile, customer service got a boost: Decagonâs voice agents are now handling afterâhours calls and guiding customers through product details with impressive accuracy. Startups like ElevenLabs, Synthesia, and Runway have all raised fresh rounds this year, signaling investor enthusiasm. The global voice recognition market, valued at about $22âŻbillion in 2026, is forecast to nearly triple in the next five years. This moment brings together innovation, capital, and real-world deployment, shifting voice AI from concept to enterprise necessity.
Key Points:
Over $7âŻbillion in voiceâAI startup funding raised in Q1 2026
Abridge deployed AI noteâtaking across Phoenix care centers
Decagonâs voice agents now manage afterâhours customer service
ElevenLabs, Synthesia, and Runway raised new funding rounds
Voice recognition market valued at $22âŻbillion in 2026, set to triple
Takeaway: Voice AI is moving fast from experimentation into enterprise infrastructure: massive funding, highâimpact deployments in healthcare and customer service, and a booming market signal that this technology is becoming a foundational business tool rather than a novelty.
Voxtral TTS is a 4Bâparameter openâweight TTS model delivering realâtime, multilingual voice cloning from just three seconds of audio.
Voxtral TTS is Mistral AIâs first openâweight textâtoâspeech system, released March 26, 2026. With 4 billion parameters, it delivers naturalâsounding speech in nine languages and clones voices using only three seconds of sample audio. The model supports ultraâlow latency streamingâabout 70â90 ms to first audio and a realâtime factor near 9.7xâand runs on consumer hardware like laptops or a single GPU. It gives developers full control by offering downloadable weights under CC BYâNC 4.0, with commercial options via Mistralâs API.
Key Points:
Released March 26, 2026 as first openâweight TTS from Mistral AI
4âŻbillionâparameter multilingual model supporting nine languages
Clones a new voice from as little as three seconds of reference audio
Ultraâlow latency: ~70â90âŻms to first audio, ~9.7Ă realâtime factor
Takeaway: Voxtral TTS shifts voice AI by offering humanâquality, multilingual, zeroâshot voice cloning in an openâweight formatâfast, flexible, and deployable on everyday hardware, putting powerful speech tech firmly in developersâ hands.
AWS shows how to convert text agents into fluid voice assistants using NovaâŻ2âŻSonic bidirectional model.
AWS shares a stepâbyâstep guide on migrating textâbased agents into authentic voice assistants using Amazon NovaâŻ2âŻSonic. It outlines the architectural differences, design shifts, and new interaction models required for realâtime speech. The post emphasizes adapting prompts, handling interruptions, managing latency, and preserving conversational context naturally. It also highlights seamless reuse of existing logic, reducing overhead while improving responsiveness and usability.
Key Points:
Compares textâagent and voiceâagent requirements and design needs.
NovaâŻ2âŻSonic unifies ASR, reasoning, tool use, and TTS in one bidirectional model.
Supports asynchronous tool calls, allowing natural conversation amid background tasks.
Handles turnâtaking and interruptions with builtâin voice activity and turn detection.
Takeaway: Transitioning to voice isnât just wrapping speech around text agentsâNovaâŻ2âŻSonic reshapes interaction, latency, and prompts while letting teams reuse core logic without managing separate reasoning models, streamlining voice assistant development.
đď¸ Mic Drop
What else is making noise in voice AI.
Maple and TRAY partner to deploy voice-driven ordering solutions at major restaurant chains, accelerating automation in hospitality. (bdtonline.com)
US restaurants adopt new voice AI for phone-based automated food ordering via Maple and TRAY integration. (verdictfoodservice.com)
Home Depot deploys voice AI agents in U.S. stores to replace traditional phone menus and streamline customer service. (trendhunter.com)
SoundHound shares spike on news of major LivePerson partnership and positive industry earnings for voice AI. (foreignpolicyjournal.com)
AudioCodes reports revenue gains powered by conversational AI and UCaaS managed services, indicating strong enterprise demand. (telecompaper.com)
NordVPN launches browser-based AI voice detector to flag synthetic audio in real time, enhancing user security. (ghacks.net)
Deployment of AI agents boosts call center outreach by 4â5x across Aditya Birla Capitalâs seven business units. (crnasia.com)
Generative voice AI reduces India's call center jobs, disrupting the global BPO landscape. (outsourceaccelerator.com)
Vobiz.ai raises $1M to expand global number provisioning and outbound AI voice connectivity across 190 countries. (techinasia.com)
DashLoc debuts multilingual AI voice agents for customer calling, lead qualification, and follow-ups in enterprise workflows. (socialsamosa.com)