Building Voice Agents with Gemini Live API and Agora’s Conversational AI
Apr 29, 2026•Channel
AI Analysis
Data from YouTube Data API v3•Updated Just now
Video Overview
Video Details
Published1 month ago
Duration9:24
Video ID2ltcbA2CCTo
Languageen
CategoryScience & Technology
PrivacyPublic
Made for KidsNo
Video TypeRegular Video
Performance Metrics
Views6K
Likes177
Comments11
Engagement Rate3.14%
Likes per 100 views2.96
Comments per 1K views1.84
Video Tags
Description
Mason from Agora walks through how to drop Gemini 3.1 Flash Live into Agora's real-time voice and video infrastructure. Speech-to-speech with multilingual switching, sub-second latency, and tool calls wired to actual hardware.
What's covered: cloning the Agora agent quick start, configuring App ID and certificate in the Agora console, enabling conversational AI, swapping the default chained pipeline (STT, LLM, TTS) for Gemini Live in a single SDK method, and pointing the WebSocket at Google's server. Plus two live demos: a Reachy Mini robot calling 70+ tool emotes mapped to physical motors, and a food ordering agent (Foodgora) handling cart updates and recommendations in real time.
Grab your Gemini API key at Google AI Studio and your Agora credentials at agora.io to get started.
Resources:
Gemini Live API overview → https://goo.gle/4tFoFeK
GitHub examples → https://goo.gle/4uj3HCw
What are you building with the Gemini Live API? Drop it in the comments.
Subscribe to Google for Developers → https://goo.gle/developers
Speaker: Mason Adams
Products Mentioned: Google AI, Gemini