Google has introduced Gemini Live, a new voice interaction feature powered by its gemini AI, to enhance voice assistants. Announced at the made by google 2024 event, this feature aims to rival openai’s advanced voice mode for chatgpt, which was launched in limited alpha earlier this year.
Immersive Voice Conversations
Gemini live allows users to engage in deep voice conversations with google’s AI chatbot, gemini, on their smartphones. Thanks to an upgraded speech engine, gemini live delivers more expressive and realistic dialogues. Users can interrupt the chatbot to ask follow-up questions. The AI adapts in real-time, creating a more natural experience.
Personalized Interaction
Users can choose from ten natural-sounding voices that the AI can respond with, making interactions feel more personalized. Additionally, the feature is hands-free. Users can continue their conversation even when the phone is locked or the app is running in the background. Conversations can be paused and resumed at any time.
Enhanced Memory and Context
One of the key advantages of gemini live over its competitors is its improved memory. The AI model, based on gemini 1.5 pro and gemini 1.5 flash, has a longer-than-average context window. It can handle extended conversations, maintaining context and coherence throughout.
Real-World Performance
Google claims gemini live’s architecture is adapted for conversational use. It is particularly effective in scenarios where lengthy, back-and-forth dialogue is necessary. However, the actual performance remains to be seen in real-world use.