
Gemini TTS - Native Audio Out

49.0K views
May 28, 2025
13:50

In this video, I look at the Gemini TTS that was released at Google I/O last week and show you how you can use it to do various things with speech and dialogue.

Blog:
https://blog.google/technology/ai/io-2025-keynote/
https://blog.google/technology/google-deepmind/google-gemini-updates-io-2025/#performance

Colab: https://dripl.ink/Dq3gy

For more tutorials on using LLMs and building agents, check out my Patreon.
Patreon: https://www.patreon.com/SamWitteveen
Twitter: https://x.com/Sam_Witteveen

🕵️ Interested in building LLM Agents? Fill out the form below.
Building LLM Agents Form: https://drp.li/dIMes

👨‍💻 Github: https://github.com/samwit/llm-tutorials

⏱️ Time Stamps:
00:00 Intro
00:28 New Gemini 2.5 Speech Generation Text-to-Speech
01:44 Google AI Studio: Native Speech Generation
02:37 Colab Demo: Single Speaker
08:51 Colab Demo: Multi-Speaker Podcast

