In this video, I look at the new Gemini 2.5 text-to-speech (TTS) released at Google I/O last week and show how you can use it to generate single-speaker speech and multi-speaker dialogue.
Blog: https://blog.google/technology/ai/io-2025-keynote/
https://blog.google/technology/google-deepmind/google-gemini-updates-io-2025/#performance
Colab: https://dripl.ink/Dq3gy
For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: https://www.patreon.com/SamWitteveen
Twitter: https://x.com/Sam_Witteveen
🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes
👨‍💻 GitHub:
https://github.com/samwit/llm-tutorials
⏱️ Timestamps:
00:00 Intro
00:28 New Gemini 2.5 Speech Generation Text-to-Speech
01:44 Google AI Studio: Native Speech Generation
02:37 Colab Demo: Single Speaker
08:51 Colab Demo: Multi-Speaker Podcast