In this episode we are joined by our very own Andrew Seagraves, VP of Research at Deepgram, explores text-to-speech (TTS) technology and language modeling. With a PhD from MIT and a background in AI-driven explosive design, Andrew now leads advanced speech recognition research. He discusses the challenges of creating natural-sounding TTS systems, the role of context conditioning, and his career journey from MIT to Deepgram.
Episode Highlights:
- Andrew Seagraves shares his insights on why language modeling poses such a complex challenge, particularly in the domain of text-to-speech systems.
- Seagraves discusses how future developments promise to address these issues dramatically.
- From his initial steps at Deepgram working on speech recognition and diarization, to his current focus on scaling models for varied languages and contexts—discover Andrew Seagraves' transformative journey in AI.
- Andrew’s fascinating career trajectory, from designing defense technologies at MIT to spearheading voice technology innovations used by global leaders like Spotify and NASA.
- Demetrios and Seagraves express excitement for the near future of TTS technology, hinting at groundbreaking features that will redefine our interaction with digital devices.
-------------------------------------------------------------
Connect with Andrew Seagraves
https://www.linkedin.com/in/seagravesan/
Connect with Demetrios:
https://www.linkedin.com/in/dpbrinkm/
Connect with Deepgram:
https://deepgram.com/
https://www.linkedin.com/company/deepgram
https://x.com/deepgramai
What is AIMinds?
Welcome to the AI Minds podcast, where we explore how the companies of tomorrow are built with an AI-first approach.
This series of episodes are brought to you by Deepgram the number one Speech to Text, Text to Speech, and voice API on the internet trusted by the world’s top Enterprises, Conversational AI Leaders, & Startups like Spotify, Twilio, NASA and Citi Bank.