Convo AI World

A podcast from Agora

Explore voice-first conversational AI through honest conversations with practitioners. Hear from AI builders, infra engineers, product strategists, and more for the latest insights on what it takes to build best-in-class conversational AI experiences.

AI Practitioners

Industry Insights

Technical Deep Dives

Best Practices

Listen on Your Favorite Platform

Available on all major podcast platforms

18 episodes

00:41:35

How RiseLink is Giving Every Object a Brain

In this episode of the Convo AI World Podcast, host Hermes Frangoudis interviews Diana Zhu, Head of the US Office at RiseLink, about the company's development of ultra-low-power edge AI chips designed to make hardware more intelligent and accessible. Diana highlights the BK7259 chip, which uses a multi-core architecture and an on-board NPU to run complex tasks like facial recognition and voice processing entirely on-device for enhanced privacy and speed. Through a strategic partnership with Agora, RiseLink delivers ultra-low latency and barge-in capabilities, allowing for natural, real-time dialogue in applications ranging from smart home security and e-bikes to interactive educational toys. Diana concludes by envisioning a multimodal future where everyday objects leverage local memory and AI to act as personalized, interactive assistants.

#riselink #diana zhu

01:03:34

From Code to Cosmos: The First AI Astrologer

In this episode of the Convo AI World Podcast, Rishi Ahluwalia interviews Punit Pandey, Co-founder and CIO of Astrosage, who discusses the intersection of astrology and AI. Punit shares his journey, including pivotal moments that led him to realize the potential of conversational AI in transforming astrology. He explains the technology behind the AI astrologer, the importance of emotional intelligence in training AI models, and the challenges of building trust with users. The conversation also touches on the future of astrology as a life guide, the ethical considerations in AI, and the roadmap for Astrosage AI. Punit emphasizes the need for innovation rooted in empathy and understanding human emotions, showcasing how ancient wisdom can harmonize with modern technology.

#astrosage #punit pandey

00:18:50

Meet Fuzozo: The Pocket-Sized Robot Ending the Loneliness Crisis

In this special episode of the Convo AI World Podcast, Patrick Ferriter interviews Joe Zhaozhi Sun, CEO and co-founder of RoboPoet, at the Fuzozo store in Shanghai to discuss the origins and technology behind the Fuzozo AI companions. They explore the product's design philosophy, which draws on Joe's background in automotive and robot design to prioritize emotional connection and "healing" loneliness for Gen Z users. Joe details the unique "five elements" personalities of the robots and their "Multimodal Emotional Model" (MEM), which uses Agora's technology to achieve low-latency voice interactions and maintain long-term memory of user conversations. The discussion highlights impressive engagement metrics, with users averaging 50 minutes of daily interaction and reporting significant emotional improvement after talking to their devices. The episode concludes with RoboPoet's strategy for international expansion into Japan and Joe's vision for a new category of "personal robots" that function as ongoing emotional companions.

#fuzozo #robopoet

00:54:15

Agnes AI: This is How You Win Southeast Asia

In this episode of the Convo AI World Podcast, Derek Zheng interviews Bruce Yang, Founder and CEO of Agnes AI, discussing the platform's meteoric rise to a $100 million valuation and its acquisition of 3 million users in Southeast Asia. They explore Agnes AI's positioning as a mobile-first, "sovereign AI" that blends productivity with social networking, distinguishing itself from competitors like Manus through a focus on local cultural nuances and social algorithms. Bruce shares insights on their technical architecture—including the proprietary Ava model and token-efficient code agents—and explains their strategy of prioritizing daily active users over immediate monetization by offering premium features for free. The conversation highlights the collaboration with Agora to power real-time group chat features and concludes with Bruce's vision for the future of AI-native social networks and voice-first interactions.

#agnes ai #bruce yang

00:34:32

Reimagining the Future of Learning through Conversational AI with Physics Wallah's Supreet Singh

In this episode of the Convo AI World Podcast, Rishi Ahluwalia interviews Supreet Singh, Senior Director of Engineering at Physics Wallah, discussing the transformative role of EdTech and conversational AI in education. Supreet shares his journey from a small city in India to leading engineering efforts in one of the largest EdTech companies. The conversation explores the evolution of conversational AI, its potential to personalize learning, and the challenges of multilingual education in India. Supreet emphasizes the importance of adaptability in AI, the need for transparency, and the future outlook of AI companions in education.

#physics wallah #supreet singh

00:54:24

Relatability Over Perfection in Voice AI with Rime's Lily Clifford

In this episode of the Convo AI World Podcast, Hermes Frangoudis interviews Lily Clifford, CEO of Rime AI, discussing the evolution of voice AI technology. They explore the importance of high-quality data, the impact of linguistic nuances on voice models, and the challenges of creating relatable and multilingual voice agents. Lily shares insights on customer experience, the role of R&D in meeting market demands, and the future of conversational voice agents. The conversation highlights the technical bottlenecks in voice AI and the ongoing quest for more human-like interactions in voice technology.

#rime ai #lily clifford

00:47:38

Humanizing Learning in the Age of AI with Colaberry's Ram Katamaraja

In this episode of the Convo AI World Podcast, Hermes Frangoudis interviews Ram Katamaraja, the founder of Colaberry. They discuss the origin of Colaberry, its mission to help individuals transition into tech jobs, and the impact of AI on workforce development. Ram shares insights on the importance of upskilling, the role of conversational AI in enhancing user experience, and the future of work as AI becomes more integrated into various industries. The conversation also touches on the challenges of recruitment in the age of AI and the need for a builder mindset in utilizing AI effectively.

#colaberry #voice ai

00:38:35

Redefining Live Entertainment and the Creator Economy with Eloelo's Sagar Gaonkar

In this episode of the Convo AI World Podcast, Rishi Ahluwalia speaks with Sagar Gaonkar, Chief Technology Officer at Eloelo, about the transformative role of conversational AI in the live entertainment industry. They discuss Sagar's extensive experience in video streaming, the challenges of integrating AI in a diverse linguistic landscape like India, and the innovative use cases that conversational AI can unlock for creators. Sagar emphasizes the importance of experimentation, understanding user personas, and the need for tailored solutions in the evolving AI landscape. The conversation also touches on building effective teams for AI development and the metrics for measuring success in AI adoption.

#conversational ai #live entertainment

00:39:14

Real-Time Avatars, Translation, and Visual Storytelling with Akool's Jeff Lu

In this episode of the Convo AI World podcast, Hermes Frangoudis interviews Jeff Lu from Akool, a company revolutionizing video generation technology. They discuss Akool's origin, its innovative approach to visual storytelling, and the various use cases of its technology in marketing, internal communications, and more. Jeff shares insights on the challenges of balancing quality and cost in video generation, the importance of real-time inference, and advancements in video translation. The conversation also touches on Akool's strategy for staying ahead in the rapidly evolving generative AI landscape and the future of creativity in content creation.

#video generation #akool

00:39:39

AI at the Edge: 6G, Arabic LLMs & the Middle East’s AI Leap with Mérouane Debbah

In this episode of the Convo AI World Podcast, we dive deep into the future of AI, telecom, and the evolving role of conversational interfaces with Prof. Mérouane Debbah, Founding Director of the Khalifa University 6G Research Center and one of the leading minds behind the Arab world’s first large language models, Noor and Falcon.

#6g #arabic llms

00:51:12

The Voice AI and VR Revolution in Heavy Machinery with Carbon Origins' Amogha

In this episode of the Convo AI World Podcast, Hermes Frangoudis interviews Amogha Srirangarajan, Co-founder and CEO of Carbon Origins. They discuss the evolution of Carbon Origins from last-mile delivery robots to heavy machinery teleoperation, the integration of voice AI and VR in enhancing operator experiences, and the future of robotics in construction and space mining. Amogha shares insights on the challenges of labor shortages in critical industries and how Carbon Origins aims to address these through innovative technology and partnerships. The conversation also touches on ambitious plans for energy solutions and space exploration, highlighting the potential of robotics in shaping the future of human civilization.

#carbon origins #heavy machinery

00:33:38

Open-Source Voice Activity Detection with TEN Framework's Ziyi Lin

Ziyi Lin, speech engineer on the TEN Framework team, joins the Convo AI World Podcast to discuss the design and impact of a new open-source Voice Activity Detection (VAD) model. The episode explores the challenges with existing VAD solutions, the importance of high-quality training data, and the design choices that led to improved performance metrics. Ziyi explains how VAD functions as a critical component in conversational AI, managing real-time processing and latency, and the advantages of deploying it on edge devices.

#voice activity detection #VAD

00:40:42

Building AI Community with Voice AI Space

Thibault Mardinli (T-Bot) from Voice AI Space joins to discuss the evolution of Voice AI communities and ecosystems. Hermes and Thibault explore Thibault's journey from building a Voice AI startup to creating an open resource platform, the challenges of discoverability in the fragmented Voice AI landscape, and the democratization of AI expertise through visual interfaces. The conversation covers the spectrum of Voice AI companies from infrastructure to UX-focused products, adoption in emerging markets, privacy considerations, and the future of voice-first interfaces. Thibault shares insights on building global communities, curating quality resources, and the grassroots movement powering Voice AI innovation.

#voice ai #conversational ai

01:06:02

The Science Behind AI Speech Recognition with Deepgram's Andrew Seagraves

Deepgram's VP of Research Andrew Seagraves joins to explore the science and engineering behind modern speech recognition systems. Hermes and Andrew dive deep into why speech recognition isn't a solved problem, the two-stage training process of speech-to-text models, and the challenges of balancing real-time latency with accuracy. The conversation covers Deepgram's origins from dark matter research, power laws in speech data, buffer-based architectures for real-time transcription, and frontier challenges like multilingual code-switching, emotion detection, and conversational dynamics. Andrew shares insights on model deployment, customer use cases from NASA to food ordering, and the future of self-adapting speech models.

#speech recognition #deepgram

00:37:01

AI Content Moderation with Google's Ninny Wan

Google's Ninny Wan, Product Lead for AI Content Safety, joins to discuss the evolution of AI content moderation in the age of GenAI. The conversation covers Google's approach to semantic understanding, multilingual moderation across 140+ languages, synthetic data generation for training, and the balance between user freedom and safety. Ninny shares insights on transformer models, human-in-the-loop processes, cross-functional safety reviews, and Google's on-device privacy-compliant features like sensitive content warnings.

#ai content moderation #google

00:44:29

Interactive Digital Avatars with Trulience's Richard Bowdler

Trulience's Head of Growth Richard Bowdler joins to discuss the world of interactive digital avatars and conversational AI. Hermes and Richard explore how Trulience creates lifelike avatars, the technology behind real-time client-side rendering, multilingual support, and real-world applications from healthcare to customer service. The conversation covers the evolution from capture cages to modern avatar creation, competitive advantages in scalability, and the democratization of AI expertise through visual interfaces.

#interactive avatars #trulience

00:30:47

Real-Time Translation with Palabra's Artem Kukharenko and Ivan Kuzin

In this episode, Palabra's Artem Kukharenko (Co-Founder) and Ivan Kuzin (Head of Business Development) join to discuss the Palabra real-time speech-to-speech translation technology, the inspiration behind Palabra, common misconceptions about AI translation, the balance between latency and accuracy, and the challenges of voice cloning and intonation. The conversation also covers the applications of their technology, user feedback, differentiation in a competitive market, privacy and data security, benchmarking, developer experience, and future advancements in AI and speech translation.

#real-time translation #palabra

00:31:08

Introduction to Conversational AI with Agora's Ben Weekes

Agora's Ben Weekes joins to discuss the world of voice-first conversational AI. Hermes and Ben delve into the differences between voice and chat-based systems, explore the real-world applications of conversational AI, and break down the technology stack involved in creating effective voice agents. The conversation also touches on virtual avatars, infrastructure challenges, and the various conversational AI frameworks available for developers.

#conversational ai #voice ai