The Startup Using Voice to Detect Depression, Fatigue and Diabetes
Loading video...
Show Notes
In this episode of the Convo AI World Podcast, Hermes Frangoudis interviews Emilia Molimpakis, CEO and Co-founder of thymia, who discusses how biomarker-enriched voice intelligence is becoming critical safety infrastructure for communications platforms. Emilia explains how thymia goes beyond words to understand human state in real time—analyzing not just what people say, but how they say it. By combining objective vocal biomarkers with platform policy, thymia enables voice systems to recognize risk and respond appropriately as conversations unfold.
Key Topics Covered
- •The personal story that inspired Thymia: Emilia's best friend's depression and suicide attempt
- •The three layers of speech analysis: acoustics, content, and timing
- •How voice connects to mental health and the entire nervous system
- •Building a health biomarker dataset: why scraping the internet doesn't work
- •The importance of human-curated data and multimodal datasets (video, EEG, blood biomarkers)
- •Using gamification to engage people with depression in data collection
- •Why voice is a powerful biomarker: ubiquity, ease of access, and signal strength
- •The challenge of isolating health signals from demographic signals
- •Multilingual support across Greek, English, Spanish, Italian, Brazilian Portuguese, Indonesian, and Japanese
- •Building individual longitudinal models where each person becomes their own baseline
- •Evolution from 10-minute assessments to 15 seconds with 0.2-second processing
- •Real-world applications in healthcare (NHS, pharma) and safety-critical environments
- •Health and safety infrastructure layer for voice-first communication
- •The trust domino effect: healthcare trust enabling safety-critical trust
- •Scientific validation through 290+ peer-reviewed publications
- •The surprising discovery that diabetes can be detected through voice
- •How chronic hyperglycemia affects vocal cords and lung capacity
- •Validation pipeline: hypothesis, data gathering, multimodal controls, peer review
- •Real-world deployment challenges: unpredictable human behavior and environmental variability
- •Regulatory challenges: moving from version-based to input/output-based regulation
- •Investor misconceptions about building health biomarker models
- •Overhyped trends: automation vs. deeper AI understanding
- •Human-in-the-loop: augmenting clinicians, not replacing them
- •Future roadmap: regulatory approval, diabetes clinical trials, edge deployment
Episode Chapters & Transcript
Highlights
Preview clips featuring key insights about voice biomarkers, 15-second assessments, driver fatigue detection, and diabetes detection through voice.
Introduction and Thymia's Origin Story
Hermes welcomes Emilia Molimpakis from Thymia. Emilia shares the deeply personal story that inspired Thymia - her best friend's depression and suicide attempt, which led her to quit her postdoc and start a company using voice as a biomarker for health.
The Science Behind Voice Biomarkers
Emilia explains the three layers of speech analysis: acoustics (sound wave properties), content (words and meaning), and timing (pauses, breathing). How these connect to mental health and the entire nervous system, including depression, anxiety, respiratory health, cardiovascular health, and metabolic health.
Building the Dataset
The challenges of building a health biomarker dataset - why scraping the internet doesn't work. The importance of human-curated data, longitudinal collection, and multimodal data (video, behavioral, EEG, blood biomarkers). How gamification helped engage people with depression.
Why Voice is a Powerful Biomarker
Voice's ubiquity and ease of access make it ideal for health monitoring. The challenge of isolating health signals from demographic signals (age, accent, ethnicity). How Thymia differentiates condition-specific signatures from personal characteristics.
Multilingual Support and Personalization
Thymia's multilingual capabilities across Greek, English, Spanish, Italian, Brazilian Portuguese, Indonesian, and Japanese. The breakthrough of building individual longitudinal models where each person becomes their own baseline over time.
15-Second Real-Time Assessment
The evolution from 10-minute assessments to just 15 seconds with real-time processing (0.2 seconds). Hermes shares his experience trying the demo at a train station and being amazed by the accuracy.
Real-World Applications
Thymia's applications in healthcare (NHS, pharma companies) and safety-critical environments. The expansion from mental health to detecting fatigue and stress in automotive, aerospace, construction, mining, and defense industries.
Health and Safety Infrastructure Layer
How Thymia is creating a health and safety infrastructure layer for voice-first communication. Enabling safe voice agent deployment at scale by detecting distress, stress, depression, panic, and pain in real-time voice interactions.
Scientific Validation and Trust
How Thymia gained trust through rigorous science - 290+ peer-reviewed publications from the team, certifications, and medical device regulation. The trust domino effect from healthcare to safety-critical industries.
Discovering Diabetes Detection
The surprising discovery that diabetes can be detected through voice. How chronic hyperglycemia affects vocal cords, lung capacity, and nerve coordination, creating detectable signatures. The rigorous validation process to ensure it's actually diabetes, not correlated factors.
Validation Pipeline and Peer Review
Thymia's validation process: hypothesis, data gathering, multimodal controls, synthetic data generation, model building, productionization, and peer-reviewed publication. Papers currently under review at Nature and other major journals.
Real-World Deployment Challenges
The huge difference between academic research and real-world deployment. Unpredictable human behavior (like users wearing towels over their heads), environmental variability, privacy concerns, and the need to account for background noise, multiple voices, and different devices.
Unexpected Discoveries
Novel effects discovered in voice: how people compensate for loud environments, the need for different models in cars vs. other environments, and handling multiple voices with proper consent and voice locking.
Early Assumptions vs. Reality
How assumptions changed: from needing multimodal data in deployment to voice-only, from 10-minute samples to 15 seconds, from minutes of processing to 0.2 seconds. This speed enabled completely new applications like real-time health and safety infrastructure.
Future Roadmap
2025 as the year of voice. Upcoming milestones: deployment at scale with telecoms and LLM providers, regulatory approval as Class II medical device, diabetes clinical trials comparing against blood biomarkers, respiratory and cardiovascular health models, and edge deployment.
Regulatory Challenges
The biggest challenge: regulators initially forcing companies to regulate specific model versions, which stifles AI iteration. How Thymia helped change this to regulate inputs/outputs instead, allowing model updates while maintaining regulatory compliance. Speaking at EU Parliament to advocate for change.
Investor Misconceptions
Common investor misunderstandings: underestimating the difficulty of building health biomarker models, thinking big companies can do it overnight, and asking 'what models do you use?' when Thymia built everything from scratch. The value of human-curated datasets vs. scraping the internet.
Overhyped Trends: Automation vs. Understanding
The overemphasis on automating existing processes vs. deeper AI understanding. Example: automating broken mental health questionnaires doesn't solve problems - it creates bottlenecks. Thymia's approach: using biomarkers for intelligent triage and differentiation.
Human-in-the-Loop and Augmentation
The importance of clinician-in-the-loop. Thymia can detect symptoms like appetite problems but can't determine root causes - doctors need to ask the right questions. AI augments human capabilities but doesn't replace human empathy and judgment.
Future of Consciousness and Closing
If not working with voice, Emilia would explore consciousness itself - understanding what human consciousness is and whether it can be replicated in AI. Closing remarks and thanks.
Click on any chapter to view its transcript content • Download full transcript
Convo AI Newsletter
Subscribe to stay up to date on what's happening in conversational and voice AI.