Search

ALL
Blog
Research Areas
Publications
News
Others

Blog(31)

[INTERSPEECH 2025 Series #9] Fairness in Dysarthric Speech Synthesis: Understanding Intrinsic Bias in Dysarthric Speech Cloning using F5-TTS
Voice cloning, especially zero-shot speech synthesis, has become one of the most exciting frontiers in speech technology.
Multi-task Learning for Speech Emotion Recognition in Naturalistic Conditions
Speech Emotion Recognition (SER) is a crucial task in human-computer interaction, enabling applications such as mental health monitoring, affective computing, and customer service automation.
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization
Research in Automatic Speech Recognition (ASR) continues to show that larger models yield better results. But while state-of-the-art networks continue to grow with billions of parameters, the difficulty of deploying these models on device also increases.

View More

Research Areas(0)

Publications(42)

View More

News(10)

View More

Others(0)