Arduino Speech Synthesis Module

Learning Contrastive Emotional Nuances in Speech Synthesis

Abstract: Prosody is a crucial speech feature in emotional text-to-speech (TTS), as different emotions have distinct prosodic characteristics. Existing works in emotional TTS have primarily utilized ...

GitHub

mezonai/mezon-noise-suppression

AI-powered noise suppression for real-time audio processing with LiveKit. Based on the DeepFilterNet paper and implementation by Rikorose.

IEEE

Augmenting Short Enrollment Speech via Synthesis for Target Speaker Extraction

Abstract: A high-quality enrollment speech is crucial to target speaker extraction (TSE), since it provides essential cues for identifying the target speaker in the mixture. However, real applications ...
