Abstract: Given the scarcity of Code-Switching (CS) datasets, most researchers synthesize CS speech using multiple monolingual datasets. However, this approach presents challenges in synthesizing CS ...
Abstract: Underwater acoustic (UA) communication systems have low data rates due to the limited bandwidth of the UA channel. This makes real-time speech communication challenging. In this paper, we ...
KittenTTS brings small text-to-speech models to edge devices; the Nano 8-bit model is about 25 MB, and local playback is possible.