Java Speech API Synthesis

UnitDiff: A Unit-Diffusion Model for Code-Switching Speech Synthesis

Abstract: Given the scarcity of Code-Switching (CS) datasets, most researchers synthesize CS speech using multiple monolingual datasets. However, this approach presents challenges in synthesizing CS ...

GitHub

An OpenAI API compatible text to speech server.

For a minimal docker image with only piper support (<1GB vs. 8GB), use docker compose -f docker-compose.min.yml up usage: speech.py [-h] [--xtts_device XTTS_DEVICE ...

IEEE

Severity-controllable pathological text-to-speech synthesis for clinical applications

Abstract: The paper presents a new pathological text-to-speech (TTS) synthesis system that has the ability to control speech severity using latent interpolations. Recognising the difficulty of this ...

PR Newswire

DeepL Launches Voice API for Real-Time Speech Transcription and Translation enabling instant multilingual communication

COLOGNE, Germany, Feb. 3, 2026 /PRNewswire/ -- DeepL, a global AI product and research company, today announced the general availability of DeepL Voice API. This innovative product empowers developers ...

GitHub

BigDaddyAman/youtube-transcript-api

A lightweight FastAPI service that extracts YouTube video captions (no speech-to-text). No video or audio downloads — just clean, structured captions returned as ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results