🎉 Discrete Neural Codec With 24 Tokens Per Second (24KHZ) for Spoken Language Modeling! Different color lines indicate the data flow used in inference and only for training. During inference, the ...