Official implemention for the paper "Frame-level emotion state alignment method for speech emotion recognition", which is submitted to ICASSP 2024. I am very sorry, I have carefully checked the data ...
Overview of Our Pipeline. We take 2D tracks and depth maps generated by off-the-shelf models as input, which are then processed by a motion encoder to capture motion patterns, producing featured ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results