Abstract: This work introduces a comprehensive Multimodal Emotion Recognition (MMER) method that utilizes facial expression and speech data, using a custom 2-dimensional Convolutional Neural Network ...
Abstract: Due to rapid advancements in deep learning, Transformer-based architectures have proven effective in speech emotion recognition (SER), largely due to their ability to model long-term ...