Spectrogram to Audio Python

Performance Analysis of CNN-Based Spectrogram with Multiple Audio Feature Types for English Digit Recognition

Abstract: Audio feature selection and neural network architecture play crucial roles in speech recognition performance. This paper presents a comparative analysis of Artificial Neural Networks (ANNs) ...

Edex Live

Listening to the Forest: An AI innovator’s mission to protect humans and wildlife

By Atharva Agrawal. Growing up in the Tiger Capital of India, Nagpur, a city surrounded by some of the country’s most eminent wildlife sanctuaries, including Pen ...

GitHub

DCASE2025_TASK3_Stereo_PSELD_Mamba

This repo contains code for our proposed method for DCASE 2025 Task 3: stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling [1]. For more information, ...

GitHub

MalcolmStran/whisper-subtitle-translator

A complete video subtitle translation pipeline with a modern web interface. It uses OpenAI Whisper for speech-to-text transcription and Google Translate for multi-language subtitle generation.

IEEE

Contrastive Audio Spectrogram Transformer for Robust Recognition of Environmental Acoustic Signals

Abstract: In recent years, environmental sound classification has become an essential component in intelligent urban monitoring systems, smart infrastructure, and public noise analysis. However, this ...
