Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
Abstract: Radar-based classification of human activity has gained considerable attention in recent years because radar sensors remain robust in harsh environments. However, when using machine ...
All the datasets must be located in the datasets folder. This folder should contain the following subfolders after downloading the datasets: GTZAN Speech_Music: Contains the GTZAN Speech Music dataset ...
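A quick way to confirm the layout before running anything is to check that each expected subfolder exists. The sketch below is illustrative only: the helper name `missing_subfolders` is hypothetical, and it assumes the subfolder is literally named `GTZAN Speech_Music`, the only one named in the text above. Extend the list with the remaining subfolders for your setup.

```python
from pathlib import Path


def missing_subfolders(root, expected):
    """Return the names in `expected` that are not directories under `root`."""
    root = Path(root)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    # Assumption: only the subfolder named in the text is listed here.
    expected = ["GTZAN Speech_Music"]
    missing = missing_subfolders("datasets", expected)
    if missing:
        print("Missing dataset subfolders:", ", ".join(missing))
    else:
        print("All listed dataset subfolders are present.")
```

Run from the directory that contains `datasets`, the script prints which of the listed subfolders are absent.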
Autotunable parameters with direct physical interpretation. Easy visualization of all intermediate workflow steps. Collected cluster statistics allow for fine-grained QC and classification of signals.