News

This is why we have designed and developed a suite of configurable software pipelines with Python Luigi for speech-data preprocessing, feature extraction, fold construction for cross-validation, ...
This project accelerates MFCC extraction using CUDA for real-time speech recognition. Offloading the process to the GPU reduces latency and speeds up processing, enabling fast, local speech-to-text ...
A consistent part of gas sensor research activities aims to improve sensing performances by synthesizing new sensing materials, improving the selection of elements in arrays, and optimizing the ...
This represents a significant 5% improvement over the existing standalone system that uses uncoded MFCC features. These findings highlight that the Polar codes can be effectively utilized in speaker ...
While extensive research has been conducted in the field of biometrics, particularly in face and fingerprint recognition, remote speaker recognition has yet to gain global acceptance due to challenges ...
MATLAB code for audio signal processing, emphasizing Real Cepstrum and MFCC feature extraction. Reads a wave file, applies Hamming and Rectangular windows, then computes Real Cepstrum. Utilizes MATLAB ...
With THINGSvision, we provide a Python toolbox that enables researchers to extract features for most state-of-the-art neural network models for existing or custom image datasets with just a few lines ...
Discover the power of Wake-Up-Word Speech Recognition (WUW-SR) with advanced feature extraction using MFCC, LPC, and ENH_MFCC. Explore our FPGA design for real-time spectrogram extraction and learn ...