Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
This repo is the implementation of a research project aimed at enhancing Acoustic Side-Channel Attacks (ASCAs) using a novel combination of Vision Transformers (VTs) and Large Language Models (LLMs).
--output Output path (default: input name + extension) --format jpg or png (default: jpg) --width Output width (default: 1920) --height Output height (default: 1080 ...
Abstract: The application of machine learning in underwater acoustics is often limited by the lack of high-quality data. One method to avoid this data issue is to use modeled data to train a machine ...