Abstract: In this study, we explore the use of Vector Quantized Variational Autoencoders (VQ-VAE) for real-time audio spectrogram inpainting, with a focus on minimizing environmental impact. We ...
Abstract: While DCGAN, a deep learning model that operates on spectrograms, allows for the detection of deepfake audio, it is prone to overfitting, which affects its ability to discriminate between real and fake ...
This work implements the Audio Spectrogram Transformer on the ShipsEar database (a private underwater vessel noise database), which serves as the benchmark for various models on underwater noise classification ...