VGGVox for PyTorch
A PyTorch implementation of the VGGVox network, based on the descriptions given in the following papers:
- A. Nagrani, J. S. Chung, A. Zisserman, VoxCeleb: a large-scale speaker identification dataset, INTERSPEECH, 2017
- S. Albanie, A. Nagrani, A. Vedaldi, A. Zisserman, Emotion Recognition in Speech using Cross-Modal Transfer in the Wild, ACM Multimedia, 2018
This repository contains the implementation of the VGGVox network itself, some utility functions for audio processing and an example DataLoader for audio files.
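
To give a rough idea of the kind of audio processing involved, here is a minimal sketch of the spectrogram preprocessing described in the VoxCeleb paper (25 ms Hamming window, 10 ms step, mean and variance normalization per frequency bin). It is not this repository's code: the function name `spectrogram`, its parameters, the 16 kHz sample rate, the 1024-point FFT and the use of NumPy are all assumptions made for illustration, and the utilities shipped here may differ.

```python
import numpy as np


def spectrogram(wave, sample_rate=16000, win_ms=25, hop_ms=10, n_fft=1024):
    """Hypothetical helper: magnitude spectrogram normalized per frequency bin.

    Returns an array of shape (n_fft // 2, n_frames).
    """
    wave = np.asarray(wave, dtype=np.float64)
    win = int(sample_rate * win_ms / 1000)  # window length in samples
    hop = int(sample_rate * hop_ms / 1000)  # step between windows in samples
    if len(wave) < win:
        raise ValueError("waveform is shorter than one analysis window")

    # Slice the waveform into overlapping frames and apply a Hamming window.
    n_frames = 1 + (len(wave) - win) // hop
    idx = np.arange(win)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = wave[idx] * np.hamming(win)

    # Magnitude spectrum per frame; keep the first n_fft // 2 bins (assumption).
    mag = np.abs(np.fft.rfft(frames, n=n_fft, axis=1))[:, : n_fft // 2]
    mag = mag.T  # (frequency bins, frames)

    # Mean and variance normalization on every frequency bin.
    mean = mag.mean(axis=1, keepdims=True)
    std = mag.std(axis=1, keepdims=True)
    return (mag - mean) / (std + 1e-8)


if __name__ == "__main__":
    # Three seconds of synthetic noise stands in for a real recording.
    rng = np.random.default_rng(0)
    spec = spectrogram(rng.standard_normal(3 * 16000))
    print(spec.shape)
```

The resulting array could be converted with `torch.from_numpy` and given batch and channel dimensions before being passed to a network, but the input shape the VGGVox model in this repository expects should be checked against its own code.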