-
Concordia University and Mila Quebec AI Institute
- Montreal, Canada
Starred repositories
Simple Zero-Resource Syllable Tokenization for Spoken Language Modeling
anon-uscf / uscf
Forked from kamperh/linearvcUniversal Speech Content Factorization
Voice conversion with just linear regression.
[Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"
DUSTED: Spoken-Term Discovery using Discrete Speech Units
Unsupervised Rhythm Modeling for Voice Conversion
SoftVC VITS Singing Voice Conversion
Training code and trained checkpoints for ASGAN.
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Recipe for Towards localisation of keywords in speech using weak supervision
An 16kHz implementation of HiFi-GAN for soft-vc.
Soft speech units for voice conversion
Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
SoundFile is an audio library based on libsndfile, CFFI, and NumPy
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Evaluation functions for music/audio information retrieval/signal processing algorithms.
A Generative Flow for Text-to-Speech via Monotonic Alignment Search