- Massachusetts Institute of Technology
- Cambridge, MA
- people.csail.mit.edu/hengjui
- @hjchang87
Highlights
- Pro
- MARBLE Public
  Forked from a43992899/MARBLE
  State-of-the-art pretrained music models for training, evaluation, inference
  Python · Updated Jul 2, 2025
- dscore Public
  Forked from nryant/dscore
  Diarization scoring tools.
  Python · BSD 2-Clause "Simplified" License · Updated May 23, 2025
- ssast Public
  Forked from YuanGongND/ssast
  Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
  Python · BSD 3-Clause "New" or "Revised" License · Updated Apr 10, 2025
- rspin Public
  Official inference code for the NAACL 2024 paper "R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces"
- x-transformers Public
  Forked from lucidrains/x-transformers
  A concise but complete full-attention transformer with a set of promising experimental features from various papers
  Python · MIT License · Updated Feb 27, 2025
- torch-audiomentations Public
  Forked from iver56/torch-audiomentations
  Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
  Python · MIT License · Updated Feb 25, 2025
- s3prl Public
  Forked from s3prl/s3prl
  Self-Supervised Speech Pre-training and Representation Learning Toolkit.
  Python · Apache License 2.0 · Updated Nov 5, 2024
- SpeechTokenizer Public
  Forked from ZhangXInFD/SpeechTokenizer
  This is the code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
  Python · Apache License 2.0 · Updated Jun 3, 2024
- benchmarks Public
  Forked from zerospeech/benchmarks
  A command line tool that helps use the "Zero Resource Challenge" benchmarks
  Python · GNU General Public License v3.0 · Updated Aug 21, 2023
- spin Public
  Official code for the Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"
- espnet Public
  Forked from espnet/espnet
  End-to-End Speech Processing Toolkit
  Python · Apache License 2.0 · Updated Mar 4, 2023
- zr-2021vg_baseline Public
  Forked from bhigy/zr-2021vg_baseline
  Baselines for the Zero-Resources Speech Challenge using Visually Grounded Models of Spoken Language, 2021 edition
  Python · Apache License 2.0 · Updated Jan 26, 2023
- MiniASR Public
  A mini, simple, and fast end-to-end automatic speech recognition toolkit.
- phone-seg-ssl Public
  Forked from lstrgar/ss-phoneme-seg
  Phoneme segmentation using pre-trained speech models
  Python · GNU General Public License v3.0 · Updated Nov 4, 2022
- SBCSAE-preprocess Public
  Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE).
- awesome-self-supervised-learning Public
  Forked from jason718/awesome-self-supervised-learning
  A curated list of awesome self-supervised methods
  1 · Updated Oct 25, 2021
- receptive-field-calculator Public
  A simple receptive field calculator for convolutional neural networks (CNN).
- vectominist.github.io.old Public
  GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
  JavaScript · MIT License · Updated Aug 13, 2021
- A comprehensive list of awesome self-supervised speech representation learning papers.
  machine-learning awesome deep-learning awesome-list representation-learning speech-processing fairseq
  12 · Updated Aug 12, 2021
- eval-word-vectors Public
  Forked from mfaruqui/eval-word-vectors
  Easy to use scripts for evaluating word vectors on a variety of tasks.
  Python · MIT License · Updated Jul 6, 2021
- MedNLP Public
  Mandarin Medical Dialogue Analysis with PyTorch.
- spectra-review-paper-competition Public
  Forked from Mathpix/spectra-review-paper-competition
  Competition for best expository article on cutting-edge ML research
  Updated Mar 2, 2021
- Course-Map-Visualization Public
  A simple website for visualizing course maps 🎓🗺.
- End-to-end-ASR-Pytorch-DLHLP Public
  Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
- End-to-end-ASR-Pytorch Public
  Forked from Alexander-H-Liu/End-to-end-ASR-Pytorch
  This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
- ICG2020Spring-HW1 Public
  🎨 HW1 (shading and transformation) of the course Interactive Computer Graphics, NTU CSIE.
  JavaScript · Updated Aug 30, 2020