Skip to content
View GuodongQi's full-sized avatar
🎯
Focusing
🎯
Focusing
  • ZheJiang University

Block or report GuodongQi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"

Python 155 17 Updated Mar 3, 2026

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,591 349 Updated Jun 21, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 24,134 2,776 Updated Mar 12, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 10,988 1,440 Updated Mar 17, 2026

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …

Python 197 13 Updated Apr 13, 2026

Train the next generation of TTS systems.

Python 170 17 Updated Sep 13, 2024

Fast and memory-efficient exact attention

Python 23,552 2,645 Updated Apr 26, 2026

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 912 62 Updated Apr 9, 2026

Open-Source Frontier Voice AI

Python 42,165 4,832 Updated Apr 24, 2026

The best ChatGPT that $100 can buy.

Python 52,567 7,019 Updated Apr 14, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,979 6,214 Updated Apr 19, 2026

pyright fork with various type checking improvements, improved vscode support and pylance features built into the language server

TypeScript 3,286 113 Updated Apr 20, 2026

12306接口抢票

Python 41 14 Updated Sep 19, 2025

Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.

Svelte 6,459 1,083 Updated Apr 24, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 20,219 2,490 Updated Mar 16, 2026

Awesome speech/audio LLMs, representation learning, and codec models

1,224 74 Updated Apr 4, 2026

Audio Large Language Models

Python 912 47 Updated Jul 5, 2025

Your one-stop solution for voice dataset creation

Python 130 24 Updated Dec 10, 2023

Towards Human-Sounding Speech

Python 6,111 520 Updated Dec 5, 2025

SOTA Open Source TTS

Python 29,962 2,524 Updated Apr 6, 2026

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 511 68 Updated Dec 22, 2025

Text Normalization & Inverse Text Normalization

Python 754 102 Updated Feb 27, 2026

基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。

Python 601 76 Updated May 18, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,109 85 Updated Dec 23, 2024

Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"

Python 115 13 Updated Oct 16, 2025

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 545 51 Updated May 22, 2023

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".

235 12 Updated Apr 21, 2026

Added vLLM support to IndexTTS for faster inference.

Python 1,134 157 Updated Apr 13, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,768 2,387 Updated Mar 16, 2026
Next