Stars
Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
This project aims to share technical principles and hands-on experience related to large language models (LLM engineering, LLM application deployment)
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …
Fast and memory-efficient exact attention
A powerful 3B-parameter, LLM-based reinforcement-learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
pyright fork with various type-checking improvements, improved VS Code support, and Pylance features built into the language server
Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Awesome speech/audio LLMs, representation learning, and codec models
Your one-stop solution for voice dataset creation
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
Text Normalization & Inverse Text Normalization
Provides high-quality Chinese speech synthesis and voice cloning services based on models such as SparkTTS and OrpheusTTS.
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"
How to use our public wav2vec2 dimensional emotion model
This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".
Adds vLLM support to IndexTTS for faster inference.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.