Stars
A comprehensive ComfyUI toolkit for video generation, image editing, and audio-driven lip‑sync, featuring Flux, LTXV, Wan2.2 and advanced batch workflows.
AI一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper,GPTSoVITS,支持云语音:Azure,阿里云,腾讯云。支持Stable diffusion,comfyUI直接AI生图。Generate short videos with one click using A…
基于MoneyPrinterTurbo,AI生成分镜大纲与视频(动态,不是念ppt),接入万相通义wan2.1 ai文生视频、图生视频功能,灵活把控视频生成。Based on MoneyPrinterTurbo, AI generates image outline and video (dynamic, not ppt), and integrates wan2.1 text-to-vid…
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
Digital Human Resource: 2D/3D/4D Human Modeling, Avatar Generation & Animation, Clothed People Digitalization, Virtual Try-On, etc.
Real time interactive streaming digital human
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
FP-Studio / framepack-studio
Forked from lllyasviel/FramePackExpanding FramePack into a multifunction video creation tool
Instant voice cloning by MIT and MyShell. Audio foundation model.
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and multilingual. Integrate on your .NET projects using a plug-and-play NuGet package, complete with all voices.
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.
Run GGUF models on your android app with ease!
Open source real-time translation app for Android that runs locally
关于本地离线翻译程序,支持文本翻译,下划线翻译,屏幕截图翻译,语音(音频文件)翻译,视频翻译,txt文件,PPT,Word,PDF,Excel,图片翻译。资源
shixiangcap / llama-jni
Forked from ggml-org/llama.cppAndroid JNI for port of Facebook's LLaMA model in C/C++
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Computer Graphics Practice The Tutorial(Visual C++ Edition)