Skip to content
View longcw's full-sized avatar

Block or report longcw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.

Python 1,228 118 Updated Apr 2, 2026

🎦 Micam 是一个专为小米摄像头设计的 RTSP 桥接服务(非官方),能够将小米摄像头的视频流本地转推到RTSP服务器,支持接入 HomeAssistant、Go2rtc、Frigate、Scrypted、Homekit 等多种NVR和智能家居系统。该项目采用 Docker Compose 快速部署方案,基于小米官方的Miloco,并集成Go2rtc实现RTSP流服务,无需GPU即可运行…

Python 727 41 Updated Feb 5, 2026

A Fully Self-Hosted Solution for Full-Duplex Voice Interaction

Python 524 42 Updated Sep 28, 2025

​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation

Python 6,367 1,131 Updated Dec 18, 2025
TypeScript 65 20 Updated Apr 28, 2026

bitHuman SDK examples

HTML 8 1 Updated Feb 4, 2026

🔊 让小爱音箱「听见你的声音」,解锁无限可能。

Rust 2,430 307 Updated Apr 4, 2026

[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.

Python 472 64 Updated Nov 10, 2025

OpenAI Agents adapter for Livekit

Python 7 2 Updated Jun 25, 2025

A tool for Container Debloating that removes bloat and improves performance.

Go 638 16 Updated Aug 12, 2025

LiveKit Agent integrated with MCP server of Home Assistant

Python 21 8 Updated May 25, 2025

Turns any OpenAI voice agent into a lively visual agent with bitHuman SDK

Python 3 Updated Apr 21, 2025

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 4,774 791 Updated Jan 4, 2026

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 5,660 807 Updated Sep 26, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,621 468 Updated Feb 16, 2026

coredumpy saves your crash site for post-mortem debugging

Python 757 20 Updated Jan 5, 2026

A lightweight, powerful framework for multi-agent workflows

Python 25,479 3,885 Updated Apr 28, 2026

Voice activity detector (VAD) for the browser with a simple API

TypeScript 1,947 260 Updated Jan 30, 2026

Agno turns agents into production software. Build agents in any framework. Run as a service. Ship to real users.

Python 39,730 5,305 Updated Apr 28, 2026

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,414 99 Updated Sep 21, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 79,183 8,965 Updated Apr 28, 2026

Run frontier AI locally.

Python 44,168 3,099 Updated Apr 28, 2026

A framework for building realtime voice AI agents 🤖🎙️📹

Python 10,253 3,073 Updated Apr 28, 2026

Human: AI-powered 3D Face Detection & Rotation Tracking, Face Description & Recognition, Body Pose Tracking, 3D Hand & Finger Tracking, Iris Analysis, Age & Gender & Emotion Prediction, Gaze Tracki…

HTML 3,108 425 Updated Dec 13, 2025

Playground Web UI using segment-anything-2 models from the Meta.

Python 57 6 Updated Dec 4, 2024

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

C 7,883 405 Updated Apr 25, 2026

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,400 2,117 Updated Apr 20, 2026
Next