Stars
The headless browser for AI agents and web scraping
A fancy self-hosted monitoring tool
A community-supported supercharged document management system: scan, index and archive all your documents
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Example applications using the @scribeberry/sdk — medical transcription, AI note generation, and realtime speech-to-text.
🎥 Make videos programmatically with React
Curated tools, templates, and automation workflows for IPN founders. Includes n8n bots, real-time orchestration flows, and startup-ready AI integrations. Built for rapid prototyping, smart routing,…
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
A free, open source, and extensible speech-to-text application that works completely offline.
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
Animate shapes with hand gestures. Web tool built with threejs and mediapipe hand-tracking
Create and control 3D shapes using hand gestures in real-time. Built with mediapipe computer vision and threejs
Open-source AI coworker, with memory
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Build effective agents using Model Context Protocol and simple workflow patterns
🚀 The fast, Pythonic way to build MCP servers and clients.
Track stocks, crypto, and derivatives prices and positions in real time from your terminal
🔥 The API to search, scrape, and interact with the web for AI
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
Papermark is the open-source DocSend alternative with built-in analytics and custom domains.
On-device Speech AI for Apple Silicon
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.