Lists (2)
Sort Name ascending (A-Z)
Stars
Open Source framework for voice and multimodal conversational AI
OCR model that handles complex tables, forms, handwriting with full layout.
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Pyt…
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Open-source web dashboard for Vexa – manage meeting transcriptions, view real-time transcripts, and chat with your meetings using AI.
Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time WebSocket transcripts, MCP server for AI agents. Self-host or use hosted SaaS.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Give your AI agent n8n superpowers. 537 nodes with full schemas, 7,700+ templates, Git-like sync, and TypeScript workflows.
Free MaxMind GeoLite2-Country database for IP geolocation. Ultra-lightweight (~2MB), auto-updated via jsDelivr CDN.
ElevenLabs NextJS playground
Mass Mail designed for sending bulk emails efficiently, enabling effective communication for email marketing campaigns.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
VibeBox is a powerful three-agent development environment that combines Claude Code, Cursor, and Task Master AI to work together on complex software development tasks.
VibeStage is an audience engagement tool designed for presentations and workshops.
🔥 The API to search, scrape, and interact with the web for AI
A simple screen parsing tool towards pure vision based GUI agent