Lists (2)
Sort Name ascending (A-Z)
Stars
Emdash is an orchestration layer for running multiple coding agents in parallel in Git worktrees
Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.
generate music visualizations in the browser with threejs
Multi-agent AI coding platform powered by Vercel Sandbox and AI Gateway
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Automatic Speech Recognition in Python using ONNX models
A markdown editor, which is lighter, smarter and purer. 一个 Markdown 编辑器,但是更轻快、更智能、更纯粹。
VoiceTypr - AI powered voice to text dictation tool for busy founders, vibe coders, AI power users on macos, windows. Alternative to wispr flow and superwhisper.
Official implementation of SIGGRAPH 2025 paper "Image-GS: Content-Adaptive Image Representation via 2D Gaussians"
The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, and conversations without anyone knowing. Built with Tauri fo…
An OS for your agents, built for your pocket.
Bash script using ADB to enhance smoothness and performance on android devices
Examples of using Soniox client libraries in different programming languages
🔥 A tool to analyze your website's AI-readiness, powered by Firecrawl
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
This repo is a mirror of the contents of base-action in https://github.com/anthropics/claude-code-action.
Claudable is an open-source web builder that leverages local CLI agents, such as Claude Code, Codex, Gemini CLI, Qwen Code, and Cursor Agent, to build and deploy products effortlessly.
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback…
Leantime is a goals focused project management system for non-project managers. Building with ADHD, Autism, and dyslexia in mind.
The only PR bot that actually tests your code.
A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
A command-line interface tool for serving LLM using vLLM.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Create/Sell courses and digital downloads and publish blogs on your own branded website. An open source alternative to Teachable, Thinkific, Podia and the likes.
Create subtitles in various languages in mere minutes using Whisper and Qwen3-32b via Groq's lightning-fast inference.
Cursor for design - Open Source
Modern YouTube downloader with a clean PySide6 interface. Download videos in any quality, extract audio, fetch subtitles, sponsorBlock, and view video metadata. Built with yt-dlp for reliable perfo…