Highlights
Stars
A toolkit to calculate speech audio quality. Not affiliated with the original authors
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)
The backend of HTTP Toolkit
Electron wrapper to build and distribute HTTP Toolkit for the desktop
A python package to analyze and compare voices with deep learning
An encyclopedia of jailbreaking techniques to make AI models safer.
aider is AI pair programming in your terminal
Fine tuning grounding Dino
Codebase for Aria - an Open Multimodal Native MoE
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Agentic AI framework for enterprise workflow automation.
MongoDB + Haize = Safe & Secure RAG with RBAC
Continuation of an abandoned project fast-coco-eval
TF-ID: Table/Figure IDentifier for academic papers
Blazing GUI for easily viewing and interacting with computer vision models.
Devon: An open-source pair programmer
Open Source Auth Built on Freestyle: own your auth + data https://docs.freestyle.dev/guides/authentication/