Highlights
Stars
A toolkit to calculate speech audio quality. Not affiliated with the original authors
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement"
The backend of HTTP Toolkit
Electron wrapper to build and distribute HTTP Toolkit for the desktop
A python package to analyze and compare voices with deep learning
An encyclopedia of jailbreaking techniques to make AI models safer.
aider is AI pair programming in your terminal
Fine tuning grounding Dino
Codebase for Aria - an Open Multimodal Native MoE
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Open-source framework to review and patch code using your preferred LLM.
MongoDB + Haize = Safe & Secure RAG with RBAC
Continuation of an abandoned project fast-coco-eval
TF-ID: Table/Figure IDentifier for academic papers
Blazing GUI for easily viewing and interacting with computer vision models.
Parameterize Python scripts/notebooks all from the command line and run on cloud GPUs
Devon: An open-source pair programmer