Skip to content
View Robinatp's full-sized avatar

Block or report Robinatp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
140 stars written in Python
Clear filter

High-Resolution Image Synthesis with Latent Diffusion Models

Python 41,808 5,332 Updated Jun 25, 2025

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python 26,374 5,442 Updated Nov 20, 2023

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Python 25,355 11,712 Updated Jun 7, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,820 1,733 Updated Sep 28, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,780 3,115 Updated Sep 30, 2025

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 15,077 3,582 Updated Sep 23, 2025

Face recognition using Tensorflow

Python 14,195 4,811 Updated Jul 24, 2023

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 12,428 1,318 Updated Sep 29, 2025

A paper list of object detection using deep learning.

Python 11,416 2,774 Updated Feb 12, 2024

Spark-TTS Inference Code

Python 10,544 1,120 Updated Apr 9, 2025

A PyTorch-based Speech Toolkit

Python 10,504 1,562 Updated Sep 25, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,319 1,294 Updated Apr 24, 2024

Python library for audio and music analysis

Python 7,905 1,008 Updated Sep 16, 2025

Text-audio foundation model from Boson AI

Python 7,383 532 Updated Sep 15, 2025

Towards Human-Sounding Speech

Python 5,600 468 Updated May 6, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,817 1,156 Updated Sep 25, 2025

《21个项目玩转深度学习———基于TensorFlow的实践详解》配套代码

Python 4,624 1,764 Updated Mar 18, 2019

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,415 1,308 Updated May 21, 2023

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Python 3,991 793 Updated Oct 8, 2021

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,982 811 Updated Jul 5, 2024

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Python 3,934 842 Updated Sep 30, 2022

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,683 277 Updated Sep 26, 2025

Mask RCNN in TensorFlow

Python 3,097 1,092 Updated Jan 5, 2021

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,056 337 Updated Jun 27, 2025

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,984 313 Updated Sep 13, 2025

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,743 248 Updated Jun 25, 2025

Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!

Python 2,523 876 Updated Apr 22, 2021

Unofficial implemention of lanenet model for real time lane detection

Python 2,499 904 Updated Dec 8, 2023

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,459 512 Updated Jun 13, 2025

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 2,316 583 Updated Apr 23, 2020
Next