Lists (1)
Sort Name ascending (A-Z)
Starred repositories
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos ma…
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
😎 A curated list of awesome MLOps tools
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴
A Python module for decorators, wrappers and monkey patching.
Run VSCode (codeserver) on Google Colab or Kaggle Notebooks
Advanced Deep Learning with Keras, published by Packt
In PyTorch Learing Neural Networks Likes CNN、BiLSTM
Time series forecasting with machine learning models
This repo is the home of the official documentation for Visual Studio.
(Deprecated) Scikit-learn integration package for Apache Spark
LAMA - automatic model creation framework
All the slides, accompanying code and exercises all stored in this repo. 🎈
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
PyTorch Dual-Attention LSTM-Autoencoder For Multivariate Time Series
Code and source for paper ``How to Fine-Tune BERT for Text Classification?``
Full stack, modern web application generator. Using Flask, PostgreSQL DB, Docker, Swagger, automatic HTTPS and more.
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Basic Discounted Cash Flow library written in Python. Automatically fetches relevant financial documents for chosen company and calculates DCF based on specified parameters.
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Train and run Pytorch models on Apache Spark.
Object-Oriented Programming concepts, with Python
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…