Skip to content
View cweill's full-sized avatar
🤗
🤗

Block or report cweill

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

What would you do with 1000 H100s...

Jupyter Notebook 1,098 68 Updated Jan 10, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 44,338 7,529 Updated Dec 9, 2024

Automatically generate Go test boilerplate from your source code.

Go 5,106 344 Updated Sep 12, 2023

The smartest way to learn touch typing and improve your typing speed.

TypeScript 3,470 331 Updated Sep 8, 2025

🔥Highlighting the top ML papers every week.

11,901 725 Updated Jul 20, 2025

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Jupyter Notebook 2,260 157 Updated Sep 14, 2025

Collect data intelligently from your robots.

C++ 25 1 Updated Mar 30, 2023

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Python 860 36 Updated Jul 3, 2023

A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.

Python 1,467 371 Updated Aug 1, 2024

🦉 Data Versioning and ML Experiments

Python 14,878 1,249 Updated Sep 12, 2025

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Python 647 110 Updated Jan 4, 2023

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.

Python 840 139 Updated Feb 25, 2025

NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (AutoML) pipelines.

Jupyter Notebook 43 6 Updated Mar 10, 2021

A distributed task scheduler for Dask

Python 1,648 736 Updated Sep 14, 2025

Parallel computing with task scheduling

Python 13,482 1,797 Updated Sep 10, 2025

Windows Subsystem for Linux

C++ 29,824 1,474 Updated Sep 13, 2025

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 8,297 4,402 Updated Sep 14, 2025

TFX is an end-to-end platform for deploying production ML pipelines

Python 2,162 722 Updated Jun 18, 2025

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,844 510 Updated Oct 23, 2024

Fast and Accurate ML in 3 Lines of Code

Python 9,381 1,054 Updated Sep 8, 2025

A Hyperparameter Tuning Library for Keras

Python 2,901 402 Updated Sep 2, 2025

An Emacs framework for the stubborn martian hacker

Emacs Lisp 20,942 3,124 Updated Sep 13, 2025

This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"

Python 190 23 Updated Mar 18, 2019

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

C++ 1,449 287 Updated Aug 30, 2024

Compare GAN code.

Python 1,818 314 Updated Jan 31, 2021

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 38,919 6,796 Updated Sep 14, 2025

Fast and flexible AutoML with learning guarantees.

Jupyter Notebook 3,458 531 Updated Nov 30, 2023

A scikit-learn compatible neural network library that wraps PyTorch

Jupyter Notebook 6,112 404 Updated Aug 15, 2025

Deep universal probabilistic programming with Python and PyTorch

Python 8,857 998 Updated Jul 9, 2025

DeepArchitect: Automatically Designing and Training Deep Architectures

Python 147 20 Updated Oct 1, 2019
Next