-
AWS
- London, UK
- @uros_lipovsek
Stars
A modern replacement for Redis and Memcached
Productive, portable, and performant GPU programming in Python.
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
A retargetable MLIR-based machine learning compiler and runtime toolkit.
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Collective communications library with various primitives for multi-machine training.
A library for distributed ML training with PyTorch
A tensor-aware point-to-point communication primitive for machine learning
Unreal Engine plugin for easy creation of synthetic image datasets
This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
Large scale graph learning on a single machine.
Tensorflow Fork to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines". See https://github.com/mkuchnik/PlumberApp for directions.