-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedMar 14, 2025 -
-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedJun 12, 2023 -
scripts Public
Dump of all the scripts that are not part of any specific project.
Python Apache License 2.0 UpdatedMay 12, 2023 -
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedMar 14, 2023 -
nerfacc Public
Forked from nerfstudio-project/nerfaccA General NeRF Acceleration Toolbox in PyTorch.
Python MIT License UpdatedNov 27, 2022 -
nerfstudio Public
Forked from nerfstudio-project/nerfstudioA collaboration friendly studio for NeRFs
Python Apache License 2.0 UpdatedOct 31, 2022 -
drjit Public
Forked from mitsuba-renderer/drjitDr.Jit — A Just-In-Time-Compiler for Differentiable Rendering
C++ BSD 3-Clause "New" or "Revised" License UpdatedOct 26, 2022 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License UpdatedJul 27, 2022 -
metaseq Public
Forked from facebookresearch/metaseqRepo for external large-scale work
Python MIT License UpdatedJul 14, 2022 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of autoregressive language models.
Python MIT License UpdatedJun 17, 2022 -
svox2 Public
Forked from sxyu/svox2Plenoxels: Radiance Fields without Neural Networks, Code release WIP
Python BSD 2-Clause "Simplified" License UpdatedJun 10, 2022 -
eval_t0_deepspeed Public
Forked from yongzx/eval_t0_deepspeedEvaluate T0 with DeepSpeed
Python UpdatedMar 31, 2022 -
seqio Public
Forked from google/seqioTask-based datasets, preprocessing, and evaluation for sequence models.
Python Apache License 2.0 UpdatedDec 2, 2021 -
text-to-text-transfer-transformer Public
Forked from google-research/text-to-text-transfer-transformerCode for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
Python Apache License 2.0 UpdatedNov 26, 2021 -
FlexFlow Public
Forked from flexflow/flexflow-trainA distributed deep learning framework that supports flexible parallelization strategies.
C++ Apache License 2.0 UpdatedOct 8, 2021 -
bigscience Public
Forked from bigscience-workshop/bigscienceCodebase for the engineering/scaling WG
Shell Other UpdatedSep 24, 2021 -
Megatron-DeepSpeed Public
Forked from bigscience-workshop/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedSep 16, 2021 -
datasets Public
Forked from huggingface/datasets🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedAug 2, 2021 -
lxmls-toolkit Public
Forked from LxMLS/lxmls-toolkitMachine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School
Python MIT License UpdatedJul 14, 2021 -
promptsource Public
Forked from bigscience-workshop/promptsourceToolkit for collecting and applying templates of prompting instances
Python Apache License 2.0 UpdatedJun 14, 2021 -