-
Indian Association for the Cultivation of Science
- Kolkata
-
23:50
(UTC +05:30)
Highlights
- Pro
-
-
-
modded-nanogpt Public
Forked from KellerJordan/modded-nanogptNanoGPT (124M) in 3 minutes
Python MIT License UpdatedFeb 11, 2025 -
coconut Public
Forked from facebookresearch/coconutTraining Large Language Model to Reason in a Continuous Latent Space
Python MIT License UpdatedJan 24, 2025 -
-
cleanrl Public
Forked from vwxyzjn/cleanrlHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python Other UpdatedJan 9, 2025 -
google-summer-of-code Public
Forked from rust-lang/google-summer-of-codeRust project ideas for Google Summer of Code
1 UpdatedJan 3, 2025 -
agents Public
Forked from huggingface/smolagents🤗 Agents: the simplest building blocks to build and run LLM agents.
Python Apache License 2.0 UpdatedDec 25, 2024 -
ml-gsm-symbolic Public
Forked from apple/ml-gsm-symbolicGSM-Symbolic templates and generated data
Other UpdatedDec 8, 2024 -
smol-course Public
Forked from huggingface/smol-courseA course on aligning smol models.
Jupyter Notebook Apache License 2.0 UpdatedDec 3, 2024 -
entropix Public
Forked from xjdr-alt/entropixEntropy Based Sampling and Parallel CoT Decoding
Python Apache License 2.0 UpdatedNov 13, 2024 -
smoltropix Public
Forked from smolorg/smoltropixMLX port for xjdr's entropix sampler (mimics jax implementation)
Python Apache License 2.0 UpdatedNov 4, 2024 -
Differential-Transformer-PyTorch Public
Forked from nanowell/Differential-Transformer-PyTorchPyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture in…
Python MIT License UpdatedOct 27, 2024 -
mlx-examples Public
Forked from ml-explore/mlx-examplesExamples in the MLX framework
Python MIT License UpdatedOct 25, 2024 -
LLM-Drop Public
Forked from CASE-Lab-UMD/LLM-DropThe official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Python Apache License 2.0 UpdatedOct 21, 2024 -
inspect_ai Public
Forked from UKGovernmentBEIS/inspect_aiInspect: A framework for large language model evaluations
Python MIT License UpdatedOct 20, 2024 -
equinox Public
Forked from patrick-kidger/equinoxElegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
Python Apache License 2.0 UpdatedOct 18, 2024 -
-
concordia Public
Forked from rstrivedi/concordiaA library for generative social simulation
Python Apache License 2.0 UpdatedOct 3, 2024 -
improving-RAG-systems-dhs2024 Public
Forked from dipanjanS/improving-RAG-systems-dhs2024This repository will contain the presentation and python jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-world Retrieval Augmented Generation Systems, focusing on the…
Jupyter Notebook GNU General Public License v3.0 UpdatedSep 27, 2024 -
High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
Zig Apache License 2.0 UpdatedSep 25, 2024 -
Metal-Puzzles Public
Forked from abeleinin/Metal-PuzzlesSolve Puzzles. Learn Metal 🤘
Jupyter Notebook MIT License UpdatedSep 24, 2024 -
easy-problems-that-llms-get-wrong Public
Forked from autogenai/easy-problems-that-llms-get-wrongJupyter Notebook UpdatedSep 18, 2024 -
-
attention-output-saes Public
Forked from ckkissane/attention-output-saesCode to reproduce key results for "Interpreting Attention Layer Outputs with Sparse Autoencoders"
HTML UpdatedJul 27, 2024 -
typst Public
Forked from typst/typstA new markup-based typesetting system that is powerful and easy to learn.
Rust Apache License 2.0 UpdatedJul 20, 2024 -
-
VAE-Pytorch-ASL Public
Forked from explainingai-code/VAE-PytorchThis repository implements a simpleVAE for training on CPU on the MNIST dataset and provides ability to visualize the latent space, entire manifold as well as visualize how numbers interpolate betw…
Python MIT License UpdatedJun 26, 2024 -
Harkirat-Singh-course_code_and_notes Public
Forked from SartHak-0-Sach/Harkirat-Singh-course_code_and_notesJavaScript MIT License UpdatedJun 18, 2024 -