Skip to content
View mohitpandey92's full-sized avatar
:dependabot:
Learning
:dependabot:
Learning

Block or report mohitpandey92

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mohitpandey92/README.md

Hi there, I'm Mohit Pandey πŸ‘‹

Welcome to my GitHub profile! Here you'll find my background and projects that I have been working on.

πŸ§‘β€πŸ’» About Me

I'm a Machine Learning Scientist with over five years of experience in deep learning, optimization algorithms, and software engineering. I'm passionate about leveraging these skills to analyze high-throughput biological datasets. My background includes a Ph.D. in Computational Statistical Physics and Biophysics, along with biotech industry experience in i) peptide and protein modeling, and ii) CRISPR-Cas gene-editing and epigenetic-editing medicine. I'm eager to collaborate on projects that use AI to accelerate drug discovery and improve patient outcomes

Source file: CV text


My ML research expertise areas:

  • Protein Language Models based on transformers: ESM-2 and ProtT5
  • Molecular modeling: 3D molecular structure, Physics-based molecular modeling and Cheminformatics
  • Generative AI: VAE, GAN and Diffusion
  • Graph Neural Network: message passing neural network, GATConv, GCNConv
  • Monte Carlo Sampling: Simulated annealing, Monte Carlo Markov Chain
  • Uncertainty ML quantification

Here is my CV link


  • πŸ”­ I’m currently working on: Pre-training protein langauge model for CRISPR-Cas enzyme engineering
  • 🌱 I’m learning: How to generate config files using Hydra and Omegaconf for tracking ML experiments
  • πŸ’¬ Ask me about: Deep Learning Algorithms applied to biology
  • ⚑ Fun fact: I love training new tricks to my rescue dog.

πŸ“‚ My Selected Projects

  • protein-ml-utils: A ML utility package for working with protein datasets
  • data-science-template: A Python package to create a quick template for starting a Data Science project
  • GNN-VAE: A custom-built graph neural network VAE (Generative AI) to generate peptide sequences with an alpha-helical constraint, which are predicted to have high cell permeability.
  • Finetuned pLM: Finetuned protein language model on proprietary CRISPR-Cas gene editing data

You can see more of my projects here.


🀝 Let's Connect

LinkedIn Email


β€œI may not have gone where I intended to go, but I think I have ended up where I needed to be." -Douglas Adam

Pinned Loading

  1. protein-mlutils protein-mlutils Public

    A Python package for ML utility functions

    Jupyter Notebook

  2. data-science-template data-science-template Public

    A Python package to create a quick template for starting a Data Science project

    Python

  3. python-logging-tutorial python-logging-tutorial Public

    Tutorial for Python logging module

    Jupyter Notebook

  4. mbellitti/wikiart-classifier mbellitti/wikiart-classifier Public

    Teaching a Convolutional Neural Network to recognize painting genre. Handcrafted dataset. Cool visualizations.

    Jupyter Notebook 10 1

  5. k_arm_bandit k_arm_bandit Public

    A simple exercise in reinforcement learning

    Jupyter Notebook