Skip to content
View Rish-01's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Rish-01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Rish-01/README.md

🤖 Rishab Sharma | Machine Learning, Deep Learning, Computer Vision

M.S. in Computer Science @ UMass Amherst | Ex-Predoctoral Research Fellow @ IISc

I am a computer science student interested in Representation Learning, Generative Modeling, and Probabilistic Machine Learning.


🔬 Research Focus & Technical Stack

Category Skills & Tools
🧠 Core Theory Generative Models (DDPM, VAE, GANs), Probabilistic ML (VI, EM, Bayesian Methods), Representation Learning, Optimization, Deep Metric Learning, XAI (GradCAM)
💻 Frameworks PyTorch, JAX, Scikit-Learn, HuggingFace Transformers
🛠️ Languages Python, C++, Java, SQL, Bash
⚙️ Infrastructure Git, Docker, Linux, Spark, Hadoop, TensorBoard
📝 Specialized LaTeX, FastAPI, Pandas, NumPy, OpenCV

🌟 Featured Projects & Publications

1. Learning Low-Rank Latent Spaces with Autoencoders

  • Publication: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)
  • Developed nuclear norm–regularized autoencoders for low-rank latent spaces.
  • Achieved FID (11.09 on MNIST) by fitting Gaussian Mixture Models (GMMs) in the latent space, demonstrating superior generative performance.

2. Explainable Schizophrenia Detection (Thesis)

  • Bachelors Thesis Report | GitHub
  • Built a binary classifier for EEG signals by transforming data into wavelet scalograms and applying ResNet-18 transfer learning.
  • Achieved 91.59% accuracy and utilized GradCAM to provide critical explainability for the medical diagnosis.

3. Speech-Text Fusion for Word-Level Stress Detection

  • GitHub
  • Developed a multimodal model using cross-attention to merge features from wav2vec 2.0 (acoustic) and BERT (textual).
  • Resulted in a 24% boost in F1-score for word-level stress detection over unimodal baselines, validating the multimodal approach.

4. Generative Models Implementation

  • Implemented core generative architectures, including Denoising Diffusion Probabilistic Models (DDPM) using a U-Net with spatial attention, and Vanilla GANs for data distribution mimicry.

  • (Other projects include work on Image Captioning (CNN+LSTM) and Siamese CNN for Change Detection.)

🔗 Connect

Platform Link
LinkedIn linkedin.com/in/rish01/
ML Blog rish-01.github.io/blog/ — My notes on VAEs, MLE, and Probabilistic Models.
Email rishab2001rs@gmail.com

Pinned Loading

  1. Schizo-xai Schizo-xai Public

    Schizophrenia detection using wavelet transforms plus gradcam explainability on the scalograms

    Jupyter Notebook 1 1

  2. Multimodal-Word-Stress-Detection Multimodal-Word-Stress-Detection Public

    Detecting word-level stress in English speech using wav2vec 2.0, with extensions to multimodal speech+text models via cross-attention fusion with BERT.

    Jupyter Notebook

  3. PyTorch-DDPM PyTorch-DDPM Public

    A PyTorch implementation of DDPM

    Python 1

  4. PyTorch-GANs PyTorch-GANs Public

    PyTorch implementation of Vanilla GAN architecture

    Jupyter Notebook 1

  5. PyTorch-Siamese-CNN PyTorch-Siamese-CNN Public

    Pytorch implementation of Deep Siamese Convolutional Networks which gives the changes between old and new aerial images

    Jupyter Notebook 1

  6. PyTorch-Transformer PyTorch-Transformer Public

    An implementation of the paper "Attention is all you need" using PyTorch

    Jupyter Notebook