Hi, I'm Himanshu Wagh
I'm a Data Science professional and M.S. graduate, passionate about building intelligent, data-driven systems. My work spans AI research, NLP, and generative modeling—where I turn cutting-edge theory into real-world applications.
With a strong foundation in Machine Learning, MLOps, and Python engineering, I’ve developed scalable AI solutions through academic research and industry collaboration. I’m currently seeking roles in Machine Learning, Applied AI, or Data Science to push the boundaries of intelligent automation.
📧 Let's connect and build something intelligent together. Email | LinkedIn
Programming Skills
Publication
DyGAF: Dynamic Graph Attention Framework for COVID-19 Biomarker Identification
Bioinformatics and Biology Insights (2024) | DOI : https://doi.org/10.1177/11779322251325390
- Developed an attention-based neural model for biomarker detection and COVID-19 diagnostics using gene expression data.
- Achieved 94.23% classification accuracy by combining deep learning with traditional ML feature selection methods.
Made the model and codebase publicly available: GitHub Repository - DyGAF
Professional Experience
Graduate Research Assistant
- Designed and implemented machine learning models to analyze complex genomic data, achieving 93% accuracy in predicting genetic interactions
- Preprocessed large-scale biomedical datasets from NCBI, applying techniques like TMM normalization and advanced feature engineering
Graduate Research Assistant
- Developed DyGAF, an attention-based neural model for COVID-19 biomarker ranking — published in Bioinformatics and Biology Insights (2024)
Software Engineer
- Built an NLP pipeline for real-time receipt data extraction using Named Entity Recognition (NER), improving processing speed by 30%
- Contributed to backend architecture in Python and C++ for scalable document analysis
- Integrated AI models into production systems in collaboration with DevOps and product teams
Education
Michigan Technological University
Savitribai Phule Pune University
Projects
SKILL BRIDGE
SkillBridge is a project that connects skills with opportunities, providing tooling and workflows to match, evaluate, and manage skill-based collaborations.
GPT-OSS FROM SCRATCH
Built the entire transformer architecture from the ground up in PyTorch. GPT-OSS - a complete implementation that actually outperforms the original GPT-2 architecture on validation loss. Full transformer architecture and trained on TinyStories dataset.
RESEARCH PAPER TO BLOG
This project is a Flask-based web application that converts research papers into detailed blog posts. The application extracts text from the PDF, processes it using OpenAI's GPT-4 model, and generates a well-structured blog post in LaTeX format.
ViT FROM SCRATCH
Replicated and creating a Vision Transformer (ViT) from scratch using PyTorch. Then trained ViT, a state-of-the-art computer vision architecture, to perform on Sports Classification Mini problem.
IceCube - Neutrinos in Deep Ice
This project was part of the kaggle competition, goal of which is to predict a neutrino particle’s direction. Develop a model based on data from the "IceCube" detector, which observes the cosmos from deep within the South Pole ice.