Shubham Gaur

Logo

View My GitHub Profile

Shubham Gaur

AI/ML Research Engineer • MS NLP @ UC Santa Cruz

🌍 San Francisco Bay Area, CA, USA
📧 sgaur2@ucsc.edu
📱 (408) 640-5717
🔗 LinkedIn | GitHub | Website | Google Scholar


🎯 About Me

Graduate Student in Natural Language Processing at UC Santa Cruz (Silicon Valley Campus) with 5+ years of industry experience at Nokia, Adani Group, and BlackRock. Currently based in Santa Clara, CA, working on advancing generalist AI through multimodal systems and agentic AI.

Research Interests:


🎓 Education

University of California, Santa Cruz (UCSC) | Santa Clara, CA

M.S. in Natural Language Processing (Major: AI) | Sept 2024 - Dec 2025

SRM Institute of Science & Technology (SRMIST) | Chennai, India

B.S. in Information Technology (Major: ML) | Jul 2015 - May 2019


💼 Work Experience

🔵 Nokia | Machine Learning Intern | Jun 2025 - Present

Naperville, USA

🟢 Adani Group | Lead Machine Learning Engineer | Nov 2021 - Aug 2024

Gurgaon, India

🔴 BlackRock | Machine Learning Engineer | Jul 2019 - Oct 2021

Mumbai, India


🔬 Research Projects

🤖 GUI Agents (in collaboration with Samsung Research America)

May 2025 - Present

🏠 HomeHelper - Multimodal Agent for Appliance Troubleshooting

Apr - June 2025

📊 Evaluation of Faithfulness over ARMs and Diffusion Language Models | Code

Apr - June 2025

🎨 Leveraging LLMs and VLMs for Idiomatic Understanding | Code

SemEval’25 Research


📝 Publications

  1. “Leveraging LLMs and VLMs for idiomatic understanding” | ACL’25 Workshop (SemEval) Paper
    Judith Clymo, Adam Zernik, Shubham Gaur

  2. “Advancing Web-Based Visual Question Answering with Efficient Image-Text Alignment” | ICRAAI’24 Paper
    Saketh Kilaru, Shubham Gaur, Spandan Rout

  3. “Extraction of Cumulative Blobs Using Dynamic Gestures” | IJSR’21 Paper
    Rishabh Naulakha, Shubham Gaur, Dhairya Lodha, Mehek Tulsyan, Utsav Kotecha


🛠️ Technical Skills

Programming & Data Science

Python R Java C++ SQL

Libraries: pandas, NumPy, SciPy, scikit-learn, Matplotlib, Jupyter

Machine Learning & AI

PyTorch TensorFlow HuggingFace

Specializations: Computer Vision (CNN, OpenCV), Reinforcement Learning, LoRA/PEFT, Diffusion Models

Generative AI & LLMs

Models: BERT, GPT-3/4, LLaMA, CLIP, ALIGN
Frameworks: LangChain/LangGraph, AutoGen, Retrieval Augmented Generation
Vector Databases: Pinecone, Qdrant
Techniques: Supervised Fine Tuning, RLHF, PPO, Instruction-Tuned LLMs

Big Data & Cloud

Azure AWS GCP

Technologies: Apache Spark, Hadoop, Kafka, Databricks, BigQuery, Synapse, Data Factory

Deployment & MLOps

Docker Kubernetes

Tools: FastAPI, MLflow, Airflow, CI/CD, Redis, PostgreSQL, MySQL, MongoDB


🏆 Key Achievements


📞 Let’s Connect!

I’m always interested in discussing AI research, collaboration opportunities, or innovative projects. Feel free to reach out!


“Advancing AI systems that understand, reason, and act across multiple modalities with human-level reliability.”