Open to AI/ML roles · US work authorized · No sponsorship required
Vikhyat Chauhan

Vikhyat Chauhan

Hi, I'm Vikhyat — an LLM & RAG infrastructure engineer who likes shipping AI that has to actually work on real hardware, for real people.

I spent two years at GE HealthCare getting FDA-regulated MRI models onto Kubernetes — the kind of work where a missed edge case shows up in an operating room, so you learn to care a lot about latency, reproducibility, and the boring parts of MLOps. These days I'm finishing my M.S. in Computer Engineering at Virginia Tech (GPA 4.0), researching brain-inspired autonomous systems and building RAG pipelines from scratch — no LangChain, just the parts I actually need. Outside of work I'm usually tinkering with multi-agent setups, reading neuroscience papers I'm only half-qualified for, or chasing whatever rabbit hole the latest paper sends me down.

2024 — Present

Graduate AI/ML Researcher

· Virginia Tech — Blacksburg, VA
  • Designed and experimentally validated a brain-inspired UAV navigation system achieving 34.6% reduction in compute energy and elapsed time over baseline (p < 0.0001, n=1,000 runs, 3 real-world-mapped environments) - NSF-funded.
  • Architected a LangGraph + Llama 3 multi-agent system for grammar-constrained SDF XML generation, automating 100+ simulation environments and eliminating 40 hours/week of manual authoring.
2022 — 2024

Software Engineer II

· GE HealthCare — Bengaluru, IN
  • Led the technical architecture migration from a legacy C++ monolith to Kubernetes-based Java/Spring Boot services, reducing MRI application runtime by 80%, by designing and implementing image normalization and segmentation algorithms in C++.
  • Profiled and optimized a Swin Transformer segmentation model end-to-end - ONNX export, FP16 precision tuning, custom CUDA/C++ plugin authoring for unsupported ops, and layer fusion via TensorRT builder API - achieving 20% throughput uplift on FDA-regulated hardware enabling real-time intraoperative use.
  • Developed a multimodal MRI report generation system (MedCLIP, PyTorch, TensorRT) fusing scan sequences with patient history, achieving quantifiable impact with a 6.5% improvement in diagnostic classification F1 over baselines.
  • Introduced automated quality gates and CI/CD pipelines (Jenkins, SonarQube) for production deployment and optimization, reducing release defects by 60% while maintaining 0 patient-safety regressions across 12 quarterly releases.
  • Recognized with the GE Impact Award for independently modernizing a critical legacy MR imaging application to production microservices, directly improving clinical software delivery at scale.
2020 — 2022

Software Engineer I

· TNM Electronics — Delhi NCR, IN
  • Owned the full platform from day one; architected an AWS-backed microservice backend (EC2, Lambda, RabbitMQ, MongoDB, MQTT broker), scaling from 0 to 1000+ devices while sustaining 99.9% uptime.
  • Co-invented and patented a master-slave microcontroller communication system for home automation, cutting final product BOM cost 80% and directly enabling first design-win customers.
  • Reduced field bug reports 98% vs. prior manual integration method by designing a structured hardware-software validation workflow as the company’s sole engineer.
2018 — 2020

Research Intern

· Mahindra Susten — Pune, IN
  • Built a Linux-based autonomous quadcopter controller (MAVLink + advanced control) and integrated it into Mahindra Susten’s solar ops—automating thermal fault detection and reporting, delivering a 35% reduction in labor costs and faster issue triage.
Personal · Production

Professional RAG Pipeline

Two-stage retrieval (BGE-base → MS-MARCO cross-encoder) with SHA-256 semantic cache and per-query token tracking. Built from scratch — no LangChain — and deployed as a containerized service.

Python ·FastAPI ·BGE ·Cross-Encoder ·Cloud Run ·Docker
−30%
API overhead
50+
Eval queries
LLM-judge ↑
Faithfulness
Virginia Tech · NSF-funded

Multi-Agent LLM for Simulation Synthesis

LangGraph + Llama 3 multi-agent pipeline for grammar-constrained SDF XML generation, automating authoring of 100+ ROS 2 / Gazebo environments with stateful task graphs.

LangGraph ·Llama 3 ·ROS 2 ·Gazebo ·Python
−40 hrs/wk
Manual work
100+
Envs
Local
Inference
Virginia Tech · Personal

Physiological Stress Classifier

Developed a Conv1D Autoencoder-based multimodal pipeline on the WESAD dataset for unsupervised compression and three-class emotion classification of physiological signals (BVP, EDA, EMG, TEMP), achieving 83.5% accuracy across 15 subjects.

Python ·PyTorch ·scikit-learn ·NumPy ·Pandas
83.5%
Accuracy
15
Subjects
4-modal
Signals
GE HealthCare · Production

Medical Imaging Inference Pipeline

TensorRT-optimized Swin Transformer UNETR for FDA-regulated MRI. ONNX export, FP16 with selective FP32, custom CUDA plugins, and TensorRT layer fusion enabling real-time intraoperative use.

PyTorch ·TensorRT ·CUDA/C++ ·ONNX ·Kubernetes ·IEC 62304
+20%
Throughput
+80% faster
Deploys
−60%
Defects
Virginia Tech · M.S. Thesis

CANavigator — Brain-Inspired Drone Navigation Framework

Conflict-resolution framework on a MISD architecture for 3D autonomous UAV navigation. Validated across three real-world-mapped simulation environments with statistically significant gains.

Python ·MISD ·Control Systems ·Simulation
−34.5%
Energy
−34.6%
Time
p < 0.0001
Significance
Aug 2024 — May 2026

M.S., Computer Engineering (Thesis Track)

· Virginia Polytechnic Institute and State University — Blacksburg, VA
  • Defended M.S. Thesis on Brain-Inspired Drone Navigation System using Conflict Architecture — 4.0 GPA.
  • Coursework: CS5424 Advanced Machine Learning, CS5465 Applications of Machine Learning, ECE5504 Computer Architecture.
  • Honors: Phi Kappa Phi (top 10% of Master’s graduates) and Omicron Delta Kappa National Leadership Honor Society, Virginia Tech, 2026.
Jul 2016 — Dec 2020

B.E., Electronics and Communication Engineering

· Uttarakhand Technical University — Uttarakhand, IN
  • Graduated with First Division.

I'm focused on LLM & RAG infrastructure roles — production retrieval systems, eval pipelines, and inference cost / latency optimization. Background in FDA-regulated medical AI is a bonus for clinical-grade deploys. US-authorized, no sponsorship needed, open to relocation.

LinkedIn vikhyat-chauhan
Location Blacksburg, VA · Open to relocation