Computer Vision & ML Engineer architecting high-throughput systems for edge and cloud scale.

I engineer end-to-end vision pipelines optimized for low-latency inference, from custom data engines to hardware-constrained deployment.

MS Computer Science • AI/ML + Vision • Cloud/Engineering delivery

TensorRT • PyTorch • CUDA / C++ • ONNX Edge Deployment • Sensor Fusion • Distributed Inference

Featured Work

End-to-end systems built for performance and scale.

AI Highway Safety & Incident Detection Platform

Production-grade vehicle detection, tracking, and near-miss classification deployed on Google Cloud Run.

YOLOv8s • GCP • FastAPI

OCR Deep Learning Pipeline

Parallelized extraction architecture processing 50k+ documents daily using optimized ONNX graphs and Triton Server.

OCR • ONNX • Triton

RAG-Based Enterprise Search

Ingest documents, optimize retrieval, evaluate relevance, and deliver a chat-style search experience.

GenAI • RAG • Evaluation

Edge Vision Systems

Optimization of object detection and segmentation networks via FP16/INT8 quantization, TensorRT compilation, and custom C++ inference pipelines.

Data Engine Architecture

Designing active learning loops that systematically target edge cases, generating synthetic data, and managing heavily imbalanced distributions.

High-Throughput ML Infrastructure

Deploying Triton Inference Servers, building distributed data ingestion pipelines, and maintaining observability against strict SLAs in cloud environments.

Engineering Philosophy

I’m an optimization-obsessed engineer who bridges the gap between research and strict physical hardware limits. Whether it’s writing custom CUDA kernels to shave off milliseconds or architecting cloud pipelines to serve millions of predictions daily, I build systems that perform at scale.

✓ Obsesses over inference constraints: memory footprints, FLOPs, and millisecond latency targets.

✓ Bridges Python prototyping with highly optimized C++ and TensorRT deployment graphs.

✓ Treats data engines and active learning loops as first-class citizens in the engineering lifecycle.

✓ Architects robust, distributed cloud infrastructure capable of meeting high-criticality SLAs.