Computer Vision Engineer & ML Researcher
Engineering production-ready AI solutions with expertise in Computer Vision, Natural Language Processing, and scalable MLOps infrastructure. Passionate about transforming cutting-edge research into real-world applications.
Bridging the gap between advanced AI research and practical business solutions
Computer Vision Engineer & ML Researcher with 1+ years of experience building production-ready AI solutions. I specialize in developing high-accuracy computer vision systems (95%+), implementing NLP solutions with LLMs, and architecting scalable MLOps pipelines. My expertise spans from model development to deployment, ensuring robust performance in real-world applications.
Building impactful AI solutions across diverse industries
Innovative AI solutions demonstrating technical excellence and real-world impact
Built, debugged, and trained a Denoising Diffusion Probabilistic Model (DDPM) entirely from scratch using raw PyTorch. Features a custom YOLO-style U-Net with Sinusoidal Position Embeddings and a mathematically rigorous linear beta schedule. Scaled to train on the full 60,000-image MNIST dataset.
Advanced real-time vehicle tracking system using YOLOv8 object detection combined with SORT algorithm for multi-object tracking. Integrated EasyOCR for automatic license plate extraction and recognition with 90%+ accuracy in varied lighting conditions.
Comprehensive CLI and Flask-based application for intelligent document Q&A. Supports multiple formats including PDF, DOCX, and XLSX. Leverages state-of-the-art Transformer models for accurate information retrieval and contextual understanding.
AI-powered exam integrity monitoring system utilizing YOLO for real-time object detection and MediaPipe for precise head-pose estimation. Detects unauthorized devices and suspicious behaviors with automated alert generation for proctors.
Focuses on object detection using YOLOv11 and other YOLO models. Automates identifying objects in images, applicable for surveillance, quality control, and smart monitoring.
A comprehensive computer vision pipeline that detects humans and animals in images and videos, classifies them correctly, and simultaneously performs Optical Character Recognition (OCR) on the media.
Advanced computer vision system to accurately detect and track drones in varied environments using YOLO and SSD (Single Shot MultiBox Detector) architectures.
A robust QR code detection and decoding pipeline using OpenCV. Designed to handle challenging real-world conditions including noisy, rotated, or low-contrast inputs.
A research-oriented comparative analysis of two prominent deep learning approaches (CNN and ResNet50) for highly accurate drone image classification tasks.
Comprehensive skill set for end-to-end AI/ML development
CGPA
8.41
Duration
2020 - 2023
Open to discussing new opportunities and collaborations in AI/ML
I'm passionate about solving complex problems with AI and always interested in challenging projects. Whether you need help with computer vision, NLP, or building scalable ML systems, let's talk!