Computer Vision & Data Engineer
Engineering production-ready AI solutions with expertise in Computer Vision, Natural Language Processing, and scalable MLOps infrastructure. Passionate about transforming cutting-edge research into real-world applications.
Bridging the gap between advanced AI research and practical business solutions
Computer Vision Engineer & ML Researcher with 1+ years of experience building production-ready AI solutions. I specialize in developing high-accuracy computer vision systems (95%+), implementing NLP solutions with LLMs, and architecting scalable MLOps pipelines. My expertise spans from model development to deployment, ensuring robust performance in real-world applications.
Building impactful AI solutions across diverse industries
AI-Powered Pet Analytics & Edge Monitoring
Traditional pet monitoring systems are often limited by high latency and static analysis.
My goal was to build a high-performance, edge-optimized system capable of real-time detection, persistent tracking, and behavior classification, ensuring a smooth 30 FPS visual experience even on resource-constrained hardware.
Architected a dual-thread system to prevent frame drops. The stream_worker manages the 30 FPS visual layer, while the ai_worker executes the inference pipeline asynchronously at maximum CPU potential.
Implemented an optimized pipeline using onnxruntime featuring YOLO26 Nano (Detection), IoU Tracker (Tracking), OSNet (Re-ID), and MobileNetV2 for real-time behavior classification.
Integrated manual memory management, RAM-cached SQLite lookups for zero-latency weight serialization, and HTML5 Canvas rendering to bypass DOM-based memory leaks.
Innovative AI solutions demonstrating technical excellence and real-world impact
Built, debugged, and trained a Denoising Diffusion Probabilistic Model (DDPM) entirely from scratch using raw PyTorch. Features a custom YOLO-style U-Net with Sinusoidal Position Embeddings and a mathematically rigorous linear beta schedule. Scaled to train on the full 60,000-image MNIST dataset.
Advanced real-time vehicle tracking system using YOLOv8 object detection combined with SORT algorithm for multi-object tracking. Integrated EasyOCR for automatic license plate extraction and recognition with 90%+ accuracy in varied lighting conditions.
Comprehensive CLI and Flask-based application for intelligent document Q&A. Supports multiple formats including PDF, DOCX, and XLSX. Leverages state-of-the-art Transformer models for accurate information retrieval and contextual understanding.
AI-powered exam integrity monitoring system utilizing YOLO for real-time object detection and MediaPipe for precise head-pose estimation. Detects unauthorized devices and suspicious behaviors with automated alert generation for proctors.
Focuses on object detection using YOLOv11 and other YOLO models. Automates identifying objects in images, applicable for surveillance, quality control, and smart monitoring.
A comprehensive computer vision pipeline that detects humans and animals in images and videos, classifies them correctly, and simultaneously performs Optical Character Recognition (OCR) on the media.
Advanced computer vision system to accurately detect and track drones in varied environments using YOLO and SSD (Single Shot MultiBox Detector) architectures.
A robust QR code detection and decoding pipeline using OpenCV. Designed to handle challenging real-world conditions including noisy, rotated, or low-contrast inputs.
A research-oriented comparative analysis of two prominent deep learning approaches (CNN and ResNet50) for highly accurate drone image classification tasks.
Comprehensive skill set for end-to-end AI/ML development
CGPA
8.41
Duration
2020 - 2023
Open to discussing new opportunities and collaborations in AI/ML
I'm passionate about solving complex problems with AI and always interested in challenging projects. Whether you need help with computer vision, NLP, or building scalable ML systems, let's talk!