Data Engineer specializing in scalable ETL pipelines, machine learning, and distributed computing. Published researcher with expertise in deep reinforcement learning and generative AI.
Data Engineer | ML Engineer
Passionate about leveraging data and AI to solve complex problems at scale
I'm a Data Engineer at Adept Tech Solutions, where I build scalable ETL pipelines and synthetic data platforms using PySpark and FastAPI. My work focuses on distributed computing, data profiling, and production-grade microservices.
With a background in Electrical Engineering from NUST, I've published research at IEEE WCNC 2024 on deep reinforcement learning for IoT networks. I combine strong engineering fundamentals with cutting-edge ML techniques.
Recently earned Top Performer recognition at the Pak Angels GenAI Hackathon, building AI-powered systems using BERT, LLaMA, and LangChain. I'm passionate about generative AI, RAG systems, and autonomous agents.
Building scalable systems and pushing the boundaries of AI
Deep Reinforcement Learning framework optimizing computation offloading for energy-harvesting IoT devices. Achieved superior performance over baseline methods with faster convergence and minimized service delay.
Hybrid AI-powered phishing detection combining BERT and LLaMA for supply chain security. Winner of Top Performer award at Pak Angels GenAI Hackathon among competitive cohort.
Novel Conv2D + LSTM architectures for video-based regression on UBFC dataset. Implemented custom preprocessing pipelines and data augmentation strategies.
Production-grade distributed ETL pipelines for data profiling, feature engineering, and synthetic data generation. Built scalable microservices with FastAPI and PySpark to orchestrate high-volume data workflows. The platform supports end-to-end synthetic data creation pipelines used for advanced analytics, model training, and privacy-preserving data sharing across teams.
Retrieval-Augmented Generation system using Groq LLM and FAISS vector database for efficient document querying and context-aware responses.
Vision-based chatbot for analyzing construction damage images using multimodal AI. Deployed as interactive Gradio application for real-time assessment.
Ultra-low power RFID chip for animal tagging using CMOS 65nm technology. Focused on power optimization, reliability, and market competitiveness.
AI-powered transcription and summarization tool for meetings. Uses speech-to-text and LLM-based summarization for actionable insights.
Multi-agent system demonstrating autonomous decision-making and task execution. Implements ReAct framework for reasoning and action.
Custom 4-bit microprocessor with hierarchical datapath and control logic. Implemented VGA line-drawing algorithms in Verilog for FPGA deployment.
Contributing to the advancement of AI and IoT systems
Proposed a Deep Deterministic Policy Gradient (DDPG) framework for optimizing resource allocation in energy-harvesting IoT devices using CR-NOMA and Mobile Edge Computing. Demonstrated superior performance over baseline methods with faster convergence.
Open to collaborating on data engineering and ML projects