I build the full stack of intelligent systems: the agent harnesses, memory layers, self-correcting loops, and production infrastructure that make AI reliable at scale.
My focus: agentic architectures that plan, execute, and course-correct autonomously. Subagent orchestration. Context engineering for token efficiency. RLHF pipelines that close the feedback loop.
Research in LLM safety, cloud security, and low-resource NLP. Published in IEEE conferences.
Custom harnesses with tool orchestration, subagent spawning, and self-correcting loops. No human in the loop.
Context-compact architectures, RAG pipelines, and persistent memory for coherent multi-turn agents.
Human-in-the-loop annotation, automated evaluation, iterative prompt improvement. Data-driven agent quality.
IEEE conference publications. Ongoing work in LLM safety and low-resource NLP.
Agent design to production monitoring. No handoffs.
Published and ongoing work in cloud security, LLM safety, and low-resource NLP.
Discovering and documenting novel attack vectors that exploit AWS services for unauthorized cryptocurrency mining, contributing to cloud security defense strategies.
Comprehensive survey of emerging security paradigms in cloud computing, analyzing current threat landscapes and defense mechanisms across major cloud providers.
Investigating and mitigating the tendency of large language models to hallucinate non-existent software packages, a supply-chain attack vector where adversaries can register hallucinated package names with malicious code.
Building the largest known pretraining corpus for Kashmiri, a low-resource language with ~7M speakers. Enabling foundation model development for an underserved linguistic community.
The first dedicated neural model for Kashmiri diacritic restoration. A ByT5 byte-level seq2seq model with script-aware normalization and skeleton-preserving inference recovers omitted Perso-Arabic diacritics. Achieves 77.5% expert-rated accuracy; released a 23.7k aligned sentence-pair dataset.
Production systems, open-source tools, and research prototypes spanning AI, web, and security.

Built at the Emergent AI hackathon. 5 AI agents compete autonomously in "The Great Buildoff". Each agent confined to a 10x10 platform, builds unique structures via API. Custom harness with pathfinding, patrol loops, spawn protection, and event tracking.
Navigates multi-step web forms across external portals. Plans route, fills fields, self-corrects on errors, submits. Human-in-the-loop checkpoints for approval on sensitive steps.
Discovers, summarizes, and emails daily LLM research digests. Fully autonomous after setup.
Fine-tuned BERT first pass, LLM-as-a-Judge second pass. Self-correcting loop. Production scale.
Reads CRM context, crafts personalized messages, delivers across WhatsApp and Instagram. Persistent memory across sessions.

EdTech platform. AI chat agents, CRM, payments, operations dashboards. Thousands of active users.
Digital lending platform. MERN, AWS, CI/CD, server-side analytics. Built end-to-end.
Full-stack fashion e-commerce platform with catalog, payments, order management, and Redis-cached product search.
Immersive 3D web experience for a university cultural fest. Interactive Three.js scenes with real-time WebGL animations.
Compliance platform for executing CIS benchmark scripts and managing security audits. Automated baseline verification for enterprise infrastructure.
Comparison tool for similar products across Flipkart and Amazon. Side-by-side pricing, ratings, and availability.
NLP-powered visualization mapping how cryptojacking attack techniques evolved over time. Analyzes threat intelligence reports and plots progression.
Mobile app with real-time AR haircut simulation using Mediapipe face/head tracking and 3D overlay rendering.
AI engineering roles. Research collaborations. Hard problems that need autonomous systems.