01
EXPERIENCE
AfterQueryACTIVE
AI BENCHMARK ENGINEER — FULL-STACK & SYSTEMS TASK AUTHOR
- Author production-grade SWE-bench-style coding tasks & terminal agent benchmarks (Silver & Pluto) used to evaluate frontier models including Claude Opus on realistic full-stack problems.
- Design end-to-end task pipelines: Docker environments, test harnesses, oracle solutions, automated CI/CD verification — deterministic behavior-driven contracts across multi-file codebases.
- Engineer machine-verifiable suites (pytest, Jest, cargo) — null/oracle loops, cross-cutting bug detection, edge-case regressions in TypeScript, Python, and Node.js systems.
- Trace symptoms across distributed services, isolate failure modes, produce reference patches applied cleanly against pinned base commits in live production repos.
WelBuilt AI Solution Pvt LtdACTIVE
AI ENGINEER INTERN
- Architecting production-grade AI models and integrating them into scalable enterprise systems.
- Optimizing LLM inference speeds and developing automated data pipelines to support agentic AI features.
AIOT ENGINEER — PRODUCT LEAD
- Built scalable full-stack AI/IoT systems with cloud-connected backend APIs and real-time monitoring dashboards deployed in production.
- Designed modular REST API architectures for production-grade IoT and AI workflows — balancing reliability, latency, and operational simplicity.
- Led product direction, deployment pipelines, cross-functional debugging, and rapid iteration cycles.