






Building trustworthy AI systems — Multi-Agent RL, LLM Safety, AI Governance, Responsible AI. PyTorch · AWS · RAG. Bengaluru, India.
The hardest ML engineering problem isn't building a high-performing model — it's making sure it doesn't cause harm in production. Adversarial testing, red-teaming, fairness metrics, and model auditability — integrated into the lifecycle, not bolted on after deployment.
Co-authored the first benchmark for temporal safety in autonomous code agents. 9,213 trajectories, new metrics, evidence that early detection cuts monitoring costs by 75% — projected $108M in enterprise savings. ICML 2026 submission. Parallel work at U. Arizona on Multi-Agent RL for cybersecurity.
Handshake — AI Fellow (2025–26) · benchmarking LLMs on CS fundamentals.
Escape™ App AI — ML Engineer · PyTorch recommender, +30% engagement, HIPAA/GDPR ETL.
Omdena — ML Engineer · agentic mental-health chatbot (CrewAI · LangChain · RAG), −95% harmful responses.
Amazon — Support OPS · ML annotation, +20% accuracy.
Core: Multi-Agent RL · LLM Safety & Guardrails · RAG Architectures · Adversarial Robustness · Responsible AI · AI Governance · Privacy-Preserving ML.
Stack: PyTorch · AWS (SageMaker, EC2, S3) · LangChain · CrewAI · LangGraph · Docker · MLOps · NLP · Transformers · Python · SQL.
University of Arizona — M.S. Information Science (2023–2025).
Visvesvaraya Technological University — B.E. Mechanical Engineering (2018–2022).
Certifications: Future AWS AI Scientist · ML with Python · Oracle Cloud Infrastructure 2025 · Certified Generative AI Pro · Google IT Automation · Oracle AI Vector Search Pro.
Currently exploring Machine Learning Engineer, AI Researcher, ML Researcher, AI Safety Engineer, Responsible AI Engineer, and AI Governance opportunities. Reach me at hemantkumar.bk@gmail.com or linkedin.com/in/hemantbk.
9,213 trajectories: the curated dataset behind StepShield, the first temporal safety benchmark for autonomous code agents.
75% reduction in monitoring costs unlocked by early temporal detection, translating to projected enterprise savings of $108M.
95% cut in harmful responses on the Omdena agentic chatbot via bias detection, toxicity filtering, and empathy validation guardrails.
30% engagement lift from the Escape™ App AI PyTorch recommender, built with HIPAA/GDPR-compliant ETL from day one.



