Veracode · via Accion Labs
Senior Data Scientist
- Applying machine learning to application security — the discipline I practised as a guardian at Amazon, now as the mission itself.
Incoming transmission · Bengaluru, Earth
Senior Data Scientist @ Veracode · ex-AWS Bedrock
I build autonomous AI systems — multi-agent architectures, retrieval-augmented generation, and the ML infrastructure that keeps them honest. Forged on AWS Bedrock, now pointed at application security.
I'm a Senior Data Scientist at Veracode with ~7 years of engineering experience and an M.Tech in Data Science from BITS Pilani. Before this I spent 3½ years on AWS A.I. / Bedrock, working at the seam between research and production — taking ideas like multi-agent coordination and retrieval-augmented generation and turning them into systems that run unattended, at AWS scale.
The move to application security wasn't a detour. At Amazon I served as an Application Security Guardian alongside my day job; at Veracode, securing the world's software is the day job — with machine learning as the instrument.
The thread connecting all of it is the same one that makes Interstellar my favourite film: a fascination with systems bigger than ourselves — black holes, orbital mechanics, and machines that can reason. I'm drawn to Research Engineer and Applied Scientist problems where engineering rigor meets state-of-the-art AI.
Senior Data Scientist
System Development Engineer Nov 2024 — Apr 2026
Application Engineer Oct 2022 — Nov 2024
System Engineer Apr 2021 — Oct 2022
Assistant System Engineer Jul 2019 — Mar 2021
Multi-Agent Region Expansion System
A novel Coordinator–Delegator–Worker architecture on Amazon Bedrock that autonomously runs AWS region expansion. Agents retrieve historical deployment issues via RAG and resolve infrastructure failures without human intervention — a hybrid mesh of LLM reasoning and Lambda execution.
IMPACT 2 weeks of manual effort → 4 hours, fully autonomous.
Vector Integrated Search & Retrieval
A production-grade RAG system built past the limits of native knowledge bases: SageMaker for embedding generation, OpenSearch Serverless for k-NN indexing, and an AWS Batch ingestion pipeline that chunks and preprocesses large corpuses across PDF, Excel, and text.
IMPACT Unlimited document volume; retrieval latency from hours → seconds.
Reproducible sanitisation for LLM training data
A modular preprocessing pipeline for high-volume RLHF datasets, with custom transformers (TransformerMixin pattern) encapsulating BERTScore similarity and Detoxify filtering, plus automated outlier and noise handling.
IMPACT −40% manual review overhead; 95% of data-quality issues caught.
M.Tech thesis · BITS Pilani
An abstractive summarisation model for Indian legal constitutional documents, built on Bi-Directional LSTM and GRU networks with an attention mechanism to preserve context across long-form legal text.
IMPACT Bridged NLP research and a hard real-world domain — long, dense legal language.
BITS Pilani · 2020 — 2022 · CGPA 8.53/10
Thesis: AI-based legal document summarisation (LSTM/GRU). Coursework: Deep Learning, NLP, Information Retrieval.
NSS College of Engineering · 2015 — 2019 · CGPA 8.19/10
Hiring for a Research Engineer or Applied Scientist role? I'd love to talk. Email is the fastest channel — I usually reply within a day.
Bengaluru, India · UTC+5:30 · no time dilation observed