About
Hi, I’m Sajan Kumar. I build AI systems that go from prototype to production and keep working at scale. I’ve been the founding ML engineer at a YC startup, scaled DeFi platforms to 2M+ users, and now ship computer vision systems processing 10,000+ streams a day for state infrastructure.
How I got here
I fell into production ML at OurEye.ai (YC22) working on CCTV analytics for 500+ networks. When a thousand cameras started streaming, the only thing that mattered was whether the system stayed up. That’s where I learned: anyone can train a model—getting it to serve millions reliably is the real game. Since then I’ve bounced across surveillance, crypto, security automation, and transportation, always focused on making AI production-ready.
What I actually do
- LLM systems: retrieval/rerankers, multi-agent workflows, eval loops, and guardrails.
- Vision at scale: distributed inference, GPU utilization tuning, low-latency pipelines on mixed hardware.
- Platform & ops: serving, CI/CD, IaC, observability, and reliability for fast shipping.
- Product journeys: 0→1 and then 1→100—shipping, measuring, hardening.
Currently
- Abstract Security — Senior ML Engineer (Oct 2024–Present)
Leading LLM-powered automation and multi-agent workflows; processing 20TB+ security data and cutting triage from hours to minutes. - Purdue University (TASI) — Graduate Research Assistant (Jan 2025–Present)
Optimizing statewide CCTV tracking and re-ID with transformer embeddings; 80%+ GPU utilization and ~40% lower latency on 10K+ streams/day.
Previously
- WCG Clinical — Senior Data & Analytics Engineer (Aug 2024–Oct 2024) — Tuned GenAI/embeddings APIs and pipelines to cut latency and cloud costs.
- CoinDCX — Senior Software Engineer (Oct 2022–Jul 2024) — Scaled DeFi wallet/microservices to 2M+ users; $50M+ daily volume; LLM-powered portfolio automation and fraud detection.
- OurEye.ai (YC22) — Founding ML Engineer (Dec 2020–Sep 2022) — Real-time video analytics on edge hardware across 500+ CCTV networks.
Highlights
- Automated 60–70% of security triage with LangGraph-based agent workflows on vLLM.
- Statewide CCTV analytics: >80% GPU utilization, ~40% latency reduction on 10K+ streams/day.
- Built microservices with 99.9% uptime handling $50M+ daily transactions.
Reach me
- Resume: view or download PDF
- Email: kumar836@purdue.edu