Principal Research Scientist, Amazon
I am a Principal Research Scientist at Amazon, where I lead research on reinforcement learning for large-scale supply chain decision systems. My work spans reinforcement learning, causal inference and experimental design, LLM post-training (RLHF / preference optimization), and game theory.
I received my Ph.D. in Statistics from the University of Washington, advised by Marina Meilă. Previously, I was a Senior Quantitative Analyst at Google. My earlier research focused on manifold learning and spectral methods.
Model-based RL, constrained RL, sim2real transfer, inventory control and placement optimization
RLHF, preference optimization (DPO / C2-DPO), fine-tuning, evaluation
Interference-aware experimentation, meta-analysis of randomized experiments, treatment effect transportation
Marketplace equilibrium, competitive pricing mechanisms, shared-revenue models
See Google Scholar for a full list.
Principal Research Scientist, Supply Chain Optimization Technologies
Tech lead for a 15+ researcher group building reinforcement learning systems for inventory buying and placement decisions spanning multi-billion-dollar annual inventory spend. Led initiatives that drove nine-figure impact in automated ordering, experimentation methodology, and forecasting. Co-developed C2-DPO for LLM preference optimization and LLMForecaster for demand forecasting with LLM embeddings.
Senior Quantitative Analyst, Advanced Measurement Technologies
Led a team of statisticians on cross-media measurement. Designed experiments for face recognition and gaze estimation systems. Developed indoor positioning models using WiFi, Bluetooth, and sensor fusion.
Research Scientist, Demand Forecasting
Developed first demand forecasting models for Amazon Pantry at launch. Scaled Bayesian and quantile regression to big data via MapReduce. Designed methods for imputing demand data at scale.
University of Washington, 2012
Advisor: Marina Meilă
McGill University, 2007
McGill University, 2003