Anurag Koul

(अनुराग कौल)

Member of Engineering

Poolside

Biography

Hi! I’m a Member of Engineering (RL) at Poolside. Prior to this, I have worked at Amazon, Microsoft, and other companies, on research in reinforcement learning (RL), and completed my Ph.D. at Oregon State Universityunder the supervision of Prof. Alan Fern. My research interests span a wide range of RL topics, including explainability of RL agents, model-based RL, planning, offline RL, hierarchical RL, safety of agents, multi-agent RL, and decision/inference-time planning for robotic control and LLM reasoning. Currently, I focus on applying RL and reasoning techniques to improve code generation in large language models (LLMs). If you’re interested in my work or would like to collaborate, feel free to reach out.

Interests

Artificial Intelligence
Reinforcement Learning
Planning
Reasoning
Abstraction
Large Language Models

Education

Ph.D. in Computer Science, 2016 - 2022
Oregon State University
Thesis: Investigating Latent State and Uncertainty Representations in Reinforcement Learning
B.E. in Computer Science, 2010 - 2014
University of Mumbai

Experience

Member of Engineering

Poolside

December 2025 – Present New York

Research and development of reasoning and reinforcement learning for code generation with Large Language Models.

Applied Scientist 2

Amazon

August 2024 – November 2025 New York

Research and development of large language models for code generation.

PostDoctoral Researcher

Microsoft Research

October 2022 – July 2024 New York

Research on planning and latent-state representation for reinforcement learning

Research Intern

Microsoft Research

June 2022 – September 2022 New York

Research on safe reinforcement learning for systems

Research Intern

Intel AI Labs

June 2020 – September 2020 Remote

Research on model-based reinforcement learning for control

Research Intern

SAS

May 2019 – August 2019 North Carolina

Research on multi-agent reinforcement learning

Senior Software Engineer

Capgemini

August 2015 – August 2016 Gurugram, India

Developed full-stack web-products for telecommunication space.

Software Engineer

Capgemini

August 2014 – August 2015 Gurugram, India

Research Papers

Quickly discover relevant content by filtering research papers.

George Ma, Anurag Koul, Qi Chen, Yawen Wu, Sachit Kuhar, Yu Yu, Aritra Sengupta, Varun Kumar, Murali Krishna Ramanathan (2025). SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion.

PDF Cite

Anurag Koul, Shivakanth Sujit, Shaoru Chen, Ben Evans, Lili Wu, Byron Xu, Rajan Chari, Riashat Islam, Raihan Seraj, Yonathan Efroni, Lekan Molu, Miro Dudik, John Langford, Alex Lamb (2024). PcLast: Discovering Plannable Continuous Latent States. ICML.

PDF Cite

Anurag Koul, Mariano Phielipp, Alan Fern (2022). Offline Policy Comparison with Confidence: Benchmarks and Baselines. DRL Workshop, ICLR.

PDF Cite Poster Slides Docs