About me

I am a second-year MS/Ph.D. student at the College of Information and Computer Sciences, University of Massachusetts, Amherst, advised by Prof. Shlomo Zilberstein. I received a B.E. (Hons.) in Computer Science from BITS-Pilani, India, in 2015. Previously, I was a Research Engineer at Singapore Management University (SMU), where I worked with Pradeep Varakantham and Akshat Kumar. Before that, I worked as a software engineer at @WalmartLabs Bangalore and as an intern at Amazon Bangalore.

I am interested in making fundamental contributions to AI research, especially to sequential decision making. The problem that keeps me up at night is how to scale Reinforcement Learning so that it works for autonomous agents in the real world.
At present, my research efforts lie at the intersection of model-free RL, model-based RL, planning, efficient exploration and learning, heuristic search, anytime algorithms, and metareasoning.
At SMU, I worked on optimizing constrained resource allocation at scale using deep RL [1].
I am the proud owner of an RL repository [2], where I have implemented a host of popular modern RL algorithms.

When I am not thinking about Bellman equations, I love consuming other science content, such as documentaries on physics, evolutionary biology, neuroscience, and psychology, partly in search of clues for my research.

I’m a huge fan of Richard Feynman and Roger Federer.
My friends tell me I sing well.

CV download link.

[1] Bhatia, A.; Varakantham, P.; Kumar, A. Resource Constrained Deep Reinforcement Learning. ICAPS 2019.

[2] https://github.com/bhatiaabhinav/RL-v2