
Rupam Mahmood
Fellow & Canada CIFAR AI Chair
Academic Affiliations
Industry and Research Affiliations
Areas of Expertise
Fellow & Canada CIFAR AI Chair
Academic Affiliations
Industry and Research Affiliations
Areas of Expertise
Rupam Mahmood develops reinforcement learning algorithms and real-time learning systems for controlling physical robots.
Rupam Mahmood develops reinforcement learning algorithms and real-time learning systems for controlling physical robots. His research focuses on developing general and constructive mechanisms for continually improving robot minds. Currently, he is working on two long-term programs consisting of several short-term projects: a simple and general reinforcement learning system for robot control, and core constructive mechanisms for continually learning agents. In the first program, Rupam and his team are working to develop a reinforcement learning system that can be easily deployed in many different robots for solving various tasks. The second program has his research teams developing and analyzing algorithms for learning policies and representations in a continual learning setup, in which the agent is expected to go through a series of changes in the environment and tasks. Through these two programs, Rupam seeks to develop a system that enables scientific understanding as well as large-scale industrial adoption of robotics by analyzing and addressing shortcomings of current policy and representation learning methods.
Rupam is an Assistant Professor in the Department of Computing Science at the University of Alberta and a Fellow and Canada CIFAR AI Chair at Amii. In 2017, he achieved his Ph.D. in Statistical Machine Learning at the University of Alberta under the supervision of Richard S. Sutton (Amii Fellow, Chief Scientific Advisor and a pioneer of reinforcement learning) with his thesis focusing on incremental off-policy reinforcement learning algorithms. Previously, Rupam was a Research Scientist and then Lead of AI Research at Kindred Inc., where he now acts as a Scientific Advisor. He is an Associate Editor of the IEEE/RJS International Conference on Intelligent Robots and Systems (IROS) and a Senior Program Committee Member for the International Joint Conference on Artificial Intelligence (IJCAI). Rupam has produced software that provides a computational framework and a benchmark task suite for developing and evaluating reinforcement learning methods with physical robots.
Oct 29th 2018
Research Post
May 1st 2016
Research Post
Dec 8th 2014
Research Post
Rupam has produced software that provides a computational framework and a benchmark task suite for developing and evaluating reinforcement learning methods with physical robots.
Nov 30th 2020
News
Amii is proud to share the efforts and achievements of our researchers at the 34th Conference on Neural Information Processing Systems (NeurIPS), running online this year Dec 6 - 12.
Aug 6th 2020
News
DLRL brings together graduate students, post-docs and professionals to learn about deep learning and reinforcement learning. Learn about Amii's involvement at this year’s event.
Dec 9th 2019
News
Amii and UAlberta are pleased to welcome five new Canada CIFAR Chairs in AI to our family: Rupam Mahmood, Lili Mou, Nathan Sturtevant, Csaba Szepesvári and Adam White.
Looking to build AI capacity? Need a speaker at your event?