Related projects
Discover more projects across a range of sectors and discipline — from AI to cleantech to social innovation.
The goal of the project is to improve upon the methodology behind goal conditioned learning. In this framework, similar to the setup in traditional reinforcement learning, an agent interacts with an environment. However, instead of training the agent to maximize return, the agent is trained to reach a given goal at the end of the trajectory. That is, given a rollout-specific goal, the agent attempts to reach it. This goal conditioned paradigm is particularly promising for applications where the objective changes in every episode, for example, controlling a robot or a drone for different tasks; or self-driving vehicles, where the destination might change between episodes. In this project, we will explore potential improvements within the goal conditioned framework, both in the discrete and continuous action space settings.
Arvind Gupta
Panteha Naderian
Layer 6 AI
Computer science
Professional, scientific and technical services
University of Toronto
Accelerate
Discover more projects across a range of sectors and discipline — from AI to cleantech to social innovation.
Find the perfect opportunity to put your academic skills and knowledge into practice!
Find ProjectsThe strong support from governments across Canada, international partners, universities, colleges, companies, and community organizations has enabled Mitacs to focus on the core idea that talent and partnerships power innovation — and innovation creates a better future.