Related projects
Discover more projects across a range of sectors and disciplines, from AI to cleantech to social innovation.
Deep reinforcement learning (DRL) for quadrupedal robot control has recently become tractable: serviceable control policies can be obtained in simulation in under an hour. However, transferring learned policies from simulation to the real world remains an arduous task. We propose a novel two-agent setup that leverages the strengths of both model-free and model-based DRL by introducing a new agent alongside the model-free control policy. The new agent is tasked with regularizing differences between operating environments: it generates world models on the fly that act as regularizers over the control policy’s inputs. With this setup, we aim to obtain a reliable sim2real strategy that helps take quadrupedal control policies out of simulation and into the real world.
Liam Paull
Kyoto University
Computer science
Education
Université de Montréal
Globalink Research Award