The State of Imitation Learning:
Understanding its Applications and Promoting its Adoption
June 27, 2011
Robotics Science and Systems
Los Angeles, California, USA
Imitation learning has grown to be a large field with applications across robotics, neural computation, and artificial intelligence. As the field has developed, ideas have sprouted from a wide range of motivations and applications resulting differing terminology and significant overlap; terms such as apprenticeship learning, learning from demonstration, inverse optimal control, and inverse reinforcement learning mean the same thing to some, while to others they have vastly different connotations. Imitation learning is already creating a stir within the robotics community as an effective and practical way to transfer our intuition to real world robotics systems, and has the potential to revolutionize the way we approach system development. In this workshop, we will examine the collection of subfields within imitation learning together and attempt to construct a formal taxonomy of the tools and techniques available to solidify its foundation and promote wider adoption with the robotics community.
This fullday workshop will study the numerous subareas of imitation learning in order to synthesize and summarize the lessons learned and to draw connections between the array of tools available. The structure will consist primarily of invited speakers who are leaders within the field, as well as a poster session of committee reviewed extended abstract submissions.
Submission
We welcome submissions of 2-3 page extended abstracts for participation in the poster session. Submissions that highlight recent work and results in applying imitation learning to real robotics problems are encouraged in particular. Top submissions may be invited to give a short oral presentation during the workshop.
Submissions should be emailed to ratliffn+RSSWorkshopSubmit@google.com
Important Dates
Submission deadline: | May 9 (Extended) |
Acceptance notification: | May 27 |
Workshop: | June 27 |
Organizers
Brenna Argall, École Polytechnique Fédérale de Lausanne EPFL, Lausanne, Switzerland
Nathan Ratliff, Google, Pittsburgh, PA
David Silver, Carnegie Mellon University, Pittsburgh, PA
Program Committee
S.M. Khansari-Zadeh, École Polytechnique Fédérale de Lausanne EPFL, Lausanne, Switzerland
Zico Kolter, Massachusetts Institute of Technology, Cambridge, MA
Stephane Ross, Carnegie Mellon University, Pittsburgh, PA
Matt Zucker, Swarthmore College, Philadelphia, PA
Confirmed Speakers
Pieter Abbeel, University of California, Berkeley, CA
Drew Bagnell, Carnegie Mellon University, Pittsburgh, PA
Aude Billard, École Polytechnique Fédérale de Lausanne EPFL, Lausanne, Switzerland
Chad Jenkins, Brown University, Providence, RI
Jan Peters, Max Planck Institute for Intelligent Systems / Darmstadt University of Technology
Schedule
09:00 [40m] Introduction
09:40 [40m] Chad Jenkins
10:20 [10m] Submission: Teaching Robots to Execute Verb Phrases
10:30 [15m] coffee
10:45 [40m] Organizers
11:25 [40m] Pieter Abbeel
12:05 [10m] Submission: Toward Imitating Object Manipulation Tasks
Using Sequences of Movement Dependency Graphs
12:15 [1h15m] lunch
13:30 [40m] Drew Bagnell
14:10 [40m] Aude Billard
14:50 [10m] Submission: Blending Autonomous and Apprenticeship Learning
15:00 [30m] coffee
15:30 [40m] Jan Peters
16:10 [50m] Open Discussion
17:00 Workshop Ends
Invited Talks
Chad Jenkins: rosbridge: ROS for non-ROS users
Reproducibility, interoperability, and accessibility are critical needs for a thriving ecosystem of robotics research and development. These needs are especially true for robot learning from demonstration, where interactions with actual human users is essential. The combination of off-the-shelf robotics platforms, cloud computing, and common interchange protocols has the power to enable robots to be used like any other web services, opening vast new applications for robotics and robot learning. In this talk, I cover our recent work with rosbridge as a bridge to a Robot Operating System (ROS) that is independent of any specific operating system or build environment. Treating the ROS run-time environment as a robot server (similar to Apache for web content), I describe our robot web applications implemented purely through JavaScript/HTML for web-scale robot learning and a PR2 Remote Lab. Such applications demonstrate "no-install" interfaces for reaching broader populations of users as well as platforms for common decentralized experimentation.
Pieter Abbeel: Apprenticeship Learning for Autonomous Flight and Surgical Robotics
For many problems in robotics performance under tele-operation is far higher than under autonomous operation. In this talk I will present apprenticeship learning algorithms, which enable experts to teach robots through demonstrations. Our apprenticeship learning techniques have enabled a helicopter to perform advanced aggressive maneuvers, well beyond the prior state of the art, including maneuvers such as chaos and tic-tocs, which only exceptional expert human pilots can fly. I will also describe our preliminary results towards automating selected surgical skills.
Drew Bagnell: Computational Rationalization
I'll review the roles inverse optimal control can play in real-world robotics. I'll further discuss my personal view of the frontiers of such methods, including multiple agents and transitioning learning surrogate cost functions from imitation learning to reinforcement learning.
Aude Billard: Overview of 15 years of research in imitation learning
I will review the various works we did over the past 15 years: starting from the robot Robota and its use with autistic children, revising the various computational models of human imitation we developped and highlighting how these models informed our current robotics work. I will conclude with a few pointers on topics which I view of particular interest in the field. These include how to combine imitation learning with other learning techniques, how to learn from failed demonstrations and how to bridge the gap from trajectory level imitation to behavior-based imitation.
Jan Peters
Submitted Contributions
Teaching Robots to Execute Verb Phrases, Daniel Hewlett, Thomas J. Walsh, Paul Cohen [pdf]
Toward Imitating Object Manipulation Tasks Using Sequences of Movement Dependency Graphs, Vladimir Sukhoy, Shane Griffith, and Alexander Stoytchev [pdf]
Blending Autonomous and Apprenticeship Learning,Thomas J. Walsh and Daniel Hewlett and Clayton T. Morrison [pdf]