16-633: Robot Cognition and Learning
Spring 2024
Instructor: Chris Atkeson, cga@cmu.edu
TA: Viekash Vinoth Kumar, vvinothk@andrew.cmu.edu
Class time: MW 3:30-4:20PM
Place: WEH 2302
Units: 12
For both undergraduate and graduate students

Course description

Course administration and policies

Schedule

Jan 17: Introduction:
Go over class web page
Introduction to the course
Jan 22: Object detection. notes.txt
Paper: Object Detection in 20 Years: A Survey
Jan 24: Object detection continued.
Jan 29: Video analysis: Using self supervision (bottom-up processing) to learn appearance and shape models.
Jan 31: What can we do with object detection?
Feb 5: Not making the same mistake twice.
Feb 7: Discussing class robot.
Feb 12: What should the units of behavior be? Slides
Feb 14: Using LLMs
To be scheduled.
April 22: Project presentations
April 24: Project presentations
May 6 - Graduating students have to have turned everything in.
May 12 - All students have to have turned everything in.

Assignments

Assignment 0: Due Jan 20. Send email to Chris and cc the TA: Who are you? Why are you here? Do you have any relevant experience or prior work? Any ideas about the project you want to do? ("I don't know what project I want to do." is okay.) What topics would you like the course to cover? Google and send us some interesting URLs. Are there other versions of this course at other places or useful web pages we should look at (send URLs)?
Be sure your name is obvious in the email, and you mention the course name or number in the subject line. I teach more than one course, and a random email from robotlover@cs.cmu.edu is hard for us to process.
Assignment 1: Due Jan 28. Get some kind of object detector running on your laptop or desktop computer. A popular choice is something from the YOLO series (another writeup) (and another writeup). Detectron2 is another popular object detector that also does segmentation (creates masks). CenterNet avoids the use of region proposals. Ways to find others include googling "object detector" or "object detector tutorial", Papers With Code, and searching on github: object detection and real time object detection. The software should name the object, and locate it in the image using a bounding box or a mask. Try using the object detector for tracking, using naming to help with associating objects to tracks, and the bounding box or mask centroid and some measure of angle to provide the object pose in each image.

Project

Each group will have their own robot to work with. However, I am very interested multi-robot tasks, so it would be great if groups can sometimes work together. We will start with this robot, and escalate if that is useful.

We will work out the specific project topics together. The ideal project would involve creating a system that we could deploy in Newell Simon Hall (kiosk or roving robot).

See the deadlines in the schedule (above). You can work in groups or alone. The "deliverables" include a github web page providing your code and explaining how others could do your project and improve on your results. You will also present your project, and ideally the presentation should be made public as part of your web page. There will be intermediate deliverables including draft web pages and practice presentations.

More Resources

Github
Hugging Face
Papers with code

Course poster, suitable from framing:

16-633: Robot Cognition and Learning Spring 2024 Instructor: Chris Atkeson, cga@cmu.edu TA: Viekash Vinoth Kumar, vvinothk@andrew.cmu.edu Class time: MW 3:30-4:20PM Place: WEH 2302 Units: 12 For both undergraduate and graduate students