MPC ML Task List

This is my very tentative list of features to add to MaLPi. Although they are listed in numbered order, only the first one is ‘real’. If/when I finish the first task I’ll re-asses the list, possibly moving things around or dropping some or adding new ones.

VAE based DonkeyCar pilot
Pre-train VAE on images from all but one track
Train separate policies on each track with the VAE as an image embedder
Switch to RNN policy
Train a single policy on multiple tracks/tasks
Add a small DNC as a working memory
Add an IMU and learn to detect crashes/bumps
Add more tasks
- Cone/Human/dog/etc detector (bounding box as output)
- Drive to goal with goal given as an image
- Sketches as inputs (goals) and/or outputs
- NLP description of a scene or task trajectory as output
Switch to more formal multi-task and/or meta-learning and/or lifelong learning
Add some form of Aggregate memory that includes all previous experience

Model Predictive Control is a control method based on planning n-steps into the future to find the best path, as currently determined by the model. Take the first step. Then re-plan, take the new first step. Repeat until done.

Bleyddyn