Sitemap

I had completely forgotten to normalize the images I’m feeding into MaLPi’s network, so I thought I’d try to be a bit more formal about it than my usual.

Published: December 13, 2017

1 minute read

Getting a Keras LSTM layer to work on MaLPi

Training on batch sizes and/or sequence lengths longer than one, while still being able to run one image at a time on the robot.

Published: October 19, 2017

3 minute read

Experimenting with OpenAIs Baselines code

I forked Open AI’s baseline code and made a few changes. This was my first full run before I started playing around with the model architecture.

Published: September 05, 2017

2 minute read

Single Battery

Eliminated a battery! (maybe)

Published: March 27, 2017

less than 1 minute read

State of the Hardware v2

I’m about to make large changes to MaLPi so I wanted to document the current state of the hardware.

Published: October 08, 2016

less than 1 minute read

Motors

What have I gotten myself into?!?! Motors and controllers and a breadboard, oh my!

Published: August 24, 2013

less than 1 minute read

Lego Chassis

Some progress on the hardware front.

Published: March 27, 2013

less than 1 minute read

Endurance Test

Test how long the PowerGen battery can run MaLPi on a single charge.

Published: March 18, 2013

1 minute read

MaLPi Intro

MaLPi (Machine Learning Pi)

Published: March 17, 2013

1 minute read

publications

Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction

Using RL value functions to encode semantic knowledge, specifically by a robot.

Published: May 02, 2011

Download here

Off-Policy Actor-Critic

Off-Policy AC with linear state features. Includes elegibility traces.

Published: June 20, 2013

Download here

Human-level control through deep reinforcement learning

One of the first deep reinforcement learning papers.

Published: February 26, 2015

Download here

Deep Reinforcement Learning with Double Q-learning

Improved Q-value estimation by reducing overestimates of Deep Q-networks.

Published: December 08, 2015

Download here

Reinforcement Learning: An Introduction (2nd Edition)

In-progress second edition of an RL textbook.

Published: September 01, 2016

Download here

Attention and Augmented Recurrent Neural Networks

Overview (with references) of attention and several types of augmentation for RNNs.

Published: September 08, 2016

Download here

Reinforcement Learning with Unsupervised Auxiliary Tasks

Increase speed of a Reinforcement Learning system with auxiliary task.

Published: November 16, 2016

Download here

Questions and Intuition for Tackling Deep Learning Problems

Five questions to ask about your deep learning project.

Published: May 09, 2017

Download here

A simple neural network module for relational reasoning

Relationships between objects.

Published: June 05, 2017

Download here

One Model To Learn Them All

A single ML model used for very different tasks.

Published: June 16, 2017

Download here

Eligibility Traces

Notes on using Eligibility Traces with neural networks

Published: July 17, 2017

Download here

Deep Learning in Robotics: A Review of Recent Research

A long review of the use of DL in robotics

Published: August 15, 2017

Download here

An Overview of Multi-Task Learning in Deep Neural Networks

A long review of the use of DL in robotics

Published: August 17, 2017

Download here

Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning

Pre-train using supervised learning on human provided demonstations.

Published: September 14, 2017

Download here

Mastering the game of Go without human knowledge

AlphaGo Zero, all RL self-play.

Published: October 26, 2017

Download here

Tensorizing LSTMs

Tensorizing LSTMs to make them wider and deeper without adding parameters and with minimal extra compute costs.

Published: December 18, 2017

Download here

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations.

Published: February 01, 2018

Download here

World Models

Unsupervised learning of image encoding and dynamics model.

Published: April 09, 2018

Download here

Unsupervised Predictive Memory in a Goal-Directed Agent

Unsupervised training of a memory that is used for prediction of state and reward.

Published: April 12, 2018

Download here

Learning Real-World Robot Policies by Dreaming

Unsupervised learning of image encoding, dynamics and reward models.

Published: June 25, 2018

Download here

Sample-Efficient Deep RL with Generative Adversarial Tree Search

Learned dynamics model with a GAN for image generation and MCTS for planning.

Published: June 27, 2018

Download here

Unicorn: Continual learning with a universal, off-policy agent

Continual learning with a universal, off-policy agent.

Published: July 19, 2018

Download here

On the link between conscious function and general intelligence in humans and machines

Access consciousness and it’s relation to general intelligence

Published: July 01, 2022

Download here

Bleyddyn

Sitemap

Pages

Posts

publications