Posts by Collection

publications

Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction

Uses RL value functions to encode semantic knowledge, specifically for a robot.

Published: May 02, 2011

Download here

Off-Policy Actor-Critic

Off-policy actor-critic with linear state features. Includes eligibility traces.

Published: June 20, 2013

Download here

Human-level control through deep reinforcement learning

One of the first deep reinforcement learning papers.

Published: February 26, 2015

Download here

Deep Reinforcement Learning with Double Q-learning

Improved Q-value estimation by reducing overestimates of Deep Q-networks.

Published: December 08, 2015

Download here

Reinforcement Learning: An Introduction (2nd Edition)

In-progress second edition of an RL textbook.

Published: September 01, 2016

Download here

Attention and Augmented Recurrent Neural Networks

Overview (with references) of attention and several types of augmentation for RNNs.

Published: September 08, 2016

Download here

Reinforcement Learning with Unsupervised Auxiliary Tasks

Speeds up learning in a reinforcement learning system by adding auxiliary tasks.

Published: November 16, 2016

Download here

Questions and Intuition for Tackling Deep Learning Problems

Five questions to ask about your deep learning project.

Published: May 09, 2017

Download here

A simple neural network module for relational reasoning

A neural network module for reasoning about relationships between objects.

Published: June 05, 2017

Download here

One Model To Learn Them All

A single ML model used for very different tasks.

Published: June 16, 2017

Download here

Eligibility Traces

Notes on using eligibility traces with neural networks.

Published: July 17, 2017

Download here

Deep Learning in Robotics: A Review of Recent Research

A long review of the use of DL in robotics.

Published: August 15, 2017

Download here

An Overview of Multi-Task Learning in Deep Neural Networks

An overview of multi-task learning in deep neural networks.

Published: August 17, 2017

Download here

Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning

Pre-train using supervised learning on human-provided demonstrations.

Published: September 14, 2017

Download here

Mastering the game of Go without human knowledge

AlphaGo Zero, trained entirely through RL self-play.

Published: October 26, 2017

Download here

Tensorizing LSTMs

Tensorizing LSTMs to make them wider and deeper without adding parameters and with minimal extra compute costs.

Published: December 18, 2017

Download here

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations.

Published: February 01, 2018

Download here

World Models

Unsupervised learning of image encoding and dynamics model.

Published: April 09, 2018

Download here

Unsupervised Predictive Memory in a Goal-Directed Agent

Unsupervised training of a memory that is used for prediction of state and reward.

Published: April 12, 2018

Download here

Learning Real-World Robot Policies by Dreaming

Unsupervised learning of image encoding, dynamics and reward models.

Published: June 25, 2018

Download here

Sample-Efficient Deep RL with Generative Adversarial Tree Search

Learned dynamics model with a GAN for image generation and MCTS for planning.

Published: June 27, 2018

Download here

Unicorn: Continual learning with a universal, off-policy agent

Continual learning with a universal, off-policy agent.

Published: July 19, 2018

Download here

On the link between conscious function and general intelligence in humans and machines

Access consciousness and its relation to general intelligence.

Published: July 01, 2022

Download here

Bleyddyn
