Code for Reinforcement Learning course

Added 2018-10-14 22:35:44 +0000 UTC

Comments

2020-07-19 07:32:04 +0000 UTC

2020-07-17 18:01:06 +0000 UTC

2020-07-17 15:03:22 +0000 UTC

2020-07-17 02:46:21 +0000 UTC

2020-07-15 04:44:14 +0000 UTC

2020-07-14 13:16:26 +0000 UTC

2020-07-07 06:49:41 +0000 UTC

2020-07-06 20:14:53 +0000 UTC

2020-07-06 05:17:29 +0000 UTC

2020-07-06 03:00:36 +0000 UTC

2020-04-21 04:06:50 +0000 UTC

2020-04-20 09:02:03 +0000 UTC

2020-04-15 04:17:14 +0000 UTC

2020-04-14 21:58:43 +0000 UTC

2019-11-29 07:27:01 +0000 UTC

2019-11-27 02:53:11 +0000 UTC

2019-09-04 16:34:43 +0000 UTC

2019-08-11 18:07:59 +0000 UTC

2019-08-11 18:06:54 +0000 UTC

2019-06-21 14:34:08 +0000 UTC

2019-06-21 12:10:32 +0000 UTC

2019-05-26 07:29:31 +0000 UTC

2019-05-25 23:26:45 +0000 UTC

2019-05-24 09:48:44 +0000 UTC

2019-05-24 02:45:30 +0000 UTC

2019-05-23 13:18:56 +0000 UTC

2019-05-23 11:33:29 +0000 UTC

2018-10-17 01:44:15 +0000 UTC

2018-10-17 01:28:28 +0000 UTC

2018-10-17 01:24:39 +0000 UTC

It was an error. I don't have the code right here, but I can reproduce it this evening. However, it was the same issue as this one: https://github.com/pytorch/pytorch/issues/4534

Does it give you an error when you pass a sample into this model, or are you just not seeing good performance with it? If it gives an error, what does it say?

Yes, you would just gauge the performance across your different models to see which one yields the best results. Mean reward over n episodes would be a good metric to use. There is no test set equivalent in this scenario.
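
As a rough illustration of the metric mentioned above, here is a minimal Python sketch that compares two policies by their mean total reward over n episodes. Everything in it is an assumption made for the example and not code from the course: CoinFlipEnv is a hypothetical toy environment, its step() returns a simplified (state, reward, done) tuple instead of Gym's actual return value, and mean_episode_reward is a helper name invented here.

```python
import random
from statistics import mean


class CoinFlipEnv:
    """Hypothetical stand-in environment (not from the course).

    Each episode lasts 10 steps; action 1 earns 1.0, action 0 earns 0.0.
    """

    def reset(self):
        self.steps_left = 10
        return 0  # single dummy state

    def step(self, action):
        # Simplified interface: returns (state, reward, done).
        self.steps_left -= 1
        reward = 1.0 if action == 1 else 0.0
        done = self.steps_left == 0
        return 0, reward, done


def mean_episode_reward(env, policy, n_episodes=100):
    """Run `policy` for n_episodes; return the mean total reward per episode."""
    totals = []
    for _ in range(n_episodes):
        state = env.reset()
        total, done = 0.0, False
        while not done:
            state, reward, done = env.step(policy(state))
            total += reward
        totals.append(total)
    return mean(totals)


rng = random.Random(0)  # fixed seed so the comparison is repeatable


def always_one(state):
    return 1


def random_policy(state):
    return rng.randint(0, 1)


score_a = mean_episode_reward(CoinFlipEnv(), always_one)
score_b = mean_episode_reward(CoinFlipEnv(), random_policy)
print("always_one:", score_a, "random_policy:", score_b)
```

With a real agent, the two policies would be two trained models, and the one with the higher mean would be the better candidate. Averaging over more episodes makes the comparison less noisy.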

In general, if I change the model and want to see whether I'm improving, is there an equivalent of a test set, and what metric is generally used? The mean of the reward over n episodes?

Yes, this is the render problem I was previously referring to. There appear to be some "hacks" that people have attempted to get around this when using OpenAI's Gym in a Colab notebook, but unfortunately there doesn't appear to be any straightforward solution for this.

I tried running the notebook for the Deep Q-Network in Colab, but it gave errors. Has it been run recently? If it does work, is it possible to make it a public Colab notebook?

Hey Ten5ei - You're welcome! The in_features parameter in an nn.Linear layer requires the size of *each* input sample, not the size of the batch. Therefore, we can pass a batch of data to the network, and the network understands that within that batch, the shape of each sample will be em.height*em.width*3.
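
To make the per-sample idea concrete, here is a plain-Python sketch of what a linear layer does with a batch. It is not PyTorch's implementation; make_linear and linear_forward are hypothetical helpers written for this example, and the 4 x 9 screen size is an assumed placeholder for em.height and em.width. The weights are sized by in_features only, so the same layer accepts a batch of any length.

```python
import random


def make_linear(in_features, out_features, seed=0):
    """Create a weight matrix of shape (out_features, in_features) and a zero bias."""
    rng = random.Random(seed)
    weights = [
        [rng.uniform(-0.1, 0.1) for _ in range(in_features)]
        for _ in range(out_features)
    ]
    bias = [0.0] * out_features
    return weights, bias


def linear_forward(batch, weights, bias):
    """Apply y = W x + b to every sample in the batch independently."""
    in_features = len(weights[0])
    outputs = []
    for sample in batch:
        if len(sample) != in_features:
            raise ValueError(
                f"expected {in_features} values per sample, got {len(sample)}"
            )
        outputs.append(
            [sum(w * x for w, x in zip(row, sample)) + b
             for row, b in zip(weights, bias)]
        )
    return outputs


# Assumed placeholder sizes standing in for em.height and em.width.
height, width, channels = 4, 9, 3
in_features = height * width * channels  # size of ONE flattened sample

weights, bias = make_linear(in_features, out_features=2)

# The batch size never appears in the layer definition.
for batch_size in (1, 5):
    batch = [[0.5] * in_features for _ in range(batch_size)]
    out = linear_forward(batch, weights, bias)
    print(batch_size, "samples in ->", len(out), "rows of", len(out[0]), "outputs")
```

The layer only complains when a single sample has the wrong length, which is why each image has to be flattened to height*width*3 values before it reaches the first linear layer.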

Hi, I'm trying to run the notebook in Google Colab, but the display is not working. Inside the plot function, plot.figure(2) is set up to plot two figures, but only the moving average was plotted; the CartPole image was not. How do you plot the figures side by side as in the video?

Hi Deeplizard! Thanks for the great work! Can't wait for your next videos on RL & DQN!

Can I please get them? Also, are there any plans for more tutorials? It would be really good if you could focus on robotics.

Hey guys, it looks like DQN Cart Pole is missing some parts. The code stops at the agent class, and there is no code for the last 2 videos of the series (17 & 18).

Hey Wassim - I myself haven't spent enough time on model-free RL implementation to have any solid resources I can vouch for. If you end up implementing this for Frozen Lake, I would love to hear how it goes.

If I want to implement Frozen Lake *without* a model, what would you recommend the states and the actions to be? Can you point me to some RL examples that are model-free?

Is the method of using value iteration an example of machine learning?

Hey David - I believe that the full 10,000 episodes only took a few minutes to complete on my side. What are the specs of the machine you're running? Perhaps too little memory?

So how much time should training for the Frozen Lake game take? I've copied the code, but it only does about 10 episodes per 6 seconds. So 10,000 episodes is going to take about 6,000 seconds (under 2 hours). Does this sound reasonable?

Hey Wassim - The next videos coming to the RL series will be on DQN code implementation. At that point, the RL code here will be updated with the DQN code.

Is there code for the DQN mechanisms? If not, is there a plan to share these?

Yes, it's completely down 😭 Twitter is going crazy!

Oops! YouTube is down!

I get "500 Internal Server Error" when trying to watch this video. I tried two different browsers. Any suggestions?
