Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the average reward #3

Open
Tendimension opened this issue Dec 2, 2016 · 6 comments
Open

About the average reward #3

Tendimension opened this issue Dec 2, 2016 · 6 comments

Comments

@Tendimension
Copy link

I have run 20 million frames(time-steps) in the Breakout environment, but the average reward has not changed. After about 17 million steps, the average reward has changed in Asynchronous Methods for Deep Reinforcement Learning. I do not know where the problem is?

@kkjh0723
Copy link

kkjh0723 commented Dec 3, 2016

@Tendimension Do you find any reason? I have the same problem. The avg. reward is 2.0 and std. is 0.0 until 20 million frames. Is the reward going up after some period?

@Tendimension
Copy link
Author

@kkjh0723 I do not know what the reason is.

@yao62995
Copy link
Owner

yao62995 commented Dec 7, 2016

@Tendimension @kkjh0723 I also found this bug. I will check it soon.

@Tendimension
Copy link
Author

@yao62995 Thanks a million!

@kkjh0723
Copy link

@yao62995 Do you have any updates on this problem?

@andyxzq
Copy link

andyxzq commented Feb 12, 2018

I find the same issue. The average reward is still 0.0 after 1 million steps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants