Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Robot moving in circle after set the multiple seed and change the Lr #163

Open
Seher-789 opened this issue Oct 31, 2024 · 5 comments
Open

Comments

@Seher-789
Copy link

Seher-789 commented Oct 31, 2024

i have trained my robot many time but robot is moving in circle while evolution and i changed the seed from 0 to 42 but no result even change the lr from 1e-3 to 1e-4 and policy noise but no result please suggest me some solution
Screenshot from 2024-10-31 23-38-19

@reiniscimurs
Copy link
Owner

Hi,

Please use the submission template when submitting the issue. This gives me the necessary information to start figuring out what the issue is.

How many epochs did you try training with each seed?

@Seher-789
Copy link
Author

Hi, Thank you for response i trained for 75 but the reward was negative.

@reiniscimurs
Copy link
Owner

Again, please provide full information as in the template.

75 epochs for each seed? Seems excessive. If it does not converge at such time for any seed there is either some change or mistake in the implementation. Here, the template information is needed.

@Seher-789
Copy link
Author

thank you for response , No i have trained the robot 75 epochs for seed 2 and seed 0 but the rewards were negative and i have trained with all the other seed values for ten epochs but the rewards were negative after the validating start the robot starts to move in circle

@reiniscimurs
Copy link
Owner

One last time, provide the information as set in the issue template. It is always very difficult to answer questions without knowing the full information.

TD3 can be a bit iffy with training and could also not converge but if no changes were made to the code I would expect it to work with at least some random seed. 10 epochs might still not be enough though to evaluate the performance. But if longer training does not help, you could try to use different learning rates.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants