Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error training controller with rank #13

Open
AliBaheri opened this issue Oct 16, 2018 · 2 comments
Open

Error training controller with rank #13

AliBaheri opened this issue Oct 16, 2018 · 2 comments

Comments

@AliBaheri
Copy link

Thanks for sharing this great repo.
I am receiving the following error when I run the controller module.
In fact, the previous suggestion from @davidADSP (in one of the closed issues) did not help me.
Any other suggestion for the following error:

Traceback (most recent call last): File "05_train_controller.py", line 521, in if "parent" == mpi_fork(args.num_worker+1): os.exit() File "05_train_controller.py", line 491, in mpi_fork subprocess.check_call(["mpirun", "-np", str(n), sys.executable] +['-u']+ sys.argv, env=env) File "/usr/lib/python3.6/subprocess.py", line 291, in check_call raise CalledProcessError(retcode, cmd) subprocess.CalledProcessError: Command '['mpirun', '-np', '2', '/home/baheri/.virtualenvs/worldmodels/bin/python', '-u', '05_train_controller.py', 'car_racing', '--num_worker', '1', '--num_worker_trial', '1', '--num_episode', '4', '--max_length', '1000', '--eval_steps', '25']' returned non-zero exit status 1.

Thanks.

@waiyc
Copy link

waiyc commented Jun 6, 2019

@AliBaheri did you managed to solve this error? I can only train the controller without using GPU which is pretty slow in terms of training time.

@davidADSP
Copy link

I have updated the codebase to run with TF2.2 - could you retry with the new codebase?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants