-
Notifications
You must be signed in to change notification settings - Fork 481
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Install on Ubuntu 14.04 uses the default OpenBlas library -> Low Performance #121
Comments
See relevant discussion here: |
Hi CDLuminate, your proposal might be an idea to build from source via the package mechanism. But I think that openblas is currently build without openmp: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=684344 a) Compiling it with openmp which fits to torch (Needs different compile switch) Friedrich |
It seems that the debian package of OpenBLAS is compiled with
|
In Ubuntu 16.04, |
@nagadomi if you send a pull request to install-deps.sh with whatever your preference is, i will merge. |
Now I totally agree with recompiling OpenBLAS locally (for Debian and Ubuntu) for sake of performance. I gave a similar hint in the Debian caffe package guide draft. BVLC/caffe#2601 |
Hi,
i installed torch via torch/distro on a Amazon EC2 g2.2xlarge machine with Ubuntu 14.04. Instead of the default luajit, I used lua51 because my test example crashed on MacOS due to memory problems for luajit. Then I checked the performance for this cifar10 net: https://github.com/szagoruyko/cifar.torch
With the default Ubuntu installation I get for the test on the CPU (not GPU)
a steptime of 31s. The steptime on my Macbook Pro i5 is approx. 8s. So this is really bad.
With
the steptime is reduced to approx. 10s. So there is much communication which reduces overall performance.
Then I installed a local OpenBLAS library via git as described in install.sh for other systems. I linked torch agains this version. The resulting performance is 3.7s steptime for using 8 cores. (4 Cores is the same...)
With
which uses the GPU, the resulting steptime is 1679ms.
The reported steptime for a GTX 980 was 700ms: http://torch.ch/blog/2015/07/30/cifar.html just for comparison.
So I think it is really important to compile and install the OpenBLAS library with
but the current install.sh and install-deps uses the default system openblas version for Ubuntu 14.04.
My proposal is to compile OpenBLAS also for Ubuntu 14.04. Doing this change will improve the CPU performance from 31s per step to 3.5s per step. Was there any reason not to do this as this is done for other distros?
Friedrich
The text was updated successfully, but these errors were encountered: