Interactive Usage on the GPU nodes

Grab a GPU node with qlogin:
[abc123@login ~]$ qlogin -q gpu.q
[abc123@login ~]$ module load caffe

Copy the caffe examples and data directories to your data directory:

[abc123@login ~]$ cd /work/abc123
[abc123@login abc123]$ mkdir caffe_workdir && cd caffe_workdir
[abc123@login caffe_workdir]$ cp -a /cm/shared/apps/caffe/1.7.0/{examples,build,data,python} .

Run the following commands to prepare the datasets and train the model:

[abc123@login caffe_workdir]$ ./data/mnist/
[abc123@login caffe_workdir]$ ./examples/mnist/
[abc123@login caffe_workdir]$ ./examples/mnist/

At this point, you will see a lot of output from the training process. Near the end, you will see two messages:

Snapshotting to binary proto file
Snapshotting solver state to binary proto file

Once finished we can plot the models:
[abc123@gpu02 caffe_workdir]$ ./python/ ./examples/siamese/mnist_siamese.prototxt ./examples/siamese/mnist_siamese.png
Drawing net to ./examples/siamese/mnist_siamese.png
[abc123@gpu02 caffe_workdir]$ ./python/ ./examples/siamese/mnist_siamese_train_test.prototxt ./examples/siamese/mnist_siamese_train_test.png
Drawing net to ./examples/siamese/mnist_siamese_train_test.png

-- AdminUser - 09 Apr 2018
Topic revision: r2 - 10 Apr 2018, AdminUser
This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback