About Shamu cluster Shamu is UTSA's primary high performance (HPC) resource; heavily utilized by researchers, students, faculty and staff from a broad range of di...
CUDA is a parallel computing platform and programming model developed by NVIDIA for general purpose computing on GPU devices. CUDA application can dramatically sp...
Here is a Slurm job script for submit a job with checkpoint feature: #!/bin/bash# Put your SLURM options here#SBATCH partition=defq # change to proper par...
Cuda Cuda is a GPU programming language developed by NVIDIA. Currently Shamu contains two (4) very high end GPU nodes consisting of eight (8) Telsa K80 GPU comput...
TensorFlow CPU Version First, grab a compute node with srun and start a Python Virtualenv environment: abc123@login 0 0 ~ $ srun n 80 N 1 time=48:00:00 pty b...
Parallelize Deep Learning Models Across Multiple GPU Devices Deep Learning models written in Tensorflow can automatically take advantage of a GPU device on a comp...
Fiji Fiji is an image processing packageāa "batteries included" distribution of ImageJ, bundling a lot of plugins which facilitate scientific image analysis. In o...
Getting Started with Shamu Shamu is the Research Support Groups premier cluster consisting of many compute cores and GPU cores. Connecting from Windows environme...
Grid Engine Job Submission In order to use the Grid Engine commands you first need to load the "sge" module: abc123@login 0 0 abc123 $ module load sge You can su...
HOOMD blue is a general purpose particle simulation toolkit. It performs hard particle Monte Carlo simulations of a variety of shape classes, and molecular dynami...
MXnet Apache MXNet is a modern open source deep learning framework used to train, and deploy deep neural networks. It is scalable, allowing for fast model trainin...
Monitoring your jobs Using the squeue Command Check the status of all jobs on Shamu using the squeue command (this is just an example compute names may be differ...
Checkpoint and Restart Checkpointing is the action of saving the state of a running process to a checkpoint image file. Restart is the actions to resume the check...
ParaView is an open source, multi platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data usin...
Partition Definitions There are eight (8) Slurm partitions on Shamu that users are able to utilize. Below are their definitions. * defq This is the main queu...
R How to add packages to your local repository Depending on the packages you want installed, you can do one of two things within R. First we load the R module an...
Running jobs on Shamu * Submitting your Batch Job Submit your code to run on the compute nodes. Create a submission script and specify options for better...
We have the GPU version of TensorFlow installed on the GPU nodes with the Python 3.6.1 module install (native Python 2.7 version is currently not working). This d...
Torch Torch is a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to...
Slurm (Simple Linux Utility for Resource Management) is a highly configurable workload manager and job scheduler for an HPC cluster. It is an open source software...
The GPU resources on Shamu have been configured as a "consumable resource". This means you can request individual GPU cards, nodes, etc... We have two types of GP...
Shamu OnDemand Portal We recently introduced the OnDemand Portal which allows you as a user to connect to Shamu via a web browser instead of an SSH client. The On...
http://ccwu.me/vsfm/ VisualSFM is a GUI application for 3D reconstruction using structure from motion (SFM). The reconstruction system integrates several of my pr...